Gene Ndas_5168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5168 
Symbol 
ID9249061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp310241 
End bp311815 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content73% 
IMG OID 
ProductNCS1 nucleoside transporter family 
Protein accessionYP_003683054 
Protein GI297564081 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.409771 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCACA GCCCGACCCC GGCGGAACCC GCCGCCGTCA CCCACCCCGA CGGCCGGGTG 
TCCCTCGCCG AGGGGACCTC CCTGCCCGAG GGCCCCTACG TCAACGCCGA CCTCCAACCC
GTCCCGATGT CCCGGCGCAC CTGGGGCACC GGCAGCTTCG GCGCCCTGTG GGTGAGCATG
TCGGTCAGCA TCCCGGCCTG GACCCTGGCC AGCGGCCTGA TCGCGGCGGG CATGGACTGG
CGCCAGGCCA TGCTCGCCGT GGTCCTGGGC AACCTCGTCG TCCTGCTGCC CATGGTGCTG
ACCGGCCACG CGGGCGCCAG GTACGGCATC CCCTTCCCCG TGTTCGCGCG CGCCTCCTTC
GGCCTGCGCG GCGCCAACCT GCCCGCGCTG CTGCGCGGCG CGGTCGCCTG CGGCTGGTAC
GGCATCCAGA CCTGGGTCGG CGGCCAGGGG GTGTACATCC TGCTCGGGCG GCTGCTCGGC
GACGGGTGGA CGGGGTCGGC GGCGCTGGGC GGCCAGCCGT GGACGCTGTG GCTGTCCTTC
GGCCTGTTCT GGGTCGCCCA GCTCGCGATC ATCCTGTGGG GCATGGAGGG CGTGCGCCGG
ACCCAGGTGT GGGCCGCGCC GCTGATGATC CTCGGCGGCG TCGCGCTGCT GGCCTGGATG
GCCGTGGAGG CCGGGGGCCT GGCCCCGATG CTGTCCCTGG ACTCGGGCGA GCCCCTGGAC
TGGGGGCCCT CCTTCTGGGC GCTGTTCTTC CCGTCGCTGA TGGGCGTGAT CGGCTACTGG
GCGACCCTGA CCCTCAACAT CAGCGACTTC ACCCGCTTCT CGTCCTCCCA GCGCGCCCAG
GTGGTGGGCC AGACGCTGGG CCTGCCCACC ACGATGACGC TGTTCTCGCT GCTGGCCGTC
ATGGTCACGG CGGGCACCGC GGCCGTCTAC GGCGAACCCC TGTGGAACCC GATCGACGTC
GTGGCGCGGA TGGACAGCGG GATCGGCCTG CTCTTCGCGG TCTTCGTCGT GCTGCTGGCC
ACGGTCTCCA CCAACATCGC GGCCAACCTG GTCGGTCCGG CCTACGACCT GTCCAACCTC
AGGCCCCGGC TGATCAGCTT CCGCGCCGGG GCGATCACCA CCTGCGTGCT CAGCGTGCTG
ATCATGCCGT GGCGGCTGCT GGAGAACGAG AGCGTCTACA TCTTCACCTG GCTGGGCACG
GTGGGCGGCC TCCTGGGCAC CGTGGCGGGC GTCCTGCTCG CCGACTACTG GCTGCTGCGC
CGCACGCGGA TGAACCTGCC CGCGCTGTAC GAGCGCGGCT CGGAGTACTG GTACCGGCAC
GGGTGGAACT GGCGCGCCCT GGTGGCCTTC GGCGTCGGCT CGGTGCTGGC CGTGGGCGGT
TCGCACTCCC CCGAGGGGTC GGGCCCCTTC CCGGCCGAGG GTCTGGTCCC GTTCCTGGCG
CCGCTGGCGG ACTACGGGTG GCTCGTGGGG CTGGCCAGCG GCCTGCTGCT GCACTGGGGC
CTGGGCGTGC TCCTGCCCCA CCGGGACGCG GGGGAGCGGG CCGCGAGGAG GACGGAGGGG
GCCGCCGCCG GCTGA
 
Protein sequence
MTHSPTPAEP AAVTHPDGRV SLAEGTSLPE GPYVNADLQP VPMSRRTWGT GSFGALWVSM 
SVSIPAWTLA SGLIAAGMDW RQAMLAVVLG NLVVLLPMVL TGHAGARYGI PFPVFARASF
GLRGANLPAL LRGAVACGWY GIQTWVGGQG VYILLGRLLG DGWTGSAALG GQPWTLWLSF
GLFWVAQLAI ILWGMEGVRR TQVWAAPLMI LGGVALLAWM AVEAGGLAPM LSLDSGEPLD
WGPSFWALFF PSLMGVIGYW ATLTLNISDF TRFSSSQRAQ VVGQTLGLPT TMTLFSLLAV
MVTAGTAAVY GEPLWNPIDV VARMDSGIGL LFAVFVVLLA TVSTNIAANL VGPAYDLSNL
RPRLISFRAG AITTCVLSVL IMPWRLLENE SVYIFTWLGT VGGLLGTVAG VLLADYWLLR
RTRMNLPALY ERGSEYWYRH GWNWRALVAF GVGSVLAVGG SHSPEGSGPF PAEGLVPFLA
PLADYGWLVG LASGLLLHWG LGVLLPHRDA GERAARRTEG AAAG