Gene Ndas_2109 details


Gene Information

Locus tag: Ndas_2109
Symbol:
ID: 9245959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2528078
End bp: 2529505
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: sugar transporter
Protein accession: YP_003680040
Protein GI: 297561066
COG category:
COG ID:
TIGRFAM ID:
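
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is only an illustrative check, not part of the record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2528078, 2529505)
print(length)                     # 1428
print(protein_length_aa(length))  # 475
```

Both values agree with the Gene Length and Protein Length fields listed in the record.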


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.197372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGAG CCAGACAGGC CGGGGACGAG TCGGTTCCGG CGGGGGAGGG CAACATCGTC 
CACGTGACGA TGATCGCCGC GGCCGCGGCG ATGGGCGGTT TCCTGTTCGG CTACGACAGC
GCGGTCATCA ACGGAGCGGT ACCCGCCATC CAGGAGTACT TCGGAGTGGG CCCCGCCACG
CTGGGCTTCA CGGTCGCCGC CGCGCTCCTG GGCTGCGTGG TGGGCGCCGC GGTCGCGGGG
GCCCTGGCCG ACCGCCTCGG CCGCATCCGC ACCATGCAGA TCGCCGGTGT GCTGTTCGCG
ATCAGCGCCG TCGGCTCGGC GCTGCCGTTC AACGTGTGGG ACCTGACCGC CTGGCGGATC
CTGGGCGGTG TCGCCATCGG CCTGGCCTCG GTGATCGCCC CGACCTACAT CGCAGAGGTG
TCGCCCGCGG CCTACCGCGG CCGCCTGGCG TCGTTGCAGC AGCTGGCCAT CGTGCTGGGC
ATCGCCGCCT CGCAGCTGGT CAACTACGGC ATCGCCCAGA TGGCCGACGG CACCGCGAGC
GGCATGCTGG GGCCGATCCA GGCCTGGCAG TGGATGCTGG GCGTCGAGGT CCTGCCCGCC
CTGGTCTACC TGGGGCTGAG CGTGCTCATC CCCGAATCTC CCCGCTACCT GGTGCGCGTG
GGGCAGACCG AACGCGCCCG CCGCATCCTG GCCGACGTCG AGGGCGGCGG AGCCGAGCGG
GTGGACAAGC GCATCGGGGA GATCCGCGAG GCGCTGGGCT CGGAGGTCCG GCCCAGGCTG
AGCGACCTGA CCGGCCGCTA CGGTCTGCTG CCCATCGTGT GGATCGGCAT GGCCGTCTCG
GCGTTCCAAC AGCTGGTCGG GATCAACGTC ATCTTCTACT ACTCCAGTTC GCTGTGGCAG
TCGGTGGGGG TGGAGGAGTC GGCCTCGCTG CTGCTGAGCC TGTTCACCTC CATCGTGAAC
ATCGTGGGTA CGTTCGTGGC GATCCTGCTG GTGGACCGGG TCGGCCGCAA GCCGCTGCTG
CTGGTGGGCT CGGCCGGGAT GACGGTGGCG CTGGCGCTGG CCGCCTACGC CTTCAACCAC
GCGGTGGTGC GGGGCGAGGA GGTGACGCTG TCGTTCGGCT GGGGCGCGGT GGCGCTGACC
GCGGCCAGCC TGTTCGTGCT CTTCTTCGCG CTGTCGTGGG GCGTGGTCGT GTGGGTGCTG
CTGGGGGAGA TGTTCCCGCT GCGCATCCGT GCCGCGGCGA TGGGCGTGGC CACCGCGACC
CAGTGGCTCA CCAACTGGCT CATCACCGTG AGCTTCCCGA GCCTGCGCGA CTGGAGCCTG
AGCGGCACGT ACCTGATGTA CGCGTTCTTC GCGCTGGTGT CGTTCTTCTT CGTGCTGAGG
TTCGTGAAGG AGACCCGCGG CAAGACCCTG GAGGAGATGC GGGGCTGA
 
Protein sequence
MSGARQAGDE SVPAGEGNIV HVTMIAAAAA MGGFLFGYDS AVINGAVPAI QEYFGVGPAT 
LGFTVAAALL GCVVGAAVAG ALADRLGRIR TMQIAGVLFA ISAVGSALPF NVWDLTAWRI
LGGVAIGLAS VIAPTYIAEV SPAAYRGRLA SLQQLAIVLG IAASQLVNYG IAQMADGTAS
GMLGPIQAWQ WMLGVEVLPA LVYLGLSVLI PESPRYLVRV GQTERARRIL ADVEGGGAER
VDKRIGEIRE ALGSEVRPRL SDLTGRYGLL PIVWIGMAVS AFQQLVGINV IFYYSSSLWQ
SVGVEESASL LLSLFTSIVN IVGTFVAILL VDRVGRKPLL LVGSAGMTVA LALAAYAFNH
AVVRGEEVTL SFGWGAVALT AASLFVLFFA LSWGVVVWVL LGEMFPLRIR AAAMGVATAT
QWLTNWLITV SFPSLRDWSL SGTYLMYAFF ALVSFFFVLR FVKETRGKTL EEMRG
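
The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short Python sketch. It is not the method used to produce this record. The helper names `translate_cds` and `gc_percent` are hypothetical, the codon table is built from the standard amino-acid ordering shared by NCBI tables 1 and 11, and the set of alternative start codons is an assumption based on the published definition of table 11. Input is assumed to be uppercase DNA with spaces already removed.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed table 11 initiation codons; as a start codon each is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at '*'."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence above, with spaces removed.
first_60 = "GTGAGCGGAGCCAGACAGGCCGGGGACGAGTCGGTTCCGGCGGGGGAGGGCAACATCGTC"
print(translate_cds(first_60))  # MSGARQAGDESVPAGEGNIV
print(round(gc_percent(first_60)))
```

The leading GTG codon is read as methionine because it is the initiation codon, which is why the protein sequence starts with M rather than V.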