Gene Dgeo_2821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2821 
Symbol 
ID4074050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp206750 
End bp208036 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content59% 
IMG OID641228659 
Productextracellular solute-binding protein 
Protein accessionYP_594324 
Protein GI94972284 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.858803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGCC TGACTCTGCT GTTTGCCCTG CCCGGTCTAC TGGTCGCTGT GCCTGCCCAT 
GCCCAGAGTG TCGACTGGAG CGCCTGTAAG GGCACAACCC TGCGTGTGCT GCTCAACCAG
CATCCCTGGA CAACCGCGAT GCAGCCCGCC TTTCCAGAAT TTGAAAAGCT GACCGGGATG
AAGCTGGCGG TAGAAACCTA TCCTGAGGCG CAGTTCCGGC AAAAAGTCTT GGTGGAACTC
TCCACCGGGG GGCAGAATCT CGACGCCTTT ATGCTCTCGC CGGGCCAGGA AGGGCTGCTG
TATGCCCGTA GCGGCTGGAT CGAGGACATG AAAACGTATA TCAACAATAA GAATTTGACT
GCCGGAAACT GGGGCTTTTC TGACTTCTAT CCCTCGGTTG TGCGGTCCAC CGAGTACAAC
GGAATCATGA CGGGTGTGCC GATCCAAACG GAAACACCGA TGCTGTTCTA TCGCAAGGAT
CTATTTACCA AGTACAAGAT TCCAGTGCCC AAGACTATGG CCCAGCTCGA GGCTGCGGCC
AAAGCCCTTC ATGGCAAAGA CGGCGTCTTT GGAATTGCTC TCCGTGGCAA GGGTGCGGCG
GCCACCAGCC AGTTCAGCCC CTACATGTTC TCCTATGGCA GCACTTGGCT GAACAAAGAT
GGCCAAGCCA ACTTTACCGA TCCCAAGTTT GTGCAGGCCA TGACGATGTA CACCGGCTTG
CTGCGCAAGT ACGGCCCGCC AGCTGCCGTG ACGATGAGTT GGCCAGAGGT CACCAACCTC
TTCGCGCAGG GCAAGGTCGC CATGTTCACC GACGCCTCGC TGTTTCGCAG CATTGTGGAC
GATCCCAAAA GCAGCACGGT GGCCGGGAAG GTCGGTTATG CGCCATTCCC CGCTGGACCG
GCCGGACGCA AGCCCTATGT GACCACTTGG GCCCTGAGTA TCCCCAAGGG CAGCAAGAAC
AAGCCGTGCG CTTGGCTGTT TACCCAGTGG GCCACCAACC GCCAAAACCA GTTGCGCGTG
CTGCTTCAGG ATGTGCCCGC TGTACGGCGC AGCGTCTGGA ACGACCCAGC TTTCAAAAAG
CAGGAAACCA ACCCCGAGTG GACCCAGGCT CACCTCAGTC AGTTGGCCAG TGCCAACCCG
CTGTGGAATC CCCCGGTCAG CCAGGTTGGC GAGGTGCGCG ATGCGCTGGG TCAGGCCATC
GTGGGGATCT TGCAGGGCGG TAACACCCTA GACCTGCTCA AGCGTGCCGA GCAGACCACC
AACGCGATCA TCAGCAAGGA AAAGTAA
 
Protein sequence
MRRLTLLFAL PGLLVAVPAH AQSVDWSACK GTTLRVLLNQ HPWTTAMQPA FPEFEKLTGM 
KLAVETYPEA QFRQKVLVEL STGGQNLDAF MLSPGQEGLL YARSGWIEDM KTYINNKNLT
AGNWGFSDFY PSVVRSTEYN GIMTGVPIQT ETPMLFYRKD LFTKYKIPVP KTMAQLEAAA
KALHGKDGVF GIALRGKGAA ATSQFSPYMF SYGSTWLNKD GQANFTDPKF VQAMTMYTGL
LRKYGPPAAV TMSWPEVTNL FAQGKVAMFT DASLFRSIVD DPKSSTVAGK VGYAPFPAGP
AGRKPYVTTW ALSIPKGSKN KPCAWLFTQW ATNRQNQLRV LLQDVPAVRR SVWNDPAFKK
QETNPEWTQA HLSQLASANP LWNPPVSQVG EVRDALGQAI VGILQGGNTL DLLKRAEQTT
NAIISKEK