Gene Dgeo_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0174 
Symbol 
ID4058420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp162924 
End bp163826 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content63% 
IMG OID641229172 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_603646 
Protein GI94984282 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.335046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG TCCTCTGTCT CTCACTGGCT GTGCTGATGG GCAGCGCCGC CGCCAAACCT 
ATTGTGGTGG GCAGCAAGCT CGACCCCGAG GCCCAGATTC TCGGGCAGAT GATCGTCTTG
ACGCTGCGCA ATGCCGGGCT GGAGGTGAGC GACAAGACCA ATCTGGGTGA TACCGGCGTG
AACCGCAAGG CGATTCTGGC GGGTGAGATC GACGTGTACC CCGAGTACAC CGGCAACGCG
GTGTATCTCT TTCCACAGGC CAAGATCAGC GCCAAGGATG CAGGGAATCC CGGCAAAATC
TACGGGTATG CCCGGCAGCT CGACGCCAAA AACGGCATCA CTTGGCTGAA GCCGGCCAAC
GTCAACAACA CCTGGGTGAT CGCCGTGCCG CAAGCGCTGG CACAACGGGA AAAGCTGAGC
AGCGTGGCCG ACCTGGCTCG CTACCTCCAG GCGGGGGGCC GCTTCAAGAT CGCCGGGAGT
CCCGAGTTCT TTAACCGCCC GGACACCATG CCCGCCTTTG AGGCTGCCTA CGGCTTCAAG
CTGCGACCCG ACCAGAAGCT GGTGTTGGCT GGGGCCACGC CGCCACAGAC GCAGCAGGCG
GCCGCCAACG GCACCAACGG TGTGAATGCT GCGATGGCCT ACGGCACAGA CGGCACCCTG
GCCGCGCTCA AATTGGTGGC CCTCAAAGAT CCCAAGGGCG CACAGGCGGT CTATCAGCCT
GCCCCGATCA TCCGCAGCGA GGTGCTCCAG GCTCACCCGG AGATCGGGAC ACTGCTCAAC
AAGACCTTTG CCACGCTCAC GCAAGCGGGG CTGCAAAGGT TAAATGCCCA GGTCGCGCTC
GAAGGCCGCA CCGCGCAGGA GGTGGCCCAA AGCTACCTCA AGAGCAAGGG GCTGATCAAG
TGA
 
Protein sequence
MSKVLCLSLA VLMGSAAAKP IVVGSKLDPE AQILGQMIVL TLRNAGLEVS DKTNLGDTGV 
NRKAILAGEI DVYPEYTGNA VYLFPQAKIS AKDAGNPGKI YGYARQLDAK NGITWLKPAN
VNNTWVIAVP QALAQREKLS SVADLARYLQ AGGRFKIAGS PEFFNRPDTM PAFEAAYGFK
LRPDQKLVLA GATPPQTQQA AANGTNGVNA AMAYGTDGTL AALKLVALKD PKGAQAVYQP
APIIRSEVLQ AHPEIGTLLN KTFATLTQAG LQRLNAQVAL EGRTAQEVAQ SYLKSKGLIK