Gene Dgeo_0173 details


Gene Information

Locus tag: Dgeo_0173
Symbol:
ID: 4058419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 161681
End bp: 162874
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 70%
IMG OID: 641229171
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_603645
Protein GI: 94984281
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1174] ABC-type proline/glycine betaine transport systems, permease component
TIGRFAM ID:
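
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 161681
END_BP = 162874


def gene_length(start, end):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(nt_length):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # prints: 1194 bp
print(length_aa, "aa")  # prints: 397 aa
```

Both results match the Gene Length and Protein Length fields of the record.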


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.348533
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGGTCG CCGGACAGCA GGAGGCGAGC TGGGCTGTCG CCCCCGCGAC TGTCTCGGGA 
ACGAACACGG TTCTGATGCT GGCCATGCTG CCGATGCTGC TGGGCGCGCT GCTGCCCTGG
GTGCTCCTGC GACCCAACCG CCTCGCACCG GGCACGTATC TGCACCTGCC ACCTGCTCTG
GTGGGTGCGG CGCTGCTGCT GGCTGTATTG CCGCTCCTCA CGGCCCGCTT CACTCCACGT
CTAACCTGGC TGGCTGCGGC CCTGGCAGTT GTGGCAGGGT TCTGGTGGCT GGGGAGTCAG
ACTCGGGCAG CACTGCTGAA CCAGCTTCCC TTTGCGCGGG CGAGTGCCGC CAGTGGGGTA
TGGCTGTTTT TATTAGGCGC AGGCGTCGCC GCCTACGGAG CGGGTCTGCT GAGCCGGTGC
CGTGCGGAAC GCTGGCTCGC TTGGGCCTGG CTGCCTGTGG TCACAGTCCT GTTCCTATCC
GGGCATCTGA ACGCCTGGTC GGTCCTGGTT GAGGGCCGCA ACGAGGGACC GCGCTGGGTG
CAGGAATGGG TGCAGCACCT GCGGCTGGTG GGGGAAGGGC TGGGCCTAGC CTTGCTGATC
GGCGCGCCGC TGGCGGTGTG GGCCACGGGC CGCGAACGGG TGGCCAGCGC AGTGCTGGGC
GCAGCAAATG CCGTCCAAAC CCTGCCCAGT CTCGCCCTGT TGGGCCTTCT CATTGCGCCG
CTGGCAGCCC TGGCGAATGC CTTTCCAGGC CTGCGGACCC TGGGCATAAG CGGCATCGGC
GTGGCTCCAG CGCTCACCGC CCTGACGCTG TATGCGCTGC TGCCGATCCT GCGCAATGGT
GTGGTGGCGC TGCGGGGCGT GCCAACCGGT GTGGTCGACG CCGCACGCGG CATGGGCATG
ACACCGGCGC AACTGTTCTG GCGGGTACGG CTGCCCCTGG CTTTGCCGGT CTGGCTCAGT
GGCATCCGGC AGGCCGCCGT GCTGCTGGTG GGCGTGGCGG CGGTGGCCGC CCTGATTGGC
GCTGGGGGAT TGGGCACCTA CATCTTCAAG GGCCTCCAAA GTGCCGCCGC CGACCTGATC
CTGCTCGGTG CGGTGCCCGC TGCCCTGCTC GCCCTAGGGC TGGACGCCGC CCTGCGCGGG
CTGGAGAGGC TGCTGGGGCA ACGGCTGGGC AGCGTGCAGG GTAGGGTCGC GTGA
 
Protein sequence
MRVAGQQEAS WAVAPATVSG TNTVLMLAML PMLLGALLPW VLLRPNRLAP GTYLHLPPAL 
VGAALLLAVL PLLTARFTPR LTWLAAALAV VAGFWWLGSQ TRAALLNQLP FARASAASGV
WLFLLGAGVA AYGAGLLSRC RAERWLAWAW LPVVTVLFLS GHLNAWSVLV EGRNEGPRWV
QEWVQHLRLV GEGLGLALLI GAPLAVWATG RERVASAVLG AANAVQTLPS LALLGLLIAP
LAALANAFPG LRTLGISGIG VAPALTALTL YALLPILRNG VVALRGVPTG VVDAARGMGM
TPAQLFWRVR LPLALPVWLS GIRQAAVLLV GVAAVAALIG AGGLGTYIFK GLQSAAADLI
LLGAVPAALL ALGLDAALRG LERLLGQRLG SVQGRVA
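
The protein sequence can be related to the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own method. It assumes that table 11 shares its amino acid assignments with the standard genetic code and that its initiator codons are TTG, CTG, ATT, ATC, ATA, ATG and GTG, any of which is read as methionine in the first position. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An alternative start codon such as GTG still initiates with Met.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGGGTCG CCGGACAGCA GGAGGCGAGC"))  # prints: MRVAGQQEAS
```

The printed residues match the first ten residues of the protein sequence above. The gene begins with GTG rather than ATG, which is why the first codon is handled separately.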