Gene Dgeo_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2122 
Symbol 
ID4058857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2234162 
End bp2235883 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content61% 
IMG OID641231162 
Productextracellular solute-binding protein 
Protein accessionYP_605585 
Protein GI94986221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.973923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TTGCTCTGCT CAGCACCCTT CTTCTCGCTG GCGCGGCCTT TGCCGCTGCC 
CCCAAAGACA CGCTGGTGGT CCAGCAGGCT GCCGATATCC CCACCCTTGA CCCCGGCGCG
ACCTACGACA CCGCCTCGGG TTCAGTGGTG GAGAACATCT ACGAGACGCT GGTGACCTAC
AAGGGGAGTA GCCTGCGTGA CCTGGAGCCG CTGCTGGCCA CCAAGTGGAC CATCAGCAAT
GGTGGCAAGA CCTACACCTT TGATCTGCGC AAGAACGTCA AGTTCCACTC CGGGAATCCC
TTCACCTGCG CTGACGCCGA GTACACCTTC GAGCGCAACC TGGTCACCAA CTCTGCCGAG
TCGGGCAATT GGTTCCTGGC TGAGAGCCTG CTGGGCACCG GCAGCAACGC CAACGATGAC
AAGTCGATCA CCTGGTCCAA GATCGACAGG GCGGTCGAGT GTAACAATCA GGGCCAACTC
GTCTTTAACC TGCCCAAGCC CGACCCGGCC TTCCTCGCCA AGCTGGCCTT CCCGGGTCAG
AGCATTGTGG ACAAGAACTG GGCCATCAAG CTTGGAGAGT GGAGCGGCAA AGAGGCTGAC
TGGAAGAGCT GGGTGGGCAA GGATCTCCAG GGCAGCAAGC TGAATGCTCA ACCCAGCGGC
ACCGGCGCCT ACCGTCTGGT GCGCAAGGAC GCCAATGCCA CGCTGGCCCA GGCCTTCGAC
GGCTACTGGG GCAAGAAGCC TGCCATCAAG AATGTGATTC TTCAGAAGGT GCCCGAACTG
GCTGCCCGCC AGCAGGCCTT CTTGCGCGGT GACGCCGATC TGATTGAGGC GGGTACCCGC
GCCAACGTTG AAGAGCAGCT CAAGGGCAAG CCCGGTGTGG TGGTCGTGGA CGGCCTGCCC
AACACCACGG CGACGGCGAT CTTCATGAAC GAGAACATCA AGGATCCTGC GCGCCTGGGC
AGCGGCAAGC TGGACGGCCA GGGGATTCCC GCCAACTTCT TCAGCGACGT CAACGTGCGG
CGCGGGTTCT CCTATGCCTT CAACTACGCC CAGTACATCA GTGATGTGCT GAAGGGCAAG
GGCAAGCAGC GCACCATGCT GCTCCCCGAC TCCTTCCCCG GCTATGACGC TAAGGTCAAG
ACCTACAAGT ACGACCCAGC GCAGGCCAAG GCCTACTTCC AGCGCGCCTG GGGCGGCCAG
CTCTGGAAAA ATGGCTTTAC CCTGAATGTG GCGTACCGCG CGGGGAGTGT GGGCGCGCAG
ACTGCGATGG AAATTCTGAA GAAGAACATC GAGTCTCTCA ATCCCAAGTT CCGCGTGAAC
ATCCAGGCCA AGGAGTGGTC GGCCATGCTG AATGACTCCA AGGCAGGGAA GGAGCCGATG
ATCATTCTCG GCTGGGCGCC GGACTACGCG GACCCCGACA ACTTCATGTA CACCTTCTAC
TCCAGCAACG GGTACTACTA CCCCCGCAGC AATTGGAAGG ACGCGACCGT TGACAAGTGG
CTGGAGCAGG CCCGCAACAC CACCAACACG GCCGAGCGTA ACCGCCTCTA CAGCCTGGTG
GGTCAGCGCG CCTACGAGCA GGCGCCCTTT ATCCTGGTGC CGGCGGGCGT CGGCTTGAAC
GTGCAGCGCA GCAATCTGGT CGGCGCAACG GCGCAGACCT TCAACCCCAT GATCTCGTTC
AGCTACACCG GGACCTTCTG GAAGGATCTC AGCAAGAAGT AA
 
Protein sequence
MKKIALLSTL LLAGAAFAAA PKDTLVVQQA ADIPTLDPGA TYDTASGSVV ENIYETLVTY 
KGSSLRDLEP LLATKWTISN GGKTYTFDLR KNVKFHSGNP FTCADAEYTF ERNLVTNSAE
SGNWFLAESL LGTGSNANDD KSITWSKIDR AVECNNQGQL VFNLPKPDPA FLAKLAFPGQ
SIVDKNWAIK LGEWSGKEAD WKSWVGKDLQ GSKLNAQPSG TGAYRLVRKD ANATLAQAFD
GYWGKKPAIK NVILQKVPEL AARQQAFLRG DADLIEAGTR ANVEEQLKGK PGVVVVDGLP
NTTATAIFMN ENIKDPARLG SGKLDGQGIP ANFFSDVNVR RGFSYAFNYA QYISDVLKGK
GKQRTMLLPD SFPGYDAKVK TYKYDPAQAK AYFQRAWGGQ LWKNGFTLNV AYRAGSVGAQ
TAMEILKKNI ESLNPKFRVN IQAKEWSAML NDSKAGKEPM IILGWAPDYA DPDNFMYTFY
SSNGYYYPRS NWKDATVDKW LEQARNTTNT AERNRLYSLV GQRAYEQAPF ILVPAGVGLN
VQRSNLVGAT AQTFNPMISF SYTGTFWKDL SKK