Gene Dgeo_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1189 
Symbol 
ID4058805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1261508 
End bp1263085 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content61% 
IMG OID641230204 
Productextracellular solute-binding protein 
Protein accessionYP_604655 
Protein GI94985291 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0440454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TGCTCCTGAC CGCCCTGCTC GCCTCCCTCC CGACCGCCGG AGCCGCCACG 
CTGGTCTTCG GCAACAACGG TGATCCCGTG AGCCTCGAAT CCGGCAACAT CACGGACGGC
ATCAGCATCG CGGTGCAGCG TCAGATCTAT GACACCTTGG TCGACTTCAA GGACGGCACG
ACCGAGCCAG TCCCTGGCCT GGCGACGAGC TGGAAGGCCA ACAAGGACGC GACCCAGTGG
ACCTTCACGC TGCGCAAGGG CGTCAAATTC CAAGACGGTA CCCCCTTCAA CGCGGACGCC
GTGATCTTTA ACGTCAACCG CTGGTGGGAT CCCAAGAATG CCTATGGCTA CCGCGACCAG
GGTCATACCT ATGAGATCTG GGGCCAGCTG ATGGGAGGCT ACAAGGGCGA CGCCACCTCC
ATCCTTAAGA ACGTGGTGAA GCTCGACGAC TACACCGTGC GCTTCGAGAT GAATAAGCCC
TCCACGGTGT TCCCCAGTGT GATTGGGTCG GGGTATTTCG GCATCGCCAG TCCGGCGGCG
ATCAAGAAAG ACGGGGCCAA GTACGGCACG CCCGCCAGCA AGCCGGTCGG CACCGGTCCA
TTTATCTTCC AGAGCTGGAA GACCGGGGAC CGCATCGTCC TGCTGCCCAA CAAGCTGTAC
TGGGGCACCA AGCCCAAGGT GGACCAGCTG GTGATCCGCT CGATCAAGGA CGCCTCGCAG
CGCCTGAACG AACTCAAGGC CGGGACCATC GACTTTGCCA ATGACCTGAC ACCCGACAGT
CTCAAGGCGG TGCAGGCCGA CAAGAACCTG GTGGCGGTCA AGCGGCCCTC TTTCAACGTG
GGCTTCGTCA GCCTGAATAA CCGCAACCCG TACCTCAAAA ACGACAAGGT GCGGCAGGCG
ATCAGCATGG CGATCAACAA AAAGGCGATT GTTGAGGCCT TCTGGCCGGG GCTGGGCATC
AGCAACGCGA GCTTCTTGCC ACCGGTGCTG AGCTGGGCCA ACTCCAAGAA CGTGCCCGCC
GACTACAAGT ACGATCCGCA GGCGGCCAAG AAGCTGCTCG CAGATGCCGG GTACCCCAAC
GGCTTCTCTG TCGACCTGTG GTACATGCCG GTCAGCCGCC CCTACTTCCC GCAGCCCAAA
CCCATCGCGG AAGCCATCGC CGCCGACCTC AGCGCGATCG GCATCAAGGT GAACCTCAAG
ACCGAAGACT GGGCCAAGTA CTTGGAAGAT CGCCGCAAAG AACCCGGCTT TGACATGTAC
ATGATCGGCT GGACGGGCGA CTACGGCGAC CCCGATAACT TCTACAGTGC CTACTACGGA
CCGGGCGGTT CGGACGACAT CAACTGGAAC CCCCCGCAGC TCGAGAAGTT GCTGGAGCAG
GGCCGCGCTG CGGTGAGTCA GGCCGACAAG GCCAAAGCCT ACAGCCAGAT TCACGAGATC
ACCTACAAGG CGAACTACCG CATTCCGATG GTCCACAGCC AGCCGCTGGC CGCCGCGCGC
ACCTACGTGA AGGGCTGGGT GCCCAGCCCG CTGGGTAGCG AAGCATTCAA CACCATCAGC
GTCGTCGGCA AGAAATAA
 
Protein sequence
MKKLLLTALL ASLPTAGAAT LVFGNNGDPV SLESGNITDG ISIAVQRQIY DTLVDFKDGT 
TEPVPGLATS WKANKDATQW TFTLRKGVKF QDGTPFNADA VIFNVNRWWD PKNAYGYRDQ
GHTYEIWGQL MGGYKGDATS ILKNVVKLDD YTVRFEMNKP STVFPSVIGS GYFGIASPAA
IKKDGAKYGT PASKPVGTGP FIFQSWKTGD RIVLLPNKLY WGTKPKVDQL VIRSIKDASQ
RLNELKAGTI DFANDLTPDS LKAVQADKNL VAVKRPSFNV GFVSLNNRNP YLKNDKVRQA
ISMAINKKAI VEAFWPGLGI SNASFLPPVL SWANSKNVPA DYKYDPQAAK KLLADAGYPN
GFSVDLWYMP VSRPYFPQPK PIAEAIAADL SAIGIKVNLK TEDWAKYLED RRKEPGFDMY
MIGWTGDYGD PDNFYSAYYG PGGSDDINWN PPQLEKLLEQ GRAAVSQADK AKAYSQIHEI
TYKANYRIPM VHSQPLAAAR TYVKGWVPSP LGSEAFNTIS VVGKK