Gene Information |
Locus tag | Dgeo_1189 |
Symbol | |
ID | 4058805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 1261508 |
End bp | 1263085 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641230204 |
Product | extracellular solute-binding protein |
Protein accession | YP_604655 |
Protein GI | 94985291 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
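The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values listed in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1261508
END_BP = 1263085


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1578 525"
```

Under these assumptions the results match the listed Gene Length (1578 bp) and Protein Length (525 aa).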
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0440454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC TGCTCCTGAC CGCCCTGCTC GCCTCCCTCC CGACCGCCGG AGCCGCCACG CTGGTCTTCG GCAACAACGG TGATCCCGTG AGCCTCGAAT CCGGCAACAT CACGGACGGC ATCAGCATCG CGGTGCAGCG TCAGATCTAT GACACCTTGG TCGACTTCAA GGACGGCACG ACCGAGCCAG TCCCTGGCCT GGCGACGAGC TGGAAGGCCA ACAAGGACGC GACCCAGTGG ACCTTCACGC TGCGCAAGGG CGTCAAATTC CAAGACGGTA CCCCCTTCAA CGCGGACGCC GTGATCTTTA ACGTCAACCG CTGGTGGGAT CCCAAGAATG CCTATGGCTA CCGCGACCAG GGTCATACCT ATGAGATCTG GGGCCAGCTG ATGGGAGGCT ACAAGGGCGA CGCCACCTCC ATCCTTAAGA ACGTGGTGAA GCTCGACGAC TACACCGTGC GCTTCGAGAT GAATAAGCCC TCCACGGTGT TCCCCAGTGT GATTGGGTCG GGGTATTTCG GCATCGCCAG TCCGGCGGCG ATCAAGAAAG ACGGGGCCAA GTACGGCACG CCCGCCAGCA AGCCGGTCGG CACCGGTCCA TTTATCTTCC AGAGCTGGAA GACCGGGGAC CGCATCGTCC TGCTGCCCAA CAAGCTGTAC TGGGGCACCA AGCCCAAGGT GGACCAGCTG GTGATCCGCT CGATCAAGGA CGCCTCGCAG CGCCTGAACG AACTCAAGGC CGGGACCATC GACTTTGCCA ATGACCTGAC ACCCGACAGT CTCAAGGCGG TGCAGGCCGA CAAGAACCTG GTGGCGGTCA AGCGGCCCTC TTTCAACGTG GGCTTCGTCA GCCTGAATAA CCGCAACCCG TACCTCAAAA ACGACAAGGT GCGGCAGGCG ATCAGCATGG CGATCAACAA AAAGGCGATT GTTGAGGCCT TCTGGCCGGG GCTGGGCATC AGCAACGCGA GCTTCTTGCC ACCGGTGCTG AGCTGGGCCA ACTCCAAGAA CGTGCCCGCC GACTACAAGT ACGATCCGCA GGCGGCCAAG AAGCTGCTCG CAGATGCCGG GTACCCCAAC GGCTTCTCTG TCGACCTGTG GTACATGCCG GTCAGCCGCC CCTACTTCCC GCAGCCCAAA CCCATCGCGG AAGCCATCGC CGCCGACCTC AGCGCGATCG GCATCAAGGT GAACCTCAAG ACCGAAGACT GGGCCAAGTA CTTGGAAGAT CGCCGCAAAG AACCCGGCTT TGACATGTAC ATGATCGGCT GGACGGGCGA CTACGGCGAC CCCGATAACT TCTACAGTGC CTACTACGGA CCGGGCGGTT CGGACGACAT CAACTGGAAC CCCCCGCAGC TCGAGAAGTT GCTGGAGCAG GGCCGCGCTG CGGTGAGTCA GGCCGACAAG GCCAAAGCCT ACAGCCAGAT TCACGAGATC ACCTACAAGG CGAACTACCG CATTCCGATG GTCCACAGCC AGCCGCTGGC CGCCGCGCGC ACCTACGTGA AGGGCTGGGT GCCCAGCCCG CTGGGTAGCG AAGCATTCAA CACCATCAGC GTCGTCGGCA AGAAATAA
|
Protein sequence | MKKLLLTALL ASLPTAGAAT LVFGNNGDPV SLESGNITDG ISIAVQRQIY DTLVDFKDGT TEPVPGLATS WKANKDATQW TFTLRKGVKF QDGTPFNADA VIFNVNRWWD PKNAYGYRDQ GHTYEIWGQL MGGYKGDATS ILKNVVKLDD YTVRFEMNKP STVFPSVIGS GYFGIASPAA IKKDGAKYGT PASKPVGTGP FIFQSWKTGD RIVLLPNKLY WGTKPKVDQL VIRSIKDASQ RLNELKAGTI DFANDLTPDS LKAVQADKNL VAVKRPSFNV GFVSLNNRNP YLKNDKVRQA ISMAINKKAI VEAFWPGLGI SNASFLPPVL SWANSKNVPA DYKYDPQAAK KLLADAGYPN GFSVDLWYMP VSRPYFPQPK PIAEAIAADL SAIGIKVNLK TEDWAKYLED RRKEPGFDMY MIGWTGDYGD PDNFYSAYYG PGGSDDINWN PPQLEKLLEQ GRAAVSQADK AKAYSQIHEI TYKANYRIPM VHSQPLAAAR TYVKGWVPSP LGSEAFNTIS VVGKK
|
| |
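The gene and protein sequences above are printed in blocks of 10 characters. As a rough illustration of how the two relate, the sketch below strips the spacing and translates a DNA string with a codon table built from the standard amino-acid assignments, which translation table 11 is assumed here to share for sense and stop codons. Only the first three blocks and the final nine bases of the gene sequence are used, and the helper names are hypothetical.

```python
# Codon table in the usual TCAG ordering; the amino-acid assignments are
# assumed to be identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGAAGAAAC TGCTCCTGAC CGCCCTGCTC"))  # prints "MKKLLLTALL"
    # Last nine bases of the gene sequence, ending in the TAA stop codon.
    print(translate("AAGAAATAA"))  # prints "KK"
```

Both results agree with the start (MKKLLLTALL) and the end (the final two residues, KK) of the protein sequence listed above.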