Gene Information |
Locus tag | Dgeo_2122 |
Symbol | |
ID | 4058857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 2234162 |
End bp | 2235883 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641231162 |
Product | extracellular solute-binding protein |
Protein accession | YP_605585 |
Protein GI | 94986221 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
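The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the usual GenBank-style convention, which this record does not state explicitly) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Dgeo_2122.
length = gene_length_bp(2234162, 2235883)
print(length)                     # 1722
print(protein_length_aa(length))  # 573
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.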
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.973923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCTCTGCT CAGCACCCTT CTTCTCGCTG GCGCGGCCTT TGCCGCTGCC CCCAAAGACA CGCTGGTGGT CCAGCAGGCT GCCGATATCC CCACCCTTGA CCCCGGCGCG ACCTACGACA CCGCCTCGGG TTCAGTGGTG GAGAACATCT ACGAGACGCT GGTGACCTAC AAGGGGAGTA GCCTGCGTGA CCTGGAGCCG CTGCTGGCCA CCAAGTGGAC CATCAGCAAT GGTGGCAAGA CCTACACCTT TGATCTGCGC AAGAACGTCA AGTTCCACTC CGGGAATCCC TTCACCTGCG CTGACGCCGA GTACACCTTC GAGCGCAACC TGGTCACCAA CTCTGCCGAG TCGGGCAATT GGTTCCTGGC TGAGAGCCTG CTGGGCACCG GCAGCAACGC CAACGATGAC AAGTCGATCA CCTGGTCCAA GATCGACAGG GCGGTCGAGT GTAACAATCA GGGCCAACTC GTCTTTAACC TGCCCAAGCC CGACCCGGCC TTCCTCGCCA AGCTGGCCTT CCCGGGTCAG AGCATTGTGG ACAAGAACTG GGCCATCAAG CTTGGAGAGT GGAGCGGCAA AGAGGCTGAC TGGAAGAGCT GGGTGGGCAA GGATCTCCAG GGCAGCAAGC TGAATGCTCA ACCCAGCGGC ACCGGCGCCT ACCGTCTGGT GCGCAAGGAC GCCAATGCCA CGCTGGCCCA GGCCTTCGAC GGCTACTGGG GCAAGAAGCC TGCCATCAAG AATGTGATTC TTCAGAAGGT GCCCGAACTG GCTGCCCGCC AGCAGGCCTT CTTGCGCGGT GACGCCGATC TGATTGAGGC GGGTACCCGC GCCAACGTTG AAGAGCAGCT CAAGGGCAAG CCCGGTGTGG TGGTCGTGGA CGGCCTGCCC AACACCACGG CGACGGCGAT CTTCATGAAC GAGAACATCA AGGATCCTGC GCGCCTGGGC AGCGGCAAGC TGGACGGCCA GGGGATTCCC GCCAACTTCT TCAGCGACGT CAACGTGCGG CGCGGGTTCT CCTATGCCTT CAACTACGCC CAGTACATCA GTGATGTGCT GAAGGGCAAG GGCAAGCAGC GCACCATGCT GCTCCCCGAC TCCTTCCCCG GCTATGACGC TAAGGTCAAG ACCTACAAGT ACGACCCAGC GCAGGCCAAG GCCTACTTCC AGCGCGCCTG GGGCGGCCAG CTCTGGAAAA ATGGCTTTAC CCTGAATGTG GCGTACCGCG CGGGGAGTGT GGGCGCGCAG ACTGCGATGG AAATTCTGAA GAAGAACATC GAGTCTCTCA ATCCCAAGTT CCGCGTGAAC ATCCAGGCCA AGGAGTGGTC GGCCATGCTG AATGACTCCA AGGCAGGGAA GGAGCCGATG ATCATTCTCG GCTGGGCGCC GGACTACGCG GACCCCGACA ACTTCATGTA CACCTTCTAC TCCAGCAACG GGTACTACTA CCCCCGCAGC AATTGGAAGG ACGCGACCGT TGACAAGTGG CTGGAGCAGG CCCGCAACAC CACCAACACG GCCGAGCGTA ACCGCCTCTA CAGCCTGGTG GGTCAGCGCG CCTACGAGCA GGCGCCCTTT ATCCTGGTGC CGGCGGGCGT CGGCTTGAAC GTGCAGCGCA GCAATCTGGT CGGCGCAACG GCGCAGACCT TCAACCCCAT GATCTCGTTC AGCTACACCG GGACCTTCTG GAAGGATCTC AGCAAGAAGT AA
|
Protein sequence | MKKIALLSTL LLAGAAFAAA PKDTLVVQQA ADIPTLDPGA TYDTASGSVV ENIYETLVTY KGSSLRDLEP LLATKWTISN GGKTYTFDLR KNVKFHSGNP FTCADAEYTF ERNLVTNSAE SGNWFLAESL LGTGSNANDD KSITWSKIDR AVECNNQGQL VFNLPKPDPA FLAKLAFPGQ SIVDKNWAIK LGEWSGKEAD WKSWVGKDLQ GSKLNAQPSG TGAYRLVRKD ANATLAQAFD GYWGKKPAIK NVILQKVPEL AARQQAFLRG DADLIEAGTR ANVEEQLKGK PGVVVVDGLP NTTATAIFMN ENIKDPARLG SGKLDGQGIP ANFFSDVNVR RGFSYAFNYA QYISDVLKGK GKQRTMLLPD SFPGYDAKVK TYKYDPAQAK AYFQRAWGGQ LWKNGFTLNV AYRAGSVGAQ TAMEILKKNI ESLNPKFRVN IQAKEWSAML NDSKAGKEPM IILGWAPDYA DPDNFMYTFY SSNGYYYPRS NWKDATVDKW LEQARNTTNT AERNRLYSLV GQRAYEQAPF ILVPAGVGLN VQRSNLVGAT AQTFNPMISF SYTGTFWKDL SKK
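The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how they might be cleaned and checked against the other fields (GC content, protein sequence). It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, with the two tables differing only in which start codons are permitted. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard code; table 11
# is assumed here to use the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGAAAAAAA TTGCTCTGCT CAGCACCCTT"
print(translate(prefix))  # MKKIALLSTL
```

The translated prefix matches the first block of the protein sequence. This gene starts with ATG, so the sketch does not handle alternative start codons such as GTG or TTG, which would need to be read as methionine at the initiation position.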
|
| |