Gene Information |
Locus tag | Mthe_0510 |
Symbol | |
ID | 4463423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 521350 |
End bp | 522882 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639699513 |
Product | extracellular solute-binding protein |
Protein accession | YP_842941 |
Protein GI | 116753823 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
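Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mthe_0510.
length_bp = gene_length(521350, 522882)
print(length_bp)                  # 1533
print(protein_length(length_bp))  # 510
```

Both results agree with the Gene Length (1533 bp) and Protein Length (510 aa) rows.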
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0306064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGGT TGGTAAAGAT CCTCTGCACT GCCATGCTGA TATCGCTTCT TCTGCCGGCA GTTGCAGAGG AGATACCGGT ATACACGATA GCGGACACGA CCGGGGACTG GGGGTACCCA TCGCCGTACC TCCACTACTC CAGGGGTCCC GGATATGTGC GGATGAGCTT CATCTTCGAT ACCTTGGTCT GGAAGGACCA GAACGGCTTC GTGCCTGCGC TCGCAGAGAG CTGGGAGTAT CTGAAGGACG AGAACGCGTA CATCTTCAAT CTGAATCCGA ACGCGAAATG GCATGACGGC GAGCCATTCA CAGCGGACGA TGTGGTCTTC ACGATCAACT ACATAAAAGA GCATCCGTAC CAGTGGGTCG ACAGCAGCAC TGTGGATAGG GCTGAGAAGA TCGATGATCA CACGGTTAAG ATCTACCTGG CGAAGCCCTA CGCCCCGTTC CTGGACCAGG TCGCAGGTAC ACTGCCGATC CTCCCGGAGC ACATCTACAT CGATGTGACG AAGCCGGAGG ATTTCCAGGA TGCGAAGGCG CTTACAGGCA CAGGACCGTT CAAGCTTGTC GATTACAACA AGGCCCAGGG CACCTATCTG TACGAGGCTA ATGAGGATTA CTACCAGGGC GCTCCGAAGG TGAAGCAGCT CAAGTTCGTC AAGCTGAGCG AGGAGATGGC AGGGCCGGCG CTGAAGCGAG GCGAGGTCGA TGCTGCGGCT GTGCCTCCAG AGGTCGCTGA GGGGCTCAGG GGCAGCTTCG TGGTTCTCGA GGGCGCGCAC GACTGGCTCG CGAAGCTCAT GATAAACCAC AAGAAGGAGC CGTTCTCCGA TGTCAGGTTC AGACAGGCTC TTGCGTATTC CATAGACAGG GAGAAGCTCG TTGAGATAGC CCAGCGCGGG TATGGTGTTC CTGCGAGCCC CGGCCTCTTC GCGCCTGACA GCGAGTGGTA CAATCCGGAT GTTGAGCAGT ATGCCCATGA TCCGGAGAAG GCGGGCGAGC TCCTGAAGGA GCTGGGGTAT GAGAAGAAGG GGGTCTTCTT CGAGAAGGAC GGAAAGACGC TGGAGGTTGA GCTTCTCGTC ACATCGAGCA ATGAGCGCGC TGGCGAGCTC ATAAAAAGAG ATCTGGAGGA GGCGGGCATA AAGGTCAACA TGCGCAGCGT CGACTCGAAG ACTCTTGATA ACGCGGTCAA CGAGTGGAAC TTCGATCTCG CGCTCAGCGG GCACGGCGGA ATGGGCGGAG ATCCGGCGAT ACTGAACAAG GTGATCACAG GGTCTGGCTT CAACAGCGCC AGATACGATA ACCAGGAGCT GAGTGATCTT CTCAAGAGGG AGATCTCGGA GATGGATCCG GAGAAGAGAA GAGAGCTGGT GAACGAGATA CAGGAGGTCT ACGCGAGAGA ACTTCCAGCG CTCCCGCTCT ACTACCCGAC CAACTACTGG GCACACGACG GGAAGGTCGA TCTCTTCTAC ACAAAGAACG GCGTTGGGAG CGGCGTGCCA ATTCCGCTGA ACAAGATGGC CTTCCTGGCC TGA
|
Protein sequence | MSRLVKILCT AMLISLLLPA VAEEIPVYTI ADTTGDWGYP SPYLHYSRGP GYVRMSFIFD TLVWKDQNGF VPALAESWEY LKDENAYIFN LNPNAKWHDG EPFTADDVVF TINYIKEHPY QWVDSSTVDR AEKIDDHTVK IYLAKPYAPF LDQVAGTLPI LPEHIYIDVT KPEDFQDAKA LTGTGPFKLV DYNKAQGTYL YEANEDYYQG APKVKQLKFV KLSEEMAGPA LKRGEVDAAA VPPEVAEGLR GSFVVLEGAH DWLAKLMINH KKEPFSDVRF RQALAYSIDR EKLVEIAQRG YGVPASPGLF APDSEWYNPD VEQYAHDPEK AGELLKELGY EKKGVFFEKD GKTLEVELLV TSSNERAGEL IKRDLEEAGI KVNMRSVDSK TLDNAVNEWN FDLALSGHGG MGGDPAILNK VITGSGFNSA RYDNQELSDL LKREISEMDP EKRRELVNEI QEVYARELPA LPLYYPTNYW AHDGKVDLFY TKNGVGSGVP IPLNKMAFLA
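Note: the sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard table only in which codons may act as start codons). The helper names are hypothetical, and only the first blocks of the gene sequence are used as input, so the printed GC value describes that fragment, not the whole gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for 10-character grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGAGCAGGT TGGTAAAGAT CCTCTGCACT"
print(translate(fragment))                       # MSRLVKILCT
print(gc_fraction("ATGAGCAGGT TGGTAAAGAT"))      # 0.4
```

The translated fragment matches the first block of the protein sequence row (MSRLVKILCT).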