Gene Information |
Locus tag | Mthe_0785 |
Symbol | |
ID | 4462424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 834346 |
End bp | 835884 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639699796 |
Product | extracellular solute-binding protein |
Protein accession | YP_843214 |
Protein GI | 116754096 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
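The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the reported values, but the record does not state them. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information rows above.
start_bp = 834346
end_bp = 835884
reported_gene_length = 1539      # bp
reported_protein_length = 512    # aa


def gene_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)        # True
print(computed_protein_length == reported_protein_length)  # True
```

Under these assumptions, 835884 - 834346 + 1 gives 1539 bp, and 1539 / 3 - 1 gives 512 aa, which agrees with the Gene Length and Protein Length rows.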
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0600833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGGAT TGAAAGACGT TTTGTGCACT GCAATACTGA TATCGCTTCT TCTGCCGGCA GTTGCAGAGG AGATACCGGT ATACACGATA GCGGACACGA CCGGGGACTG GGGGTACCCA TCGCCGTACC TCCACTACTC CAGGGGTCCG GGTTACGTGC GGATGAGTTT CATCTTCGAT ACACTGGTCT GGAAGGACCA GAACGGCTTC GTGCCTGCGC TCGCGGAGAG CTGGGAGTAT CTGGAGGGCG AGAACGCGTA TCTCTTCAAT CTGAATCCGA ACGTGAAATG GCATGACGGT GAGCCATTCA CCGCAAACGA CGTTGTATTC ACGATCAACT ACATAAAAGA TCATCCGTAC CAGTGGGTCG ACAGCAGCAT TGTGGATAGG GCTGAGAAGA TCGATGATCA CACGGTGAAG ATCTACCTGG CGAAGCCCTA CGCGCCGTTT TTGGACCAGG TGGCTGGCAC TCTGCCGATT CTCCCAGAGC ACATCTACAG CGATGTGACA AACCCGGAGG ATTTCCAGGA TCCAAAGGCG CTTACAGGCA CAGGACCGTT CAAGCTTGTC GATTACGACA AGGCCCAGGG GACATATCTG TATGAGGCGA ACGAGGATTA CTACCAGGGC GCTCCGAAGG TTAAGCAGCT GAAGTTTGTA AAGCTGAGCT CAGAGATGTC CGGGCCGGCG CTGAAGCGAG GCGAGGTCGA TGCCACGGCA GTTCCACCAG AGGTCGCTGA CGAGCTCAGG GGCAGCTTCG TCGTTCTCGA AGGCGCTCAT GACTGGCTCG CGAAGCTGAT GATAAACCAC AAGAAGGAGC CGTTCTCCGA TGTGAGGTTC AGGCAGGCTC TCGCATACGC CATAGATAGA GAGAAGCTCG TCGAGATAGC CCAGCGCGGA TATGGCGTTC CTGCGAGCCC CGGACTATTC GCGCCTGACA GCGAATGGTG CAACCCGGAT GTGGAGCAGT ACGAGCATGA TCCTGAGAAG GCCGGGGATC TGCTGAAGGA GATGGGGTAT GTGAAGAAGG GAGATTTCTT CGAGAAGGAT GGAAATACAC TGGAGGTGGA GCTTCTCGTC ACATCGAGCA ATGAGCGCGC GGGCGAGATC ATAAAGAACG ACCTGGAGGA GGCTGGCATA AAGGTCAACA TGCGCAGCGT CGACTCGAAG ACACTTGATA ACGCGGTCAA CGAGTGGAAC TTCGATCTCG CGCTCAGCGG CCACGGCGGA ATGGGTGGAG ATCCCGAGAT ACTCAATCGC ATAATAGGCG AGGGATACAC ATTTTGCAGC GATCGATACA TCGTGAACCA AACGCTAAAC GATCTGCTGG ACCAAGAGGT TGCTGAAATG AACCCTGACA GGCGAAAGCA GATTGTAAAA GAGATTCAGC TGGTCTACTC GCAGCAGTTG CCTTCGTTGC CATTGTACTA TCCTGAATCT TTCTGGGCGC ATAATGGCAA GGTCGATCTG TATTACACTA AGAGAGGGAT CGCAAACGGC ATACCCATAC CGCTGAACAA GCTATTATTT GCGAAGTAA
Protein sequence | MAGLKDVLCT AILISLLLPA VAEEIPVYTI ADTTGDWGYP SPYLHYSRGP GYVRMSFIFD TLVWKDQNGF VPALAESWEY LEGENAYLFN LNPNVKWHDG EPFTANDVVF TINYIKDHPY QWVDSSIVDR AEKIDDHTVK IYLAKPYAPF LDQVAGTLPI LPEHIYSDVT NPEDFQDPKA LTGTGPFKLV DYDKAQGTYL YEANEDYYQG APKVKQLKFV KLSSEMSGPA LKRGEVDATA VPPEVADELR GSFVVLEGAH DWLAKLMINH KKEPFSDVRF RQALAYAIDR EKLVEIAQRG YGVPASPGLF APDSEWCNPD VEQYEHDPEK AGDLLKEMGY VKKGDFFEKD GNTLEVELLV TSSNERAGEI IKNDLEEAGI KVNMRSVDSK TLDNAVNEWN FDLALSGHGG MGGDPEILNR IIGEGYTFCS DRYIVNQTLN DLLDQEVAEM NPDRRKQIVK EIQLVYSQQL PSLPLYYPES FWAHNGKVDL YYTKRGIANG IPIPLNKLLF AK
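The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own method: the helper `translate` and the constant names are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard table (the two differ only in which codons may act as starts), and on the gene sequence being listed in coding orientation, which the leading ATG suggests. Spaces inside the sequence, as used in the record, are ignored.

```python
# Codon table in the conventional TCAG ordering. The residue assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTGGAT TGAAAGACGT TTTGTGCACT"))  # MAGLKDVLCT
```

The ten residues printed match the first block of the protein sequence, and the final three codons of the gene (GCG AAG TAA) translate to AK followed by a stop, matching the protein's last two residues.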