Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1482 |
Symbol | |
ID | 4461936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1596641 |
End bp | 1598281 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639700501 |
Product | hypothetical protein |
Protein accession | YP_843895 |
Protein GI | 116754777 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.431628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGA CGCTTAATCC TTATTCGGAC ATCTCTCTAT CCCCCATCAC AAAGTTCAAG GATGTGTGGC GCTCAGAAGC AAAGGGGATT GTCTTGAGTT TCGCGGAGAG GATCATGGTG AAGAGCATGC CAGGCAATGT GTACAGAATA CTCGCAAAGG ACGTGCGTTC ATACGAGTTC ATAATACCCT ATGATGAAAA GGCAGATGTG GGGGAGATAT TCACTGTGGA GGACAGGGAT ATGCTCTTTC TCGCCCGTGT CGTCAATGTG CAGCACGACT CAAACTACAA CGGGAGATGG GATACGAGCA TTAGGGGCAC GGAGCTGTAC GATGAGGAGC AGATCTTCAA CAGGGTGATC GCAGAGCCGC TCGGATGCAT ACCCCGCGAT CAGACCAAAA GAAGAGGCGC ATTCAGAAAG GCAAAGACAA TTCCCACGAA GTTCTCAAAG GTGAAGAGGA CAGAGGCGGA GGAGTTCGGT TTTTTAAAGG ATACTATGGG AGATATCGAG GTCGGTGTTC TGAGAAACGG CAGCAGGGAC ACAGATGTGA CTGTGGCGCT CCACAGCAGC GCGATGGACC ATCATATGGG CATATTCGCA ACCACTGGAA TGGGAAAGTC TAATTTCATG AAGGTCTTCG CAGCCTCATG CATGAAGCTT GCTTCAGAGA ACAGATCTGA GTTCGGGCTG CTCATAGTAG ATCCTCATGG GGAGTACCTG AGAGGCGGAA AGGCCGGCAA GGGGCTTCTT CACCTCACCC CGTATGCATC AAGCCTTAAA TGTTACTCCA CAGATCCCAG GAATCACAGC CTCCCGGAGG TCAGCGAACT CACGATCTCC AGAAGGGATA TCCTGCCAGA GGATATCAAG GTGCTGCATG ACTGGACCGG TGCGCAGATG GACGCCTTGG ACAGTATCGA GCGGATCTTC GATGGGGACT CGTGGATAGA TGAGATCCTT GATGAGAATG GTAAAGCGAT GCTGACAGAG AGGGCTAAGG TATCAGAAAA GACTGTCGAT GTTCTGGTAA GAAAGCTGGA GAACATACTT TCAAGAAATA AATATATAAA ATCCTCTGGA AACTCCAGCA TCCCCGGGAT AATCGAGGGA GTGAAGAGCG GAAAGGTCGT GCTGATCGAC ATCCCCAACC TGAGCGAGAG CAGCGAGCTC TTCCTGCTCT CCCTGATATC CAGGCGGATA ATGGAGGACT ACAGAAACGA GGAGGAGGGG AGGAAGAGGT GCATGATAGT CATCGAGGAG GCGCAGAGGG TTCTCGGGAG TGACAGCAGG ATCGCGCGCT TCGAGGAGAT AGCAAGAGAG GGAAGAAAGT TTGGAGTTGG ACTTTGCGCG ATAACACAGC AGCCGAAGCT GATAGACAGG GAGCTCCTCT CGCAGTTCAA CACGGTGGTC GTGATGGGGC TCGCAGACAG GAACGATCGG GTGCGCGTGG AGGAGTCCGC AAAGCAGGAT CTCTCGTCGC TGGATGTTGA GATACAGACC TTGGAGAAGG GAGAGGCGAT CATAAGCACG CTGAACGTAC CCTTCCCGAT ACCTGCTAAG ATACACATAT ATGAGGATTA CATAGAGCGT CTCCGGCCGA GTGCACCCGA GAGAAGGTCG TTCAGGCCGA CACCTGACTG A
|
Protein sequence | MSETLNPYSD ISLSPITKFK DVWRSEAKGI VLSFAERIMV KSMPGNVYRI LAKDVRSYEF IIPYDEKADV GEIFTVEDRD MLFLARVVNV QHDSNYNGRW DTSIRGTELY DEEQIFNRVI AEPLGCIPRD QTKRRGAFRK AKTIPTKFSK VKRTEAEEFG FLKDTMGDIE VGVLRNGSRD TDVTVALHSS AMDHHMGIFA TTGMGKSNFM KVFAASCMKL ASENRSEFGL LIVDPHGEYL RGGKAGKGLL HLTPYASSLK CYSTDPRNHS LPEVSELTIS RRDILPEDIK VLHDWTGAQM DALDSIERIF DGDSWIDEIL DENGKAMLTE RAKVSEKTVD VLVRKLENIL SRNKYIKSSG NSSIPGIIEG VKSGKVVLID IPNLSESSEL FLLSLISRRI MEDYRNEEEG RKRCMIVIEE AQRVLGSDSR IARFEEIARE GRKFGVGLCA ITQQPKLIDR ELLSQFNTVV VMGLADRNDR VRVEESAKQD LSSLDVEIQT LEKGEAIIST LNVPFPIPAK IHIYEDYIER LRPSAPERRS FRPTPD
|
| |