Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1075 |
Symbol | |
ID | 4461800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1164658 |
End bp | 1166892 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 639700093 |
Product | hypothetical protein |
Protein accession | YP_843499 |
Protein GI | 116754381 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0202284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATATG AGTCAAACAT TTTTTTTAGA TTCTGGCTTA AATTGCACGA TTTCATAATA AAAAACGAAG TGTTTTTGGT TCTCTTGTCG GTATATTTCA TATACAATAT CAATTGTAGA ACCCTTGGAT CCGGAGATAC CATTCCAGCT TCTTTGTTAC CATTCAGCAT ATTGGAATAC CACAATTTAT ATATGGATAA TTTTTATTTT TATTATTTTT CTAATTATGA TCAGCTATGG TATTTCAAAG AGGCTGATGG TCATTTTTTG TCGAGCTATC CAATTGTAAC ACCATTATTA ATCACACCAC TTTATGTAAT TCCATATATT GTTTTGAAAT TTAATAATTT GCCAATCGAT TTATTCCATC CTGGGTTCGC TAAAACAGTT TGCGTGATGG AGAAGCTATC CGCATCTTTA ATTGCATCGA CATCTGTTGT TTTTGTTTAT TTATCAATAA AAGAGCTAAT AAATAGAAGA GTTGCATTTA TCGTAGCAGT ACTATTTGCA TTTGGAACCA ACACCTGGGC AATCAGCAGT CAGGCACTGT GGCAGCATGG ATTGATAGAG CTTATCCTGG CCATGTTCAT ATATCTAGTT TTAATAAACG AAAAGATAAA GTCAAATAAG ATAGTTATTT CTCTGGGTGT TTTATCCGGG TTATTCGTAT TTAACAGACC AATCGACAGC GTACTATTGG CGCCTGTGTT ATATTATATA TTTGATATGA GAGATAAAAG GATCATTTAT TATATATTTT CAGCATTTTC ATCCGGTGCA CCGTTTCTTT TATATAATAT TTATTATTTT GGAAATTTAT TTGGAGGTTA TACAGATCTG CTAAAATTGT TTGATTTAAG TCCTGAGATG GTTACAAGAT TTGCTGGATT GCTAGTTAGC CCAAGTCGAG GGCTCTTCGT ATATACTCCT ATCACGTTGT TATCCGTGTT AGGATTTGCT AAAGTTATGC GGATACCCAA CAAACGAATC AAAAATTTTC TAATTTTGAT GGGCATTTCG TGCTTTACTC TGGTCATAAT ATACAGCTCT TTTATCATAT GGTGGGCAGG TGGGTCGTAT GGACCCAGGT TCCTGACTGG CATGCTTCCT GCAATGGCGA TCTTTTTAGG ATTTTTTATC AAAGACATTA AATTGAACGT TTACAGATTT AAAAATTTAT CAATTATTTT TATCGTATCC GCCCTAGTTT TTTGGTCGTT TTTTACACAA TTTGTCGGTG CATTCTATTA TCCAAACGGC AACTGGGATG GCGATCCAAA CGTGGATCTG CACCCTGAAA AATTGTGGGA CTGGAAAGAC ACACAATTGA CCAGAACATT CAATGCTGGT ATGGCATCAT CACCACTGAG CTGCTTTAAA AATATCTTCT CTGCCATGTC TCTTTTACAT ATAAAAGATA TCTCCGATTA CAGCATAATG AAACTTGCCG GCTGGTATGG AATAGAGTTA TGGAATAATG TGCCCACACG ATGGATGCAA GATGATGCAA AGGTGGCATT AAAATCTCCC GATAATCAGA CATGCGAAAT GAGCTTGCAA GCAACGAGCT TCTACCATCC AAGGACTTTG GAGATATATG CAGGAGACGA GAAGATATTA ACCGTTGAAA TCCCCAGCGA CGGATTTATC AATCTGTCTG TACCTGTAAG CCTTGTAGAG GGCATGAATG TAATACGCAT GCATGTGCCA GAGGGTTGCG AAAGACCTTG CGACATAAAA GAGCTGAACA ACCCTGACTC CAGGTGCCTG AGCATTGCTG TGCAGAACTT AAAGATCATG CCATCTGAGT CCATCATATA CTATACACCC ATTTCAGGGT TCTATGGTAT CGAATCCTGG TCCGGAATAC CAACAAACTG GATCATGGAT GATGCAGATC TTGGAGTATT TTCGCTGGAT AATTTAACCT GTAACCTAAG CATACGAGCG AGGAGCTTCC ATTGTACAAG AACGCTGGAA ATATATGCAG GAAATGCTTT TATAAATAGC GTTTCAGTTC CGAGTGACGA CTTCATCAAT GTAACATGTT CCATAAAGCT TGCTAGGGGC ATGAATACAA TACAGCTGCG TGTACCAGAT GGCTGCGATA GACCATCAGA CATTAGAGAG ATAAATGTCC AAGATCGGAG ATGCCTTAGC ATGGCTCTAC AGAACGTGAG AATAGATTGC GGAAATCGAT TGAGGTGTGA CCAAAGTGGA GAGAGAACCT GCTGA
|
Protein sequence | MEYESNIFFR FWLKLHDFII KNEVFLVLLS VYFIYNINCR TLGSGDTIPA SLLPFSILEY HNLYMDNFYF YYFSNYDQLW YFKEADGHFL SSYPIVTPLL ITPLYVIPYI VLKFNNLPID LFHPGFAKTV CVMEKLSASL IASTSVVFVY LSIKELINRR VAFIVAVLFA FGTNTWAISS QALWQHGLIE LILAMFIYLV LINEKIKSNK IVISLGVLSG LFVFNRPIDS VLLAPVLYYI FDMRDKRIIY YIFSAFSSGA PFLLYNIYYF GNLFGGYTDL LKLFDLSPEM VTRFAGLLVS PSRGLFVYTP ITLLSVLGFA KVMRIPNKRI KNFLILMGIS CFTLVIIYSS FIIWWAGGSY GPRFLTGMLP AMAIFLGFFI KDIKLNVYRF KNLSIIFIVS ALVFWSFFTQ FVGAFYYPNG NWDGDPNVDL HPEKLWDWKD TQLTRTFNAG MASSPLSCFK NIFSAMSLLH IKDISDYSIM KLAGWYGIEL WNNVPTRWMQ DDAKVALKSP DNQTCEMSLQ ATSFYHPRTL EIYAGDEKIL TVEIPSDGFI NLSVPVSLVE GMNVIRMHVP EGCERPCDIK ELNNPDSRCL SIAVQNLKIM PSESIIYYTP ISGFYGIESW SGIPTNWIMD DADLGVFSLD NLTCNLSIRA RSFHCTRTLE IYAGNAFINS VSVPSDDFIN VTCSIKLARG MNTIQLRVPD GCDRPSDIRE INVQDRRCLS MALQNVRIDC GNRLRCDQSG ERTC
|
| |