Gene Mthe_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1075 
Symbol 
ID4461800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1164658 
End bp1166892 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content38% 
IMG OID639700093 
Producthypothetical protein 
Protein accessionYP_843499 
Protein GI116754381 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0202284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATATG AGTCAAACAT TTTTTTTAGA TTCTGGCTTA AATTGCACGA TTTCATAATA 
AAAAACGAAG TGTTTTTGGT TCTCTTGTCG GTATATTTCA TATACAATAT CAATTGTAGA
ACCCTTGGAT CCGGAGATAC CATTCCAGCT TCTTTGTTAC CATTCAGCAT ATTGGAATAC
CACAATTTAT ATATGGATAA TTTTTATTTT TATTATTTTT CTAATTATGA TCAGCTATGG
TATTTCAAAG AGGCTGATGG TCATTTTTTG TCGAGCTATC CAATTGTAAC ACCATTATTA
ATCACACCAC TTTATGTAAT TCCATATATT GTTTTGAAAT TTAATAATTT GCCAATCGAT
TTATTCCATC CTGGGTTCGC TAAAACAGTT TGCGTGATGG AGAAGCTATC CGCATCTTTA
ATTGCATCGA CATCTGTTGT TTTTGTTTAT TTATCAATAA AAGAGCTAAT AAATAGAAGA
GTTGCATTTA TCGTAGCAGT ACTATTTGCA TTTGGAACCA ACACCTGGGC AATCAGCAGT
CAGGCACTGT GGCAGCATGG ATTGATAGAG CTTATCCTGG CCATGTTCAT ATATCTAGTT
TTAATAAACG AAAAGATAAA GTCAAATAAG ATAGTTATTT CTCTGGGTGT TTTATCCGGG
TTATTCGTAT TTAACAGACC AATCGACAGC GTACTATTGG CGCCTGTGTT ATATTATATA
TTTGATATGA GAGATAAAAG GATCATTTAT TATATATTTT CAGCATTTTC ATCCGGTGCA
CCGTTTCTTT TATATAATAT TTATTATTTT GGAAATTTAT TTGGAGGTTA TACAGATCTG
CTAAAATTGT TTGATTTAAG TCCTGAGATG GTTACAAGAT TTGCTGGATT GCTAGTTAGC
CCAAGTCGAG GGCTCTTCGT ATATACTCCT ATCACGTTGT TATCCGTGTT AGGATTTGCT
AAAGTTATGC GGATACCCAA CAAACGAATC AAAAATTTTC TAATTTTGAT GGGCATTTCG
TGCTTTACTC TGGTCATAAT ATACAGCTCT TTTATCATAT GGTGGGCAGG TGGGTCGTAT
GGACCCAGGT TCCTGACTGG CATGCTTCCT GCAATGGCGA TCTTTTTAGG ATTTTTTATC
AAAGACATTA AATTGAACGT TTACAGATTT AAAAATTTAT CAATTATTTT TATCGTATCC
GCCCTAGTTT TTTGGTCGTT TTTTACACAA TTTGTCGGTG CATTCTATTA TCCAAACGGC
AACTGGGATG GCGATCCAAA CGTGGATCTG CACCCTGAAA AATTGTGGGA CTGGAAAGAC
ACACAATTGA CCAGAACATT CAATGCTGGT ATGGCATCAT CACCACTGAG CTGCTTTAAA
AATATCTTCT CTGCCATGTC TCTTTTACAT ATAAAAGATA TCTCCGATTA CAGCATAATG
AAACTTGCCG GCTGGTATGG AATAGAGTTA TGGAATAATG TGCCCACACG ATGGATGCAA
GATGATGCAA AGGTGGCATT AAAATCTCCC GATAATCAGA CATGCGAAAT GAGCTTGCAA
GCAACGAGCT TCTACCATCC AAGGACTTTG GAGATATATG CAGGAGACGA GAAGATATTA
ACCGTTGAAA TCCCCAGCGA CGGATTTATC AATCTGTCTG TACCTGTAAG CCTTGTAGAG
GGCATGAATG TAATACGCAT GCATGTGCCA GAGGGTTGCG AAAGACCTTG CGACATAAAA
GAGCTGAACA ACCCTGACTC CAGGTGCCTG AGCATTGCTG TGCAGAACTT AAAGATCATG
CCATCTGAGT CCATCATATA CTATACACCC ATTTCAGGGT TCTATGGTAT CGAATCCTGG
TCCGGAATAC CAACAAACTG GATCATGGAT GATGCAGATC TTGGAGTATT TTCGCTGGAT
AATTTAACCT GTAACCTAAG CATACGAGCG AGGAGCTTCC ATTGTACAAG AACGCTGGAA
ATATATGCAG GAAATGCTTT TATAAATAGC GTTTCAGTTC CGAGTGACGA CTTCATCAAT
GTAACATGTT CCATAAAGCT TGCTAGGGGC ATGAATACAA TACAGCTGCG TGTACCAGAT
GGCTGCGATA GACCATCAGA CATTAGAGAG ATAAATGTCC AAGATCGGAG ATGCCTTAGC
ATGGCTCTAC AGAACGTGAG AATAGATTGC GGAAATCGAT TGAGGTGTGA CCAAAGTGGA
GAGAGAACCT GCTGA
 
Protein sequence
MEYESNIFFR FWLKLHDFII KNEVFLVLLS VYFIYNINCR TLGSGDTIPA SLLPFSILEY 
HNLYMDNFYF YYFSNYDQLW YFKEADGHFL SSYPIVTPLL ITPLYVIPYI VLKFNNLPID
LFHPGFAKTV CVMEKLSASL IASTSVVFVY LSIKELINRR VAFIVAVLFA FGTNTWAISS
QALWQHGLIE LILAMFIYLV LINEKIKSNK IVISLGVLSG LFVFNRPIDS VLLAPVLYYI
FDMRDKRIIY YIFSAFSSGA PFLLYNIYYF GNLFGGYTDL LKLFDLSPEM VTRFAGLLVS
PSRGLFVYTP ITLLSVLGFA KVMRIPNKRI KNFLILMGIS CFTLVIIYSS FIIWWAGGSY
GPRFLTGMLP AMAIFLGFFI KDIKLNVYRF KNLSIIFIVS ALVFWSFFTQ FVGAFYYPNG
NWDGDPNVDL HPEKLWDWKD TQLTRTFNAG MASSPLSCFK NIFSAMSLLH IKDISDYSIM
KLAGWYGIEL WNNVPTRWMQ DDAKVALKSP DNQTCEMSLQ ATSFYHPRTL EIYAGDEKIL
TVEIPSDGFI NLSVPVSLVE GMNVIRMHVP EGCERPCDIK ELNNPDSRCL SIAVQNLKIM
PSESIIYYTP ISGFYGIESW SGIPTNWIMD DADLGVFSLD NLTCNLSIRA RSFHCTRTLE
IYAGNAFINS VSVPSDDFIN VTCSIKLARG MNTIQLRVPD GCDRPSDIRE INVQDRRCLS
MALQNVRIDC GNRLRCDQSG ERTC