Gene Mthe_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1066 
Symbol 
ID4463081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1151073 
End bp1152272 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content50% 
IMG OID639700084 
ProductABC transporter related 
Protein accessionYP_843490 
Protein GI116754372 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGA CAGCGGTTAA GGTCGAGCAC CTCTGGAAGA CGTTCCGGAT ACCTCACGAG 
CGGAGGAACA CGCTGTTCGA GAACATCATT GGTTTTTTCA GGCCGAACAG CTATGAGACA
TTCACCGTGC TGAAGGACAT AAACCTAGAG GTTGAGCGCG GGGAGTGCAT CGGCATAATC
GGCGACAACG GTTCCGGCAA GAGCACGCTT CTGAAGATCA TAGCGAAGAT ACTCAGGCCA
ACAAGCGGAT TTGTAAAGGT ATTCGGGAAG CTCACGCCGT TCCTCGAGCT CGGCGTCGGG
TTCCAGCCGG AGCTCAGCGT GAGGGAGAAC ATACGGATTT ATGCCACCAT AATGGGTCTG
CCGAAGAAGG TAATAGATGA CAGGATAGAT GATGTGATAA GGTTTGCGGG GCTTGAGCGG
TTCGAGGACG CAAAGCTGAA GAATCTCTCG TCTGGCATGC AGGTGCGGCT CGCGTTCTCG
ACAGCGATCC AGACAGACCC GGATATACTC CTGGTCGATG AGGTGCTGGC AGTCGGTGAT
ATGGAGTTCC AGCAGAAGTG CTTCAGGGTG TTCGAGGATT ACAGAGATAG CGGAGTTACA
ATACTGTTTG TGTCTCACGA TCTGAACGCG GTCAGGATGC TATGCGACCG GACACTGCTT
CTCAGCAATG GAGAACGTGT GGATTTTGGA GACACAAATA GCATTATAGA TAAATATATT
TATAAGACAG ATGTATCTGA AGTTGCAGAA ACATCTTCTG AGAAGGAGCG CGCATCCACC
AGGAAAGAGA TAGAGATTGT TGATGTCAAG TTCGTGGACA AATACGGATG TCCCAACGAG
AACTTCGTAG CCGGAGATCC GCTGAGGGTC CGTATTTTCT TTGATGCACA TGGGACGGTT
AGGTCTCCAG TATTTGGTAT AATATTTTAT CATGGAGATA CCTACTGCTA CGGGACAACC
ACGGAGTTTA AGGGATCTGA TACGGGTATT ATCAATGGCA AGGGATATGT GGACTTCATA
ATACCAAGTT TGCCTTTTCT TCAGGGGAGG TTCGAGGTAA CAGTGGCTGT GGCATCACAT
GATTACAGCA CACAGTACGA CTGGCACGAC AGACGCTATG CATTCAACGT CCACAATCCA
ACACGTGACC TGGGCATGAT GCTTATAGAA GGCACATGGT CGCTGCGCAG GGATGCTTAG
 
Protein sequence
MGETAVKVEH LWKTFRIPHE RRNTLFENII GFFRPNSYET FTVLKDINLE VERGECIGII 
GDNGSGKSTL LKIIAKILRP TSGFVKVFGK LTPFLELGVG FQPELSVREN IRIYATIMGL
PKKVIDDRID DVIRFAGLER FEDAKLKNLS SGMQVRLAFS TAIQTDPDIL LVDEVLAVGD
MEFQQKCFRV FEDYRDSGVT ILFVSHDLNA VRMLCDRTLL LSNGERVDFG DTNSIIDKYI
YKTDVSEVAE TSSEKERAST RKEIEIVDVK FVDKYGCPNE NFVAGDPLRV RIFFDAHGTV
RSPVFGIIFY HGDTYCYGTT TEFKGSDTGI INGKGYVDFI IPSLPFLQGR FEVTVAVASH
DYSTQYDWHD RRYAFNVHNP TRDLGMMLIE GTWSLRRDA