Gene Mthe_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1401 
Symbol 
ID4463024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1501574 
End bp1502785 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content56% 
IMG OID639700419 
Producthypothetical protein 
Protein accessionYP_843816 
Protein GI116754698 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03266] putative methanogenesis marker protein 1
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGACTTA GATCCTGCCT GAAGAGGTAC AAGAAGGACA CACATCGTGC TCTGCCTCCG 
GAGGAGACGC TCGAGATAGT AGAAAAGAAG ATGCCTGCAG CTGGAATAAC AAGAGTCGCG
GACATAACCA ACCTCGACCG GATAGGCATA CCGGTCTTCA CTTCCATAAG ACCAACAGCG
GAGAAGGGTG CGATATCTGT TTACAATGGA AAGGGCGCGA CTCCAACCGA GGCGAAGGTC
TCCGCCATAA TGGAGGGCAT CGAGAGGTAC TCTGCCGAGG TTCGGAATGC CGATCTCAGA
ACTGCCAGGT TCTCCGAGCT CAGGGAGAAC GCGCTGAATC CGGCGGAGCT CATACTGCCG
AGAGATGCCG ATCCAGATGC GGTGATTCCG TGGGTGACAG GATATGATCT CATGGGGGAT
GAGGAGATCC TGGTTCCGGC GAATGCTGTA TTCCATCCGC TGCCCTCGAG CTACACGCGT
CTGTTCAGGA CAAACACCAC AGGTCTCGCA TCTGGGAACC AGCTGGAGGA GGCGATCTTC
CATGGGCTCG CAGAGGTCGT TGAGAGGGAT GCGTGGTCGA TAGCAGAGCA CGCCAGGAGT
ATGGGGCCGC TGCTCCGATA CAACGGCGAT GGCCTTGCAG GGGAGCTTCT TGAAATGTTC
CAGCGCGCAG AGGTCCAGGT TTACGTGAGG GACATAACGA GCGATGTGGG TGTTCCAACA
TTCGCCGCGG TCTCGGATGA TGTGAAGTTG AAGGATCCCG CGCTTCTGAC AGCTGGAATG
GGAACGCATA CCGATCCGGA GGTGGCGCTG CTGAGAGCGC TCACAGAGGT CGCACAGAGC
CGTCTCACGC AGATACACGG CGCCCGGGAG GATACTGTGA GCGCCGAATT CAGGCGCATG
ATGGGTTATG ACAGGCTGAA GCGCCTGAAC AGGCACTGGT TTGAGTACGA ACGCGAGGAG
GATTTCAGCT CCTTAAACTC TTACAATACC GATGATTTTC TTGATGACAT AAGATACATG
CTCGATCGGC TTCAGACCGC AGGATTTGAG AGGGCGATAG TTGTGGACCT CACGGCTTCT
GAGATAATGG TGCCAGTGGT CAGGGTGATA GTGCCCGGGC TGGAGATATC GGCTGTGGAT
CCGGAGAGGG TCGGCAAGCG GTGCAGGGAT GCCAAGAATC GTCGTGTTTC TGGGCCCAAG
CATGCCTCAT GA
 
Protein sequence
MRLRSCLKRY KKDTHRALPP EETLEIVEKK MPAAGITRVA DITNLDRIGI PVFTSIRPTA 
EKGAISVYNG KGATPTEAKV SAIMEGIERY SAEVRNADLR TARFSELREN ALNPAELILP
RDADPDAVIP WVTGYDLMGD EEILVPANAV FHPLPSSYTR LFRTNTTGLA SGNQLEEAIF
HGLAEVVERD AWSIAEHARS MGPLLRYNGD GLAGELLEMF QRAEVQVYVR DITSDVGVPT
FAAVSDDVKL KDPALLTAGM GTHTDPEVAL LRALTEVAQS RLTQIHGARE DTVSAEFRRM
MGYDRLKRLN RHWFEYEREE DFSSLNSYNT DDFLDDIRYM LDRLQTAGFE RAIVVDLTAS
EIMVPVVRVI VPGLEISAVD PERVGKRCRD AKNRRVSGPK HAS