Gene Mhun_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_0104 
Symbol 
ID3923799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp120839 
End bp122701 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content47% 
IMG OID637895757 
Producttype II secretion system protein E 
Protein accessionYP_501601 
Protein GI88601423 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGGC TGCAGATTCA GGGGAAAATA CCTTTTCAGG AGGCTGAGAG TCATGATACT 
GAAGAGTGTA TACTAAACCC TGAGTCCTGC GCCCTGTACC GGATGCTGCC CGCAAATGCC
AAAGAGTATG CCCGGAACTA TCCCCACCTT CTTGAATACC TCCATATCTT CCCGGTAGAT
GAATTCGGAA TCCCGTTATT CTTTTCTGAG CTGAAACGTG ATTTGAAAGG GATTAAGGAT
CCGAATCTCA TTTATCCTGC AAAACCTCCA ATTTTCATTC ATATCTTCTT TGATCCGAAT
GATGTCAGGA ACTTTTACAT CCCCATCGAA CCATCGTTCA TGCATAATAT CGGCCGGTTG
CTTCCGGCAA TCGAGTACCG GCTGGTAGAC CTGCTTGATG CCCTGGAAGA AGACCCGGTT
ACTCCGGAAG AGCGGACAGC GGTATTAAAA CGCCTGCTCA GACAGGTCAT GTATATCAAG
AAAGCAGGAG AAGCAATTGA TCCCAGCCTG CTGAAGATTG AAGGGCCGAA AGGATTTGGA
GACAAGTTAA AATCATTCCT TACCACTGAT CTGACCGCCA AAGAAGAACC GGGTGCCGAG
CGGCTTTTTG CAGATGTGCC CCATCTCTCT GACGGGCGTA TCATAGTGAG TCCCCAGGAG
TTTGAAGCGA TTGAGTACCA GATGATCCGT GACAAAATCG ATGTCGGACT TCTGTATCCG
TTTATCTCGG ATAATTTTAT TGAGGATATC ACCTGTGACG GGCTTGGCCC CATATTTATC
GAACACAAGA TCTTCAAAGG GCTCAAATCT GTCGTCGGTT TTGATCAGGA GGCTGCACTT
GATGATTTCT GTATTAAGCT TGCAGAAAAA TCGCGTCGCC CGATTACGTA CCGGAATCCA
ATCGTTGATG CGACCCTTCC GGACGGATCA CGTATCAATA TCGTCTATTC AACGGAGATC
AGCCGGCAGG GGAGTAACTT TACCATTCGT AAAGCGATGG ATGATGTCAT CTCCATCACC
AAGCTGGTCG AATTTGGAAC CTGTAGTTAT GCACTTGCGG CATATCTCTG GATCTGCATT
GAGAACGGAA TGTCCCTTTT CATGTCCGGC GAGACGGCAT CCGGCAAGAC AACCTCAATG
AATGCTCTTA CAACCTTCAT CTCCCCTGAG TCAAAGATCG TCAGTATCGA GGACACTCCG
GAACTTATCA TCCCTCATAA AAACTGGACC CGTGAAGTAT CCAAAGGAAA AGGAAAAGGT
GAAGGTGAAG GTTCAGATGT TACCATGTTC GATCTCCTTC GTGCTGCACT TCGTCAGCGT
CCAAACATGA TCATGGTCGG AGAGATCCGT GGAGTGGAAG GAGCAGTCGC ATTCGGTGCC
ATGCAGACCG GCCACCCGGT GATGAGTACA TTCCACGCAG CAACGGTGGA GAAACTGATT
CAGCGTCTTA CCGGTGACCC CATTCTTATC CCCAAGACCT TCATTGATAA CCTCAACCTG
GTGGTCATTC AAAGTGCAGT CCGGCGGCCT GATGGAGCGA TGGTCAGACG TCAGCTGAAT
GTTTCTGAAC TGGTCGGATA TGATGCACAA AGTGGAGGTT TCTCCTTTGT TGAGGTCTTT
ACCTGGGACC CGGTCACTGA TACGCACGAA TTTACCGGTA AAGGTTCAAG CTATCTTCTG
GAAAACAAGA TCGCAACCAT GCTTGGTATT CCTGAGCATA AAAAGGCGCA AATGTACATG
GAAGTAGAAA AACGTGCTAA GATCCTGGAA CGGCTTCATA AAGCAGGATA TACTGACTTT
ATCGAACTCT TTCTGATGAT AACAAAATTG AAAAAACAGG GACTTCTCAA TATCGATATC
TGA
 
Protein sequence
MSRLQIQGKI PFQEAESHDT EECILNPESC ALYRMLPANA KEYARNYPHL LEYLHIFPVD 
EFGIPLFFSE LKRDLKGIKD PNLIYPAKPP IFIHIFFDPN DVRNFYIPIE PSFMHNIGRL
LPAIEYRLVD LLDALEEDPV TPEERTAVLK RLLRQVMYIK KAGEAIDPSL LKIEGPKGFG
DKLKSFLTTD LTAKEEPGAE RLFADVPHLS DGRIIVSPQE FEAIEYQMIR DKIDVGLLYP
FISDNFIEDI TCDGLGPIFI EHKIFKGLKS VVGFDQEAAL DDFCIKLAEK SRRPITYRNP
IVDATLPDGS RINIVYSTEI SRQGSNFTIR KAMDDVISIT KLVEFGTCSY ALAAYLWICI
ENGMSLFMSG ETASGKTTSM NALTTFISPE SKIVSIEDTP ELIIPHKNWT REVSKGKGKG
EGEGSDVTMF DLLRAALRQR PNMIMVGEIR GVEGAVAFGA MQTGHPVMST FHAATVEKLI
QRLTGDPILI PKTFIDNLNL VVIQSAVRRP DGAMVRRQLN VSELVGYDAQ SGGFSFVEVF
TWDPVTDTHE FTGKGSSYLL ENKIATMLGI PEHKKAQMYM EVEKRAKILE RLHKAGYTDF
IELFLMITKL KKQGLLNIDI