Gene Mhun_2184 details


Gene Information

Locus tag: Mhun_2184
Symbol:
ID: 3922989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 2455564
End bp: 2456673
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 47%
IMG OID: 637897790
Product: thiamine biosynthesis protein
Protein accession: YP_503608
Protein GI: 88603430
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
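
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names gene_length and protein_length are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2455564, 2456673
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the reported Gene Length (1110 bp) and Protein Length (369 aa).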


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.72421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCTG ACATCTGGCT TATCCGATAT TCTGAAATAT TTTTGAAAAG CGAACCTGTT 
CGTCGTGTAT GGGAGGATCT CCTTATCCAA ACACTCAAAC AAAAATTACC GGATTGTAGC
ATATCAAAAA CACGGGGAAG GATATGGATA ACCGGTGATG TAGATCCTGA AGCAATCGCT
CATACTTTCG GGGTGTATTC CTTTTCTCCG TGTATCATGT TCCCCTTGTC AGATCTCAAT
GAAAGAGTTC TCACATATGT AAAAGAAACC GGTTTCTCTG CATACAAAAC GTTTGCATTA
CGGATTAACC GCTCAGGGAC ACACCCTTTT ACATCACAGG ATCTGGCCCG AACACTCGGA
GCATCCATCC AAAAATCATG GCCATCTATT GCTGTTGACC TCACCAATCC GGAATATGAA
TTACATATTG AGGTCCGGGA TGAACAGTGC TACCTGTATC AGGAGATCAT ATCCGGTCCC
GGGGGAATTC CGCAGGGAGC ATCAGGCACT CTTGTCGCCC TGCATTCCGG AGGAATTGAC
TCGCCGGTTG CCATGTACAT GATGATGAAA CGGGGGACGA TACTTCACCC GGTATACGTG
AAAATCGCTC CATTTCATGA TGATAGTTCT GAAGAACGGG CACATCTGAT TGTAGAACAT
TTGAGGAAAT ATCAGCCGGA TCTAACCCTT GAGGTCATTG ATGATGGGCA TGTGTACGCT
ACCAGGATGG AACTGAAAAA ACGGGACCTT GAAAAGTATG CCTGTGTTCT CTGCAAACGG
CACCTGTACC GGATCGCCGA GCAAAAAGCA CGCGCAATCG GGGCAAAAGG GATTGTCACC
GGTGAGTCAC TGGCACAGGT AGCTTCTCAG ACCCTTGATA ACTTGTACGT CCTGGATGAT
GCGGTCTCAA TGCCGGTATA TCGTCCCCTC ATTGGATTTG ATAAAGAAGA GACCATAGCA
GTCGCAAAAA AAATCGGGAC ATATGATCTA TCAGTCATGC AGGTTCCCAG CTGTTGCTGT
GCCATTCCCT TTAAGCCGGC GACCACATCC CAGCGGGAAA CGATATCTAC CCTTGAAGAG
GAACTAGCAA AAGGATCTCC AGCCAAATAA
 
Protein sequence
MKPDIWLIRY SEIFLKSEPV RRVWEDLLIQ TLKQKLPDCS ISKTRGRIWI TGDVDPEAIA 
HTFGVYSFSP CIMFPLSDLN ERVLTYVKET GFSAYKTFAL RINRSGTHPF TSQDLARTLG
ASIQKSWPSI AVDLTNPEYE LHIEVRDEQC YLYQEIISGP GGIPQGASGT LVALHSGGID
SPVAMYMMMK RGTILHPVYV KIAPFHDDSS EERAHLIVEH LRKYQPDLTL EVIDDGHVYA
TRMELKKRDL EKYACVLCKR HLYRIAEQKA RAIGAKGIVT GESLAQVASQ TLDNLYVLDD
AVSMPVYRPL IGFDKEETIA VAKKIGTYDL SVMQVPSCCC AIPFKPATTS QRETISTLEE
ELAKGSPAK