Gene Mboo_1595 details


Gene Information

Locus tag: Mboo_1595
Symbol:
ID: 5410909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1663678
End bp: 1664955
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 59%
IMG OID: 640868829
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001404755
Protein GI: 154151137
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
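
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1663678
end_bp = 1664955

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1278 425
```

Under these assumptions the computed values agree with the listed Gene Length (1278 bp) and Protein Length (425 aa).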


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.753357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.11929
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATCA TTGAGGATGC CCGGCACGGC ATCGTAACAG AAGAGATGAA ACAGGTCGCG 
AAAGCCGAAG GCGTGACCGA GGACTTTATC CGGCGCAGCG TCGCCGAAGG CCACATTGTT
ATCCCGGTCT CGCCCTACCG CAAGGTGAAG ATCTGCGGTA TCGGCGAGGG CCTGCGCACC
AAGGTGAACG CCTCTATCGG GACCTCCACC GATATTGTCA ATATCCCTGA AGAGATCGAG
AAGGCAAAAC AGGCAGAACG GGCCGGCGCC GACACGCTTA TGGAGCTCTC CACCGGTGGG
GACTTTGCCG ATATCAGGAG ACAGATTATC GCCAATACCA CGCTCTCTGT CGGTAGTGTC
CCCCTTTACC AGGCATTCAT CGAGGCCGTG AAAAAGGACG GGGCCGTCAT CCACATGAAA
GAGGACGACC TCTTCCGGAT CACGGCCGAG CAGGCAAAAC TGGGCACGAA CTTCATGGCG
ATCCACACCG GCATCAACTG GGAGACGGTA AAGCGTCTGC GCAACCAGGG CCGGCACGCG
GGGCTTGTCT CCCGGGGCGG TGCGTTCATG ACCGCATGGA TGCTCCATAA CGAAAAGGAA
AACCCGCTCT ACTCCGAGTT CGACTACCTC ATGGAGATCA TGAAGGAGCA CGAGGTCACC
CTCTCCATGG GTAATGGAAT GCGGGCAGGA GCCATCCACG ATGCCACCGA CCGGGCCGGC
ATCCAGGAAC TCCTCATCAA TGCAGAGCTT GCCGACAAGG CGCATGCAAA GGGCATCCAG
GTGATTGTCG AGGGTCCGGG CCACGTGCCG ATCGATGAGA TCGCCACAAA CGTCCAGCTC
ATGAAGCGGG TCACCAACAA CAAGCCGTTC TACATGCTCG GCCCCATTGT CACCGACATT
GCGCCGGGTT ACGATGACCG GGTCTCGGCC ATCGGGGCCG CCATCTCCTC ATCGCTCGGC
GCCGACTTCA TCTGCTACGT CACCCCCGCA GAGCACCTTG CGCTCCCGAC GCCCGAAGAG
GTGTACGAGG GTGTCATCAG TTCGCGGATT GCGGCCCATG TCGGGGATAT GGTCAAGCTT
AAGAAAGTCC GGGAAGCCGA TCTCGAGATG GGCCATGCCC GGCGCGATCT CGACTGGGAA
CGCCAGTTTG CGGTTGCCAT GAACCCGGCC CGGGCCCGGA AGATCCGCGA AGAGCGGATG
CCGGCCGACA CTGACGGCTG CACCATGTGC GGTGACTTCT GTGCCATTAA GATCGTTAAC
CGCTACTTTA AATTCTAA
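
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. The following is a hedged sketch of how a GC percentage such as the one in the Gene Information section could be computed; `gc_percent` is a hypothetical helper, not part of the source database, and the database's own method (rounding, handling of ambiguous bases) is not documented here.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from the record as an example input.
first_line = "ATGAGTATCA TTGAGGATGC CCGGCACGGC ATCGTAACAG AAGAGATGAA ACAGGTCGCG"
print(round(gc_percent(first_line), 1))
```

Passing the full 1278 bp sequence instead of the first line would give the figure for the whole gene.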
 
Protein sequence
MSIIEDARHG IVTEEMKQVA KAEGVTEDFI RRSVAEGHIV IPVSPYRKVK ICGIGEGLRT 
KVNASIGTST DIVNIPEEIE KAKQAERAGA DTLMELSTGG DFADIRRQII ANTTLSVGSV
PLYQAFIEAV KKDGAVIHMK EDDLFRITAE QAKLGTNFMA IHTGINWETV KRLRNQGRHA
GLVSRGGAFM TAWMLHNEKE NPLYSEFDYL MEIMKEHEVT LSMGNGMRAG AIHDATDRAG
IQELLINAEL ADKAHAKGIQ VIVEGPGHVP IDEIATNVQL MKRVTNNKPF YMLGPIVTDI
APGYDDRVSA IGAAISSSLG ADFICYVTPA EHLALPTPEE VYEGVISSRI AAHVGDMVKL
KKVREADLEM GHARRDLDWE RQFAVAMNPA RARKIREERM PADTDGCTMC GDFCAIKIVN
RYFKF
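
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch, assuming the standard codon assignments (translation table 11 uses the same amino-acid assignments as the standard code and differs in which start codons are permitted) and that translation stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAGTATCA TTGAGGATGC CCGGCACGGC"))  # prints: MSIIEDARHG
```

The output matches the first ten residues of the protein sequence, and the final 18 bases of the gene (`CGCTACTTTA AATTCTAA`) translate to the closing `RYFKF` followed by a stop codon.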