Gene Memar_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMemar_1558 
Symbol 
ID4847506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanoculleus marisnigri JR1 
KingdomArchaea 
Replicon accessionNC_009051 
Strand
Start bp1562509 
End bp1563783 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content64% 
IMG OID640116251 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001047469 
Protein GI126179504 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.635884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTCA TAGAAGACGC CCGGCGCGGT GTCGTCACTG AAGAGATGCG GATCGTCGCA 
GCAGCGGAGG GGGTTACCGA AGATTTCGTC CGGCGCGGTG TGGCCGAGGG TCACATCGTC
ATTCCGGTCT CCCCCTACCG GAGGGTGAAG ATCTGCGGCA TCGGCGAAGG GCTTCGCACG
AAGGTCAATG CAAGCATCGG GACGTCGACG GATATGGTCG ACGTCGATAT GGAGGTCGAG
AAGGTCCGGC AGGCCGAGCG CGCCGGTGCC GATACGTTGA TGGAACTCTC CACCGGCGGC
GACTTTTTAG AGATCCGGCG CCGGGTCGTC GAGGCGACCA CCCTCTCGGT CGGGTCGGTC
CCGCTCTACC AGGCGTTCAT CGAGGCCGCC CGGAAGCGTG GGGCGGTCGT CCACATGGAG
CCGGACGACC TCTTCCGGAT CACAGCGGAG CAGGCGAAGC TCGGGACCAA CTTCATGGCG
ATCCACACCG GGATCAACTA CGAGACGATG AAGCGCCTCC AGAACCAGGG GCGGCACGGC
GGGCTCGTCT CCCGGGGCGG GGCCTTCATG ACCGCGTGGA TGCTCCACAA CGAGAAGGAG
AACCCGCTTT ATTCGGAGTT CGACTACCTG CTCGAGATCT TGAAGGAGCA CGAGGTCACC
CTCTCGTTCG GCAACGGGAT GCGGGCGGGA GCGGTTCACG ACGCGACCGA CCGCGCCCAG
GTCCAGGAGC TGCTTATCAA CGCGGAACTC GCCGATAAAG CGCATGAACA GGGTGTCCAG
ACGATCATCG AGGGACCGGG GCATATCCCG GTCGACGAGA TCCAGACAAA CGTCGTCCTG
CAAAAAAGGG TCACGAACCG GAGACCGTTC TACATGCTCG GGCCGCTCGT CACCGATATC
GCGCCCGGCT ACGACGACCG GGTGGCCGCC ATCGGGGCCG CGCTCTCCTC CTCCTACGGC
GCCGACTTCA TCTGCTACGT GACGCCGGCC GAGCACCTGG CGCTTCCCAC CCCCGAGGAG
GTCTACGAGG GCGTCATGAG CTCGAGGATC GCCGCCCACG TCGGGGATAT GATCAAACTC
AAGAAGCGGG ACGCCGACCT CGAGATGGGT CATGCCCGCC GCGACCTCGA CTGGGAGCGG
CAGTTCGCGG TCGCGATGAA CCCCGAGCGC GCCCGGGCGA TCCGGGACGA ACGGATGCCG
GCCGATACCG ACGGCTGCAC GATGTGCGGG GACTACTGCG CGCTGAAGAT TGTTGGAAGG
CATTTTAATT TCTGA
 
Protein sequence
MSLIEDARRG VVTEEMRIVA AAEGVTEDFV RRGVAEGHIV IPVSPYRRVK ICGIGEGLRT 
KVNASIGTST DMVDVDMEVE KVRQAERAGA DTLMELSTGG DFLEIRRRVV EATTLSVGSV
PLYQAFIEAA RKRGAVVHME PDDLFRITAE QAKLGTNFMA IHTGINYETM KRLQNQGRHG
GLVSRGGAFM TAWMLHNEKE NPLYSEFDYL LEILKEHEVT LSFGNGMRAG AVHDATDRAQ
VQELLINAEL ADKAHEQGVQ TIIEGPGHIP VDEIQTNVVL QKRVTNRRPF YMLGPLVTDI
APGYDDRVAA IGAALSSSYG ADFICYVTPA EHLALPTPEE VYEGVMSSRI AAHVGDMIKL
KKRDADLEMG HARRDLDWER QFAVAMNPER ARAIRDERMP ADTDGCTMCG DYCALKIVGR
HFNF