Gene Memar_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMemar_0053 
Symbol 
ID4848266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanoculleus marisnigri JR1 
KingdomArchaea 
Replicon accessionNC_009051 
Strand
Start bp50759 
End bp52039 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content71% 
IMG OID640114724 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001045970 
Protein GI126178005 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATACCC TGATCTCCGC GTGCCTGCGC GGGGTTCCCC CCGAGGTGGA GACGATCGCC 
CGGGAAGAGG ATCTCCCCCC CCACGCCGCC GCGCGGGCCG TCGCCCGCGG CCGGATAGTC
ATTCCCGCAA ACCCCGCGAG ACCGCACCGG CTCTGCGCCA TCGGGGAGGG CTGCAGGGTG
CGGGTCAACG TGAACGTCGG GACGTCGGGA ACACGGTGCG ACGAGGATCT CGAGGCCGAG
AAGGCGAAGG CGGCCCTCCG GGAGGGGGCG GACGCGCTGA TGGACCTCTC TACCGGCGGC
GATCTCGTCC GCATCCGGCA GAGGATCCTC GAACTCGATG CGCCGGTCGG CACCGTCCCG
GTCTACGAGG CGGTCCGGCG GGCGGGGAGT GCGGCGGACG TCGACGCCGA TCTGTTGTTC
AAGGTGATCC GGGAGCACTG CCGGCAGGGC GTGGACTTCC TGACGCTGCA CTGCGGCGTG
AACCGCGACG CGCTCGCGTC GCTCAAGGCC GACCCCCGGA CGATGGGCGT TGTTTCCCGG
GGCGGGGCGT TCCACGTGGC GATGATGGCC GCGACGGGCG AGGAGAATCC CCTTTACGCG
GAGTACGACT ACCTGCTCGA GATCCTCAGC GAGCACGACG TCGTCGTGAG CCTCGGCGAC
GGGATGCGGC CGGGCGCGCT CGTGGACGCC GGCCGCCTCG CGAAGTCGAC CGAGTACCTG
ACGCTCGGCC ACCTCGCGAA ACGGGCGCTC GCCGCCGGGG TGCAGCGGAT GATCGAGGGG
CCGGGGCATA TCCCGGCCGA CCAGGTCGGC TACAACGTCC GGATGATCAA GGAACTGACC
GACGGCGCTC CGCTCTACCT GCTCGGCCCG CTCGTCACCG ACGTGGCGCC GGGCTACGAC
CACGTCGTGG GGGCGATCGG GGGCGCGATC GCCTGCATGA ACGGCGCCGA CTTCCTCTGC
ATGGTCTCGC CGGCCGAGCA CCTGGCGCTG CCGGACGTCC GCGACATCGT GGAGGGGACG
CGGGTGGCGA AGATCGCGGC GCACGTCGGG AGCCTCTCCC GCGCCGCCGC ACACACGAAG
AACCGCGAGA TCCGGATGGC GGAGGCGCGG CGGGCGCTCG ACTGGGAGAA ACAGTTCGAG
GCGGCGCTCG CTCCCGAGGA GGCGCGGCGT ATCCACGAGC GCGACGGCGA GATCGAGACC
TGCTCGATGT GCGGCGACCT CTGCGCCGTG AAGATGGTGC GGGATATCCT CCCGGTGCCG
GAAGAACGGA TGGAGCCGTG A
 
Protein sequence
MHTLISACLR GVPPEVETIA REEDLPPHAA ARAVARGRIV IPANPARPHR LCAIGEGCRV 
RVNVNVGTSG TRCDEDLEAE KAKAALREGA DALMDLSTGG DLVRIRQRIL ELDAPVGTVP
VYEAVRRAGS AADVDADLLF KVIREHCRQG VDFLTLHCGV NRDALASLKA DPRTMGVVSR
GGAFHVAMMA ATGEENPLYA EYDYLLEILS EHDVVVSLGD GMRPGALVDA GRLAKSTEYL
TLGHLAKRAL AAGVQRMIEG PGHIPADQVG YNVRMIKELT DGAPLYLLGP LVTDVAPGYD
HVVGAIGGAI ACMNGADFLC MVSPAEHLAL PDVRDIVEGT RVAKIAAHVG SLSRAAAHTK
NREIRMAEAR RALDWEKQFE AALAPEEARR IHERDGEIET CSMCGDLCAV KMVRDILPVP
EERMEP