Gene Mbar_A3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3501 
Symbol 
ID3624899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4490945 
End bp4492291 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content49% 
IMG OID637702328 
Producthydroxymethylpyrimidine kinase 
Protein accessionYP_306952 
Protein GI73670937 
COG category[H] Coenzyme transport and metabolism
[S] Function unknown 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG1992] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.604187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA AAACCTTAAA AGTAAAAACC CCAATTGTTC TGACTATTGC AGGTTCGGAT 
TCAGGGGGAG GAGCCGGAAT TGCTGCGGAC CTGAAGACTT TTGCCGCTCT TGGAGTACAT
GGAACCTGCG CTATTACATC GGTTACTGCA CAGAATACCA CTGGAGTGCT GAAAACCTTT
GACCTCACTC CTGAAGCTGT CGCCAGCCAG ATTGAAGCTG TCTGCACCGA TATGGAGGTC
GGGTGGGCAA AAAGTGGTAT GCTTGCCTCA TCGGAAATCG TAAGGGAAGT TGCAAAAGAA
ATCAGGAAAT ACGAGCTTTC CCTTGTGCTG GATCCTGTTA TAGCTGCTGA AGCAGGAGGA
AACCTGCTGC GAAAAGAAGC TATCTCTGTC CTTACCGAAG AACTGCTGCC CTTCTGCAAG
GTTACGACGC CCAATGCATC CGAAGCAGGT GAGCTCGCTG GCATGGCTGT CAAAACCCCT
GAGGACGCGA AAATCGCAGC CAGAAAAATT GCGGACCTGG GTGTCGAAGC TGTCATTGTT
ACAGGAGGAC ACCTGAATGC CACTGACTTG ATTTATGAAG CTGATTCTGA GACTTTTACC
CGTGTTCCGG GCACTTTTGT TAAAGGAGGA ACACACGGCA CGGGCTGCAC TTACTCTGCG
GCAGTGACTG CCTTCTTAGC CTCAGGAGAG AACCTGGAAG GAGCTGCAAG GAAAGCAAAG
AAATTCGTTG AACAAGCAAT CCTCAGGAGC AGGCCTGCAG GCAGGGGAGT AAGTCCTGTA
AACCAGCTTG GAGTGGTTCT GGAGCAAAAA GAGCGCTATC TGGTATTAAG AGAATTAAAA
GAAGCAGTTT CGATTCTTGA AGGCAGCCCT GATTTTTCAA AACTGATTCC CGAAGTAGGT
TGTAACATAG GAATGGCTAT TCTTGAAGCT GACAGCTACG AAGACGTTGC GGCCGTCGAA
GGCAGGATAG TAAGGCACAG GGGACGTGCG GTTCCTGTAG GTTGTGTGGA TTTTGGGGCC
AGCCGACATG TAGCAAGGAT TATTCTCGCG TCCCTTCGTT ATGATCCTGA AGTTAGGGCA
GCAATTAACG TAAAATACTC CAGGGAGGCA CTTGCAGCCT GCATAGATAT GAAACTTGAA
ATTTCCTCTT TTGACAGGGC TGAAGAACCA GAAAACTCCA GTACTATGGA CTGGGGTACA
GTCGAAGCAA TCAAAAAGTA CGGAAGTGTG CCGAAAATCA TCTGTGATAA AGGAGGCCAG
GGAAAAGAAC CAATGATCCG CCTGCTCGGG AGATGTGCAA CTGAAGTGGC AAAGCTTGCT
GTGGAGCTTG CAGAAAAAAT ACAGTAA
 
Protein sequence
MTEKTLKVKT PIVLTIAGSD SGGGAGIAAD LKTFAALGVH GTCAITSVTA QNTTGVLKTF 
DLTPEAVASQ IEAVCTDMEV GWAKSGMLAS SEIVREVAKE IRKYELSLVL DPVIAAEAGG
NLLRKEAISV LTEELLPFCK VTTPNASEAG ELAGMAVKTP EDAKIAARKI ADLGVEAVIV
TGGHLNATDL IYEADSETFT RVPGTFVKGG THGTGCTYSA AVTAFLASGE NLEGAARKAK
KFVEQAILRS RPAGRGVSPV NQLGVVLEQK ERYLVLRELK EAVSILEGSP DFSKLIPEVG
CNIGMAILEA DSYEDVAAVE GRIVRHRGRA VPVGCVDFGA SRHVARIILA SLRYDPEVRA
AINVKYSREA LAACIDMKLE ISSFDRAEEP ENSSTMDWGT VEAIKKYGSV PKIICDKGGQ
GKEPMIRLLG RCATEVAKLA VELAEKIQ