Gene Mbar_A1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1003 
Symbol 
ID3626886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1224416 
End bp1225387 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content45% 
IMG OID637699892 
Producthypothetical protein 
Protein accessionYP_304551 
Protein GI73668536 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA ATAAATTAAA AATCGGCAAA ACAGAAACAC CTGGGAACCT TCTTCTTGCG 
CCTATGGCAG ACGTGACAAA TCTGGCTTTC AGGCTGCTCT GCAGGCAGAA TGGAGCTGAC
CTTACATATA CTGAGATGAT CAGCGCAGAT GCCTTGCTCA ACGAAAACCG AAAATCGCTT
CTCAAAGGAC TGTCTTCCCC TGAAGACCGA CCTTTTGGAG TTCAGCTTGT AGGAAGTTCT
CCTGAGAAAC TGAGGGAAGC TGCGCTCTTT ATTGAAGATG AGTACAGACC TGAACTCATC
GATGTAAATA TGGGCTGCCC TGCAAAGCGT ATTACTGGAA CAGGGTGCGG CTCGGCTCTT
CTCAACTCCA AGAAACTCAT ATATGAAATT ATCTCCGATC TGACCGATGT TCTGAAGACT
CCTGTAACTG CAAAGATCCG CATTCTGAAG AGGGATGAGA AAACCCTTGA GATTGCACGC
CTGATCGAAG AGGCTGGAGC TTCTGCCCTG ACCATACATG GCAGGAGGGC AGAGCAAATG
TACTCCGGAA GTTCAGACCT TACAGTGATA AGAGCCGTTA AACAGGAACT TTCCATTCCA
GTAATTGCAA ACGGGGATAT AAGGAACGAA GAGTCTGCTG AAGCTGCCCT TGATTTTACC
GAATGTGACG GGCTTATGAT CGGACGCGCA GCAATGGGAA ATCCCTTTAT CTTCAAAAGG
ATAAGGCATT ACCTAGAAAC CGGGGAAAGG CTGGAATTTG ACAGGCAAGT TCGTCAATTA
GAGGACTTTG AGAATTATAT CGCTTTACTT GAAGAATATG ATCTCCACGC GTCTACAAAT
ATAAGAATGC ATGCTCACTG GTTTACAAAA GGGTTGCGCG GCTCGCGGCA GATTAGGGAA
AAGATTAATA ATCTGAAAGA TGGAAAGGCG ATTGTTGAGT TAATAAAAGA TTTCCATACG
GAGAATTATT AA
 
Protein sequence
MKLNKLKIGK TETPGNLLLA PMADVTNLAF RLLCRQNGAD LTYTEMISAD ALLNENRKSL 
LKGLSSPEDR PFGVQLVGSS PEKLREAALF IEDEYRPELI DVNMGCPAKR ITGTGCGSAL
LNSKKLIYEI ISDLTDVLKT PVTAKIRILK RDEKTLEIAR LIEEAGASAL TIHGRRAEQM
YSGSSDLTVI RAVKQELSIP VIANGDIRNE ESAEAALDFT ECDGLMIGRA AMGNPFIFKR
IRHYLETGER LEFDRQVRQL EDFENYIALL EEYDLHASTN IRMHAHWFTK GLRGSRQIRE
KINNLKDGKA IVELIKDFHT ENY