Gene Mbar_A2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2234 
Symbol 
ID3624696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp2836353 
End bp2837558 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content47% 
IMG OID637701106 
Producthypothetical protein 
Protein accessionYP_305739 
Protein GI73669724 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00649369 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.770336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAA GTTCAAAACC TGAACTCCTT GTAGGGATCA GGAATCTTTC AGGGCTTAAG 
GCCTGCTCAG GATATGCAGA CGCTGTTTAC TTTTCAACCG ACAGGCTCAG CCTGAGGGCA
AAAGCAAAAG AAATTACTCT GGAGACTCTT GACGATTTTG TCTCTGAGGT AAAAACCAGA
GGGCTCAAGG CCTATCTGGC TGTAAACTCG ACAGTAAATG AGGACAGGCT CGGAGATGCA
TCGGACGTGA TAGCTGCAGC TTCGAATGCA GGAGTAGACG CAGTAATTGC CTGGGACCCG
GCTGTAATTC TCAGGGCACG GAAAGCAGGG CTTAGAATTC ACATCTCCAC GCAGGCAAAT
ATTACCAACC ATGAAACTGC AAATTTCTAC AGAAACCTTG GGGCCGAAAG AATTGTGCTT
TCAAGAGAAC TTTCCCTTGA AGAGATACGG AAGATAAACC AGCAGACTGA GGTAGAAATC
GAGACTTTCG TCCACGGTGC AATGTGCATG GCAGTCTCAG GCCGATGCCA TCTGTCAGCT
TACGTTCTTG GTAAATCCGG AAATTGCGGA GAATGCACTC AGCCCTGCCG CTGGGAATGG
GAACTTCATG GAGAAAACGG GCTTGTTGCT GCAAGCCTTG GAAAGTACCT TCTAAGTGCA
AAAGACCTCT GTATGATAGG GCATATTCCC GAACTTCTTG AAGCGGGAAT AGCTGCCTTT
AAAGTGGAAG GGCGGTTGCG GGATCCTGGA TACCTTGAAA TGGTTTCCCG TTGCTACAGG
GAAGCTATCG ACGCCTGCAT AGAAGGAAAC TATACTCCTG AGAAAATAGA AGCCTGGAGG
CGTGAACTAG CTTCGGTTTA TAATAGGGGT TTTTCAACTG GCTTTTACTT TGGGGTTCCA
GGCCTTGAAG GCTTTTCTCC TGAAAAAGGT ATGAATGCCT CGGAAAAGCA ACGCAGGGCT
GTTGGAATTA TTGAAAATTA TTATACGAAA CAGCAAGCTG CAGCCGTCAG ACTTCTCGAA
GAAGGAATTT CAGTTGGAGA CGAGATCGTA ATTGAGGGAA ATACGACTTA TTTGAGGCAG
CAAGTCCGTT CCCTGAGGAA TAAAGGAGAA ACTGTTGAAA GGGCTGAAAA AGGAGACATG
GTTGGCCTTG CCGTTGAAGG GACCGTTCGA AAAAATGACA GAGTTTTCAG AATACAATTG
AAATAA
 
Protein sequence
MRKSSKPELL VGIRNLSGLK ACSGYADAVY FSTDRLSLRA KAKEITLETL DDFVSEVKTR 
GLKAYLAVNS TVNEDRLGDA SDVIAAASNA GVDAVIAWDP AVILRARKAG LRIHISTQAN
ITNHETANFY RNLGAERIVL SRELSLEEIR KINQQTEVEI ETFVHGAMCM AVSGRCHLSA
YVLGKSGNCG ECTQPCRWEW ELHGENGLVA ASLGKYLLSA KDLCMIGHIP ELLEAGIAAF
KVEGRLRDPG YLEMVSRCYR EAIDACIEGN YTPEKIEAWR RELASVYNRG FSTGFYFGVP
GLEGFSPEKG MNASEKQRRA VGIIENYYTK QQAAAVRLLE EGISVGDEIV IEGNTTYLRQ
QVRSLRNKGE TVERAEKGDM VGLAVEGTVR KNDRVFRIQL K