Gene Mbar_A2788 details


Gene Information

Locus tag: Mbar_A2788
Symbol:
ID: 3625116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3559308
End bp: 3560483
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 41%
IMG OID: 637701639
Product: hypothetical protein
Protein accession: YP_306269
Protein GI: 73670254
COG category: [S] Function unknown
COG ID: [COG4069] Uncharacterized protein conserved in archaea
TIGRFAM ID:
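
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both are assumptions, though they agree with the values listed here. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3559308, 3560483)
print(length, protein_length_aa(length))  # prints: 1176 391
```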


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000344178
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000371594
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATTG GAATTGTTAT TCATAATGTC CAGCTTATGG ATTCCCCTCA GATAATTAAA 
AATATTCTTA CACTACTCTC CAGGGAAAAC CTTGTCGACG CCTGTCTCTG CGGTACATTG
GGAAAAGTTG CAGTTATTGA TGCCGGTTTG GAAGACTTGA TTGAAATCAA CCAGTTTCTG
AGTCCAAGTG CCTGTATTGG GACTCTCTTT AAATCAAATG ATATGGTATG CCTTCTGAAC
CATGGTAGAG AGCTTAATAC AGGCCGGACT TTTGGGAGGA TCGTAGTCTC GCATGTAGAA
AATCATGATG AAAAGCCCCT TATACAGATA GAAAGGCCCG GTTGTCCCGA TGGTGAAATC
ATCCCATGGA ATCAGGCTGC TGAACTCCAT GCAGAAAAAC TTTCCAAATT GCTTAATCTG
AAAATTTCCC AGCCTCCGCG TCCGATTAAT AGTATTGAAG TCACAAATCA GGGAAGGCGG
GTTCTGAGGA AAATATCAGC CTTTCCCGGG GCATACATAC TTGTTGAGGG GATCATCATC
GGAAAAGCTA CTTCTTCTGA GATCACCCTC ATATCCGAAG ATGGTTTCCT GACCTCAATG
GAAGGAGGAA TTCTAAAAGA GCAAGGTATT GATATTCTCC ATAAGCACGG AGAAAGAGTA
CCTATAGACC TTTCCAGGTC CTGGGTAAAA ACCGGTACTC AGCGGGAAAA TTTCAAAGAC
TGCAAAAATC CTCTTGAAGG CAAAAGTGAA TGTGTGGCAA AAACTGCTAT TTTAAAAAAG
AACTCTTTAA TGGAGACAAC CCCTGAACGC GGAATTAAAG TCATTCTAAT AGACCACTGC
GCAGAGCGTT CACTTGAGAT GATTGAGGGA GCAGACCTTG CTATTACCAT TGGAGACGAC
ACAACTGAGA TTGCAGGAAG TATATTTTCT AGGTTTGGAA TTCCGATAAT TGGAGTTACA
GATGGCGACT GTGACGAACT TGCAGCTTCA GTTACCTATT CTGCGGGTTC TGTGATCCTG
AATTTAAAAT CCGGACAGGA TGACGAGTTC GGTAGACTAA TTCTGCAAGA TATTTTATCA
GGAAAAAAAG TCGCCTTTTT TGAAAATCTG GATAATTTGA AGTTAAGAAT CATGAATCTC
GCAGAAAATT CTCTTGAATC CGTCTTAGAA TATTGA
 
Protein sequence
MKIGIVIHNV QLMDSPQIIK NILTLLSREN LVDACLCGTL GKVAVIDAGL EDLIEINQFL 
SPSACIGTLF KSNDMVCLLN HGRELNTGRT FGRIVVSHVE NHDEKPLIQI ERPGCPDGEI
IPWNQAAELH AEKLSKLLNL KISQPPRPIN SIEVTNQGRR VLRKISAFPG AYILVEGIII
GKATSSEITL ISEDGFLTSM EGGILKEQGI DILHKHGERV PIDLSRSWVK TGTQRENFKD
CKNPLEGKSE CVAKTAILKK NSLMETTPER GIKVILIDHC AERSLEMIEG ADLAITIGDD
TTEIAGSIFS RFGIPIIGVT DGDCDELAAS VTYSAGSVIL NLKSGQDDEF GRLILQDILS
GKKVAFFENL DNLKLRIMNL AENSLESVLE Y
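
The gene and protein sequences can be checked against each other by translating the nucleotide rows. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it assumes the sequence is given in the coding direction and contains only A, C, G and T. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAAGATTG GAATTGTTAT TCATAATGTC CAGCTTATGG ATTCCCCTCA GATAATTAAA"
last_row = "GCAGAAAATT CTCTTGAATC CGTCTTAGAA TATTGA"
print(translate(first_row))  # prints: MKIGIVIHNVQLMDSPQIIK
print(translate(last_row))   # prints: AENSLESVLEY
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values to compare with the protein sequence and the GC content field. The last row can be translated on its own because the 19 preceding rows hold 1140 bases, a multiple of three.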