Gene Mbar_A2944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A2944 
Symbol 
ID3626143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3780681 
End bp3781787 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content45% 
IMG OID637701789 
Producthypothetical protein 
Protein accessionYP_306419 
Protein GI73670404 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.81591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00297728 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGACTGA GCCAAAACCC TGACGGAGTA AGCCAGATCA TCGCCAGCCG ATGGAAAATT 
GGGGCAGCCC TGGCGGTAGC GCTGCTTCTG TACTTTGCGT TTCTAATTCT GCTGCCCCTC
GCAGATGGAA TCGTACTGGG CATAGTCTTT GCTTACATTG CCAGGCCTAT CCGGGTAAAA
TTCAAAAAAC ACAGAAAGGT GGGAGCTCTC GTTGCCAGTC TGTGCATATT CATCCCAATA
GTATTCATTG TTGGGGCAGG TATTGTTGAG ATCCTTAACC AGATCTCCTG GGTTATTGAA
CATCAGACAG CAGTTGCGGC AGCAATTTTG AATTTCATAA ACTCTCTGAA CATTCCAGAT
AAAATCATAG AAAGTATTAA TTCCGCGATC TGGGACCTCT TTACCTCGCT GCTTCCTGCA
GTTGGCAGTA TAGGGCTTCT TTCATATGCC CAGAGTATAG GTCTATTTTT CATTAATTTT
TTAATCTCGA TCATTTTCTG CTATTTTGTA CTTGCTGATG GGGATCGGCT TTACTGCGCA
TTTCTTGGTG TGATCCCAAA AGAGTACAAA GGAGTTGTAA ACTGTTACGC GCATCATCTT
GATATAATCC TTAAAGGAGT TTTCATAGGC AATGCCTACT CTGCTCTGAT AGTAAGCGTA
ACTTCGGTTT TTGTTTTCTA CTCTTTTGGG TTTACCCATG TACTTGCCCT AGCGACCCTT
ATCTTTGTAG CTTCGATAAT TCCCCTTTTT GCCGGGTACA TGGTGCTGGT ACCTCTGGCT
TTAATGCGGT ACTTTGAATC CGGGTTTAGA AGTGCAGCCA TTTTTTTTAC GGTATCCTCC
ATCATTATCT ACGGCCCCCC AGAACTGATT CTCAGGCCTT ACCTGACCAG CTTGAAATCT
AAGATTCACC CAATGCTGCT TATGCTCGCC TTCCTGGGCG GGGCTTTTGT CGGAGGGATT
GCAGGATTTT TTGCAGCCCC TATTCTTCTC GGGGCTCTGG TTGCAGCTTA CAGGGTTTAT
CAGGATCACA CCAATCCCGA AATTACCGAG ACCTGTGCAG ACTTCAAGAA CCTTGGACAT
GCCCATAAGG CTGGTTCGGA AAAGTAA
 
Protein sequence
MRLSQNPDGV SQIIASRWKI GAALAVALLL YFAFLILLPL ADGIVLGIVF AYIARPIRVK 
FKKHRKVGAL VASLCIFIPI VFIVGAGIVE ILNQISWVIE HQTAVAAAIL NFINSLNIPD
KIIESINSAI WDLFTSLLPA VGSIGLLSYA QSIGLFFINF LISIIFCYFV LADGDRLYCA
FLGVIPKEYK GVVNCYAHHL DIILKGVFIG NAYSALIVSV TSVFVFYSFG FTHVLALATL
IFVASIIPLF AGYMVLVPLA LMRYFESGFR SAAIFFTVSS IIIYGPPELI LRPYLTSLKS
KIHPMLLMLA FLGGAFVGGI AGFFAAPILL GALVAAYRVY QDHTNPEITE TCADFKNLGH
AHKAGSEK