Gene Mbar_A3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3090 
Symbol 
ID3625221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp3975881 
End bp3976969 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content36% 
IMG OID637701927 
Producthypothetical protein 
Protein accessionYP_306557 
Protein GI73670542 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0391882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0374194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA AGGAAGTAAT TAAAAAATAT AAACAATTTA AGATCCCATC TTTATCTATA 
ATTTCTGAGA AGGCAAAACA ATACTCAAAG ACTTCTCTAT TAATTGTTGT TATCTTTGCA
GTGTTTTTGG TTTTAGCACT GCAATATGTT CCGCACTGGC AAGTCGCTCA ATTTGGAATT
ATGAACCCAA AATACTTAGC TGAGATGGAA AATAACTATC GTGCTACATT AGCTCAAATA
TTTGGTGGCG TTGCTGTTGG GATTGGTATT TATTTTGCTT GGGGAAACCT TACAACTACT
AGAGAAGGAC AGATAACTGA ACGTTTCACT AGGGCTGTTG ATCAGCTAGG AAATGAAAAT
ATGGAGATTC GTTTAGGTGG AATATATGCA CTTGAAAGAA TCTCAAAAGA GTCTGAAAAA
GACTACTGGC CAATCATGGA GATTTTAACC GCTTATGTTA GAAAGAATTC CAGTGTTGAA
GTAATTGAAA ACGTTGAAAC CCAAACGATG GCATCATCAG ATATTCAAGC AGTTCTTACT
GTCATTGGAA GGCGCAAATA TTCTTTTCAG TCTGGAGAGC CTAGTTACTT GGATTTACAT
GGAACTTATT TAGAGGGGGT TAACCTTAAT GGGGTTAATC TTGAACGAGC GAATTTCACA
GGGGTTAACC TTAATAGAGC TAACCTTGAA AGGGCTAACT TTAAATACGC TAAACTTGAT
GGAGCAAATC TTGAAGGAGC TAGTCTTTCA TATGCTAACC TTGAAAATGC TAACCTTATA
AAAACTAACC TTATCTTCGC TCAACTCTAT GGAGCTAACC TTAAATTGGC TAATCTTTCA
TCTGCTTACC TAATATATGC TAACCTTAGA AAGACTAACC TTGAATGGGC TTTTCTTAAA
TGTGCTTACT TTAGTAAGGC TATTCTTGAA GGAGCTTGTC TTGAAAAAGC TTACCTTGTT
GAGACTATAG GCTTATCAGT TGACCAGCTT TCTAGAGTAA AAACACTTTA CAATACAGAA
TTAGACGAAG AGCTGGAGAT ACCATTGAGA GAGAGGTACC CTGCTCTTTT TGAGAAACCT
GATGAATGA
 
Protein sequence
MDFKEVIKKY KQFKIPSLSI ISEKAKQYSK TSLLIVVIFA VFLVLALQYV PHWQVAQFGI 
MNPKYLAEME NNYRATLAQI FGGVAVGIGI YFAWGNLTTT REGQITERFT RAVDQLGNEN
MEIRLGGIYA LERISKESEK DYWPIMEILT AYVRKNSSVE VIENVETQTM ASSDIQAVLT
VIGRRKYSFQ SGEPSYLDLH GTYLEGVNLN GVNLERANFT GVNLNRANLE RANFKYAKLD
GANLEGASLS YANLENANLI KTNLIFAQLY GANLKLANLS SAYLIYANLR KTNLEWAFLK
CAYFSKAILE GACLEKAYLV ETIGLSVDQL SRVKTLYNTE LDEELEIPLR ERYPALFEKP
DE