Gene Mbar_A3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3641 
Symbol 
ID3624585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4686791 
End bp4687945 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content44% 
IMG OID637702474 
Producthypothetical protein 
Protein accessionYP_307085 
Protein GI73671070 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.802577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TGAGAATAGG AATGTTTACC TGGGAAAGTT TATATTCAAT ACGTGTGGGA 
GGTATTTCGC CCCACGTATC CGAGCTATCT GAGGCTCTTG CGGCAGAGGG ACATGAAATC
CACCTGTTTA CACGAGACCG CGAAGATAAA GATGAAGTAA TAAATGGGGT TTATTATCAC
AAAATTGCCT GTGATCAAAG CGGGGGAATT GTTGAGCAAA TGAACCGGAT GTGTGATGCT
ATGTACTGCC GGTTCCTTGA AGTGAGAGAA AGCACAGGAG AGTTTGATGT ATTACATGGT
CACGACTGGC ACCCCGTAAA TGTGCTTTGC AGGATAAAAG CCCAGTTTGG ACTGCCCTTT
GTGCTGACCT TCCACAGTAC AGAATGGGGA CGTAATGGAA ATCATCATGG AGATTGGTGG
GAAGCAAAGG AAATCTCACA TAGGGAGTGG CTCGGAGGCT ATGAATCTTC GGAGATTATC
ATAACCTCGA CCATATTGAA GGAAGAAATC AAACAAATTT ACAAAATCCC TGACTACAAG
CTCTGGGAAA TTCCTAACGG CATAAACGTG GGAAAAATAA GAAGGCAAAT CGACCCTGGT
GATGTGAAAA GGCAATACGG CATCCATCCA TGTGTTCCAG TGGTGCTTTT CACAGGAAGG
ATGTCTTATC AGAAGGGGCC TGACCTGCTG GTGGAAGCTG CTGCTAAAGT CCTGAAGAAG
AGGAATGCAC AGTTTGTGCT GATCGGTGAA GGAGAAATGC GTGCTCATTG TGAATATAGG
GCTCAAAAAC TTGGCATTGG AAATTCATGC AATTTCCTCG GGTACGCTCC AGATAATACT
GTAATAGACT GGTTCAATGC CTGCGACCTC GTGTGTGTGC CCAGCCGGAA TGAACCCTTC
GGAATTGTGG TGCTTGAAGC CTGGGATGCA AAAAAACCTG TAGTTGCAAG TGATGCAGTA
GCCCTTGTGG AAAATTTCAA GACAGGCGTT ATTACTCATA AGGAACCATC TTCTATTGCA
TGGGGCCTTA ATTATGTCCT TGAAGGGCTT GGCCACAACC GGATGGGAGA AAAAGGTTAC
GACCTTATTA AAAAGCGATA TAACTGGAAA ATAATAGCTG GAAAAACTCT CGAGGTATAC
AAAAAAATAA TTTAA
 
Protein sequence
MKKMRIGMFT WESLYSIRVG GISPHVSELS EALAAEGHEI HLFTRDREDK DEVINGVYYH 
KIACDQSGGI VEQMNRMCDA MYCRFLEVRE STGEFDVLHG HDWHPVNVLC RIKAQFGLPF
VLTFHSTEWG RNGNHHGDWW EAKEISHREW LGGYESSEII ITSTILKEEI KQIYKIPDYK
LWEIPNGINV GKIRRQIDPG DVKRQYGIHP CVPVVLFTGR MSYQKGPDLL VEAAAKVLKK
RNAQFVLIGE GEMRAHCEYR AQKLGIGNSC NFLGYAPDNT VIDWFNACDL VCVPSRNEPF
GIVVLEAWDA KKPVVASDAV ALVENFKTGV ITHKEPSSIA WGLNYVLEGL GHNRMGEKGY
DLIKKRYNWK IIAGKTLEVY KKII