Gene Mbar_A2754 details


Gene Information

Locus tag: Mbar_A2754
Symbol:
ID: 3624998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3501288
End bp: 3502313
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 39%
IMG OID: 637701607
Product: cell surface glycoprotein (S-layer protein)
Protein accession: YP_306237
Protein GI: 73670222
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3420] Nitrous oxidase accessory protein
TIGRFAM ID:
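
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon. Both are assumptions, though they agree with the values in this record. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3501288, 3502313)
print(length, protein_length(length))  # 1026 341
```

Under those assumptions the result matches the Gene Length (1026 bp) and Protein Length (341 aa) fields.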


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00332846
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00111806
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGAA AATCAATCCA AAACATCTGC ATAATCTTTC TTCTTGTGCT GGGTGCAACA 
GCTACTCAGG TAAGTGCTGC GGTTCTTACC GTAGGCGTTA AAGGGGGAGA AAATTATACT
TCGATTCAAG AAGCTGTCAA TAATTCACAG AACGGGGATA CAATTGTTGT AAGCCCTGGT
ATATACATAG AAAATGTAAA CGTGAATAAA GAAATTGCAA TTATCTCGAA AACTGATGTT
TCGGGCGACA GGCTGAACCG TACCTATGTA ATAGGCGCAG TTCCGACAAA TGACGTCTTC
AGTATTAACT CAAATAACGT GAAAATAACC GGTTTCCATA TTATGGGAGG TCCTTCAGGA
ATTAATGCTT ATCAGGAAGT CGGGCTTTAC CTTGAAGGTG TGCAAAACTG CTCCTTAAGT
AACAATACCC TGATGTTGAA TGATGTTGGC ATTGGCCTCA ACAATTCTCA AGGCAATTTC
CTTGACAACA ATCGGATAGG TCTTGGATCT ACAGGAATCA TTCTTTCTAG ATCAAACGAA
AACAAGTTGT CAAACAACCT GGTAGAGACA AATGACGAGG GAATCCTTCT GGATAATTCT
ACTAACAATA CTCTCATGAA TAACACTGCA GAATCAAATG ATATAGGAGT CCTCCTTGCT
ACTTCAAAAA CTAACACGCT TGGATACAAC TCCATTTCAA GAAACAGTTA TGGAATAGTC
CTTGAAGATA TGGCAGAATC TAATACTTTG ACTAATAACA GCCTGTACAT GAATGGTCTT
GGAATGTACC TTAGAGGGTC CACCGGAAAT ATGATTTCTC TCAATAAATT CTTCAACTTC
ATCAATGCCG TAGATGAAGG AACAAATTCC TGGAACAGCA GTTCAGCAGG CAATAAGTGG
AAAGATTATA ATGGAACAGA TGCTGACGGA AACGGTATAG GAGACACTCC TTATGTTGTT
AACCAGACAA CCGGAAGCAT AGATTACATG CCTCTGGCAA ATAACGTTTC TTCAGGTAAT
CAATGA
 
Protein sequence
MKRKSIQNIC IIFLLVLGAT ATQVSAAVLT VGVKGGENYT SIQEAVNNSQ NGDTIVVSPG 
IYIENVNVNK EIAIISKTDV SGDRLNRTYV IGAVPTNDVF SINSNNVKIT GFHIMGGPSG
INAYQEVGLY LEGVQNCSLS NNTLMLNDVG IGLNNSQGNF LDNNRIGLGS TGIILSRSNE
NKLSNNLVET NDEGILLDNS TNNTLMNNTA ESNDIGVLLA TSKTNTLGYN SISRNSYGIV
LEDMAESNTL TNNSLYMNGL GMYLRGSTGN MISLNKFFNF INAVDEGTNS WNSSSAGNKW
KDYNGTDADG NGIGDTPYVV NQTTGSIDYM PLANNVSSGN Q
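
The sequence blocks above are printed in groups of ten characters, so they need whitespace removed before any analysis. The sketch below shows one way to do that in plain Python, to compute a GC fraction, and to translate a CDS. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1; the two differ only in their permitted start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three groups of the gene sequence listed above.
fragment = clean_sequence("ATGAAAAGAA AATCAATCCA AAACATCTGC")
print(translate(fragment))  # MKRKSIQNIC
```

The translated fragment agrees with the first ten residues of the protein sequence. Running the same functions on the full gene sequence would allow the GC content and Protein Length fields to be checked as well.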