Gene Mpal_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0221 
Symbol 
ID7270606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp254320 
End bp255633 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content59% 
IMG OID643568873 
ProductS-layer-like domain-containing protein 
Protein accessionYP_002465330 
Protein GI219850898 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCGA ATAGATATTC ATCAGGTACA GTCTGGGTTC TCTCCCTGCT GCTGCTCATC 
GCCTGCGTGC TGGTCTCTCC AGCCATGGCC GGTACCAAGT ATCTCTCCGG CGGCCCGTCG
CTCTCAGCGG CGGTCACCGG CACCAACGAA CTGATCTCCG GCCAGACCGT GCCATTGCAG
GTGACGGTCC AGAACAGTGG TCTGATCGAC TCCAAGTTCT CCCAGACCGG ACTGGTCGAC
CGGACCGATC TACCGAACAC CGCCAAGACG GTGACGGTCG GGCTTGGTTC CGGTAGCGCA
CCGGTCACGA TCCAGTCGGA TCCGCAGATG ATCGGGGATA TCCTCGGAGG TGCTTCCGGG
CAGTCCAAGT TCAATGTTAA GGTCGAAGCC GATGCTCCAT CAGGCACCTA CACCCTGCCG
GTCTCAGTGA ATTACACCTA TCTCGAGTCT GCAGAACAGG TCGGGACCGA TTCGCTGAAC
TATAACTATG TGACCAAGAG CCTGATCATC CCCCTGACCG TCACGATCAG GTCCGAAGTG
ATCGTCGACG TCCAGAAGAT CTCGGCAGAG CAGTTGAACG TCGGCACTGA GGGATATCTG
AACCTGACCC TGCAGAACAC CGGGAACGAG AATGGTAAGA ATGCCATCGT GAAGATCGTC
AGAAACGGTG CCAGCCCGAT CACCCCGACC GACTCCTCGG TCTACATCGG TGACTTTGCA
AAGGGCGCCG TCGTGAACTG CAGGTACCGG GTCGCGGTCT CCACCGAGGC AGCCGCCCAG
ACCTACCCGG TCGACGTCAT CGTCGCCTAT GAGGACCATG ACGGGATTAA CAGGACCTCC
CGGCTCCAGA CGATCGGCGT CCCGATCGGC GGCAAGATAG ACTTCAAGGT CAGCTCTGAG
GCACCATCGA TCAACCCCGG CCAGAAGAAG GTGCTCGATG TCCAGTACAC CAACGTCGGT
GCGACCACCG TCTACAGCGC CCAAGCCCGA CTCTCAGCGG TGGACCCGTT CACCTCCAAC
GATGACACGG CCTACCTTGG GGATATAAAG CCCGGCGACT CGGTGATGGC ACACTTCGAG
GTATCGACCA CATCAGACGC GACCATCAAG CAGTACGGCC TCGACTCTGA GATACGGTAC
CGCGATGCAC TCGACAACTC CCAGATCTCG GATACCATGA AGGTCCCGGT GAACGTCGTG
GCCAAGACAG GGACCAGTGC AATCCTCGGC AACCCGATTA TCCTTGCCGT GATCGCGGCC
ATCATAATCG GTGTCGGCTA CTTCCTCTAC ACCAGGAAGA AGGGGTCAGC GTGA
 
Protein sequence
MSANRYSSGT VWVLSLLLLI ACVLVSPAMA GTKYLSGGPS LSAAVTGTNE LISGQTVPLQ 
VTVQNSGLID SKFSQTGLVD RTDLPNTAKT VTVGLGSGSA PVTIQSDPQM IGDILGGASG
QSKFNVKVEA DAPSGTYTLP VSVNYTYLES AEQVGTDSLN YNYVTKSLII PLTVTIRSEV
IVDVQKISAE QLNVGTEGYL NLTLQNTGNE NGKNAIVKIV RNGASPITPT DSSVYIGDFA
KGAVVNCRYR VAVSTEAAAQ TYPVDVIVAY EDHDGINRTS RLQTIGVPIG GKIDFKVSSE
APSINPGQKK VLDVQYTNVG ATTVYSAQAR LSAVDPFTSN DDTAYLGDIK PGDSVMAHFE
VSTTSDATIK QYGLDSEIRY RDALDNSQIS DTMKVPVNVV AKTGTSAILG NPIILAVIAA
IIIGVGYFLY TRKKGSA