Gene Mpal_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1201 
Symbol 
ID7271479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1234488 
End bp1235786 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content58% 
IMG OID643569838 
ProductS-layer-like domain-containing protein 
Protein accessionYP_002466262 
Protein GI219851830 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGTTTC GTAAGTTTTT CATTATTCCA CTCCTAATCG TACTCATCGC CTGCGTGCTG 
GTCTCTCCAG CCATGGCCGG TACCAAGTAT CTCTCCGGCG GCCCGTCGCT CTCAGCGGCG
GTCACCGGCA CCAACGAACT GATCTCCGGC CAGACCGTGC CATTGCAGGT GACGGTCCAG
AACAGTGGTC TGATCGACTC CAAGTTCTCC CAGACCGGAC TGGTCGACCG GACCGATCTA
CCGAACACCG CCAAGACGGT GACAGTCGGG CTTGGTTCCG GTAGCGCACC GGTCACGATC
CAGTCGGATC CGCAGATGAT CGGGGATATC CTCGGAGGTG CTTCCGGGCA GTCCAAGTTC
AATGTTAAGG TCGAAGCCGA TGCTCCATCA GGCACCTACA CCCTGCCGGT CTCAGTGAAT
TACACCTATC TCGAGTCTGC AGAACAGGTC GGGACCGATT CGCTGAACTA TAACTATGTG
ACCAAGAGCC TGATCATCCC CCTGACCGTC ACGATCAGGT CCGAAGTGAT CGTCGACGTC
CAGAAGATCT CGGCAGAGCA GTTGAACGTC GGCACTGAGG GATATCTGAA CCTGACCCTG
CAGAACACCG GGAACGAGAA TGGTAAGAAT GCCATCGTGA AGATCGTCAG AAACGGTGCC
AGCCCGATCA CCCCGACCGA CTCCTCGGTC TACATCGGTG ACTTTGCAAA GGGCGCCGTC
GTGAACTGCA GGTACCGGGT CGCGGTCTCC ACCGAGGCAG CCGCCCAGAC CTACCCGGTC
GACGTCATCG TCGCCTATGA GGACCATGAC GGGATTAACA GGACCTCCCG GCTCCAGACG
ATCGGCGTCC CGATCGGCGG CAAGATAGAC TTCAAGGTCA GCTCTGAGGC ACCATCGATC
AACCCCGGCC AGAAGAAGGT GCTCGATGTC CAGTACACCA ACGTCGGTGC GACCACCGTC
TACAGCGCCC AGGCCCGACT CTCAGCGGTG GACCCGTTCA CCTCCAACGA TGACACGGCC
TACCTTGGGG ATATAAAGCC CGGCGACTCG GTGATGGCAC ACTTCGAGGT ATCGACCACA
TCAGACGCGA CCATCAAGCA GTACGGCCTC GACTCTGAGA TACGGTACCG CGATGCACTC
GACAACTCCC AGATCTCGGA TACCATGAAG GTCCCGGTGA ACGTCGTGGC CAAGACAGGG
ACCAGTGCAA TCCTCGGCAA CCCGATTATC CTTGCCGTGA TCGCGGCCAT CATAATCGGT
GTCGGCTACT TCCTCTACAC CAGGAAGAAA CAGAACTAA
 
Protein sequence
MKFRKFFIIP LLIVLIACVL VSPAMAGTKY LSGGPSLSAA VTGTNELISG QTVPLQVTVQ 
NSGLIDSKFS QTGLVDRTDL PNTAKTVTVG LGSGSAPVTI QSDPQMIGDI LGGASGQSKF
NVKVEADAPS GTYTLPVSVN YTYLESAEQV GTDSLNYNYV TKSLIIPLTV TIRSEVIVDV
QKISAEQLNV GTEGYLNLTL QNTGNENGKN AIVKIVRNGA SPITPTDSSV YIGDFAKGAV
VNCRYRVAVS TEAAAQTYPV DVIVAYEDHD GINRTSRLQT IGVPIGGKID FKVSSEAPSI
NPGQKKVLDV QYTNVGATTV YSAQARLSAV DPFTSNDDTA YLGDIKPGDS VMAHFEVSTT
SDATIKQYGL DSEIRYRDAL DNSQISDTMK VPVNVVAKTG TSAILGNPII LAVIAAIIIG
VGYFLYTRKK QN