Gene Mpal_2210 details


Gene Information

Locus tag: Mpal_2210
Symbol:
ID: 7270296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2357547
End bp: 2359223
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 59%
IMG OID: 643570825
Product: peptidase S41
Protein accession: YP_002467229
Protein GI: 219852797
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID:
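
The coordinates, gene length, and protein length in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 2357547
end_bp = 2359223
gene_length_bp = 1677
protein_length_aa = 558

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

coords_consistent = (span_bp == gene_length_bp) and (span_bp % 3 == 0)
protein_consistent = expected_protein_aa == protein_length_aa

print(span_bp, expected_protein_aa, coords_consistent, protein_consistent)
```

Under these assumptions the span is 1677 bp, which is 559 codons, or 558 amino acids plus the stop codon, in agreement with the table.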


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.185176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.201668
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATG GCCCACTGCT TATCGCCCTG CTTCTCCTTG CCGTCCCTGC TTTGGCAGAG 
GTGATACCTG CTCCCCAGCC CACAGGATCG ATCGTATACG CGGACAATGG GACGATCTGG
AACATCTCCT CCGCCCCGTT CGAATGGAAC AATGAGACTG CGTACAACGA AGATCAGTTC
CAGATATTTG CACCCATCAT GGCGAACCTG ACCTATCCGG CGAACCTGAC CAATATGTCA
TGGAGTGAGG CCTTTAGCGC CGCCAATTCC CTCATGGAGG AGCGGTACGC CTTCTCCGGC
TTGAAGGCGG TGGACTGGGA TGCCCTCTAC CAGACCTATG CGCCAGCTGT CGCTGCGGCA
GAGAAGGATC AGGACAAAGC CGCGTACTTC CGGGCCCTGC GGGGGTACAT CTATGCCATC
CCGGACGGGC ATGTGTCGCT CCTCCCGGCT GAAGGCGATT TCGGGGCGAA GTACGCAGAC
ATTGGCGGTG ATTACGGGGT CGCAGTCACC CGGCTCGATT CCGGGACCGT GATCGTGAGT
TTCGTTGCGA ATAGGAGCCA GGCCGAGCAG GCAGGGATCC GGTTCGGCGA TGTTGTCACC
GCATGGAACG GAAGGGAGAT TTTGGATGCT ATCAACACGA CGTCGTACAT CTGGGCGGTC
AAGAAACCCT CGACGGCCGA GGGGATTCGG CTCCACCAAC AACGGCTTCT GACCCGGGGA
CCGATCGGGT CGACAGCTAC GATCACGATC AGTAACAGCA CGGCGTCTCT CCCCCGTACT
GTCACGCTGA CCGCGTATGA CGACGGGTAT GCAAACCTGA AGCAGACGTC CTGGTTCCTC
GGGATTCCGG TCAACGATTA TGGTGTGGAT CGATCCTGGG AGGATATCCT CCCCCAGATC
AGCAACGAGA CCGTAACGGT CCGCACGCTG CCCGGTAATT ATACCTACAT CGCGGTCTAC
AATGAGGAGT ACGATGTCTA CCAGCCGTTC AAGGCGGCCA TGCAGAATGC GATCGCGAAC
AACACTCCGG GTATCGTGTT CGACCTCAGG TTCAACAGTG GTGGCGATGA TAGTCTCGCT
GCCTGTTTTG CCAGTTGGTT CGTCAAAAAG CCAGCATTCT ACGAGTATGC GACGAAGTAC
GACCCCGGCA ATGGTAGGTT TACCACCCTC TCGGAGGCGT GGTCGATACC GAGAGCAGAC
GGCTACAGCG GGCCGGTCGC GGTCCTTGTG AGTCCGGATA CCATCAGTTC GGGAGAGGGG
GTTCCCATGA TCCTGAATCG GACCGGACGG GGTGAGGTCA TCTCGTTCTA TGGGACGAAC
GGGGCGTTTG GGATGAACAA TGTACAGGCG TCTTTACCGC TGGGCCTGTC ACTCTACTTC
CCTGACGGTG CTTCTCTCGA CCTGAATGGT ACGATCCAGG ACGACAGCAA CGCCGGGCTG
ACGGGGGGAG TCTCGCCAGG AATCAGGGTG CCAATCAACG AGGATACTGT GGCCCGGTCG
ATGGCGGGGG AGGATGTCCA GCTCACATAC GCCCTTCAGT GGCTGAAGGG GCAGGGGAAC
CAGACCTCTG CCTCAAAACC GTCTTTATCG TCAATCCCGG CGCGAAAGAC GTCGCTCGAT
TTCACCGCCG CGCTTGGTGC GCTGGGCATT CTCGTCATGG TTGCCGGACG GAAGTGA
 
Protein sequence
MKYGPLLIAL LLLAVPALAE VIPAPQPTGS IVYADNGTIW NISSAPFEWN NETAYNEDQF 
QIFAPIMANL TYPANLTNMS WSEAFSAANS LMEERYAFSG LKAVDWDALY QTYAPAVAAA
EKDQDKAAYF RALRGYIYAI PDGHVSLLPA EGDFGAKYAD IGGDYGVAVT RLDSGTVIVS
FVANRSQAEQ AGIRFGDVVT AWNGREILDA INTTSYIWAV KKPSTAEGIR LHQQRLLTRG
PIGSTATITI SNSTASLPRT VTLTAYDDGY ANLKQTSWFL GIPVNDYGVD RSWEDILPQI
SNETVTVRTL PGNYTYIAVY NEEYDVYQPF KAAMQNAIAN NTPGIVFDLR FNSGGDDSLA
ACFASWFVKK PAFYEYATKY DPGNGRFTTL SEAWSIPRAD GYSGPVAVLV SPDTISSGEG
VPMILNRTGR GEVISFYGTN GAFGMNNVQA SLPLGLSLYF PDGASLDLNG TIQDDSNAGL
TGGVSPGIRV PINEDTVARS MAGEDVQLTY ALQWLKGQGN QTSASKPSLS SIPARKTSLD
FTAALGALGI LVMVAGRK
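
The relationship between the gene and protein sequences can be sketched with a small translation routine. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand using the amino acid assignments of translation table 11, which are the same as those of the standard code. The example translates only the first 60-base line of the gene sequence.

```python
# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same amino acid assignments as the standard code; it differs in which codons
# may act as start codons, which does not matter here because the gene
# starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAAATATG GCCCACTGCT TATCGCCCTG CTTCTCCTTG CCGTCCCTGC TTTGGCAGAG"
print(translate(first_line))  # MKYGPLLIALLLLAVPALAE
```

The 20 residues produced match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content listed in the Gene Information table.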