Gene MmarC5_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC5_1038 
Symbol 
ID4928665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C5 
KingdomArchaea 
Replicon accessionNC_009135 
Strand
Start bp999866 
End bp1001473 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content34% 
IMG OID640166535 
ProductS-layer protein 
Protein accessionYP_001097558 
Protein GI134046072 
COG category 
COG ID 
TIGRFAM ID[TIGR01564] S-layer protein, MJ0822 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGAA TGGGTGAAAT TTTGAAATCG ATTCTAAAAA AAATAGGATG CATTCTATTT 
GGAAGCACAC TATTGTCTGC GTCTGTCACA GCAGTTTACG CTCTCGAAAA ATACGGGAAT
GTCAATGACT TTTTAGATGA TGATTTAGTA AAGAACGGGA ATCCTGATGT TTATATCGTA
GTTGGAGAAA ACTCCGCTAC ACCAGATGTA ACATCTGCAA ACAAAATATC TGCAAAAATC
GGAACTCTTA CATACACTGA AAAGGTTGTT GAAGACACAT CTACAACAGG ATATCAAATT
ATTGGAGAAT CTGAAAGTAT TAATTTACTG GATGGAACAG ATAAATTAGA TAGTGCAGGT
ACTAAAAAAT CGTGGATTTT AGTTATTGGA GCAGATGACA GTTATTCTGA CTATTTCGAC
GATGATGAAG GAAATTCATT TTCATATTCT GAATTTTCGG AAGCAGATAA AACGAGATCT
CTTGGAGAAT TAGGATATTT ACTTAAGTTA TATGATATAG ATCCAAAAAA CCATTTTGAA
TCGGATGATG ATGCCTCAGA ACTGGTATTT GTAAGAATTA CTGATTCAGA CAAAAATGCA
AATTCTCAAA CATACAATAT CGAAAAAGAT ATGGTTTATG TATCAATTGT CTATCCTAAT
CAAATATCTG CCTTTAAACT AGCAAAAGAA TTAGAAGAAG GTTATGAAAT TCCGTTTTTG
GGCGATAAAT ACCGTATTGT AAAAATTGAT GAAGATGATG ACATAATATA TCTTGGAACA
GCTTCTTATG AAGGAACCAT TGCACATAAC GAATACATAA GTTCAGGCGA ATATCAGGTA
GTTTTGGGAG AAATTTTAGA AAATGAGGAT GAATACCGTG TTGAAATATC AGTATTAAGA
AATGGAAACA TGATTGAAGA ACATACTGAA ACATTAAGTT CTGATAAGTC ATTCTCATTC
ATTGCAGGAA AGATTGGAGT TACAGTTCAT GATGTATGGC TAAATACGGC TGCAGATACT
GGCTATGCAG ATATTACAAT ATGTAAATCA ATTACTGAAT TAGAGTTGGG GGAAGAATAC
ATTGAAAACT GGGAAATAAG GGCAGTCGTC AATAACGATG GAAATATCGA CTTTTTAAAA
ACTTATGAAG ATAAAACTGT AGGAATTGCA CTTGTATATA ATGGAAATGA TATTGAAAGA
ATAAAAGATG GTGACAGAAT CGAGATTGCA GATTATGTAA ATTTAGTATT TGATGATGAA
GATGATCTTG ATAAAATGAT TGCTGAATTT ACGGCTGAAA AAACAGTAAC TACTGGTTCA
ACCGGAGGTA CATTGGTTAC AACGTCAGGA AAAGTTCCAG AAGTTGTCCT TGATAGTGAA
ATCGAACTTG ATGAAACGGA CAAAAATTTA ATCCTTATAG GTGGCCCTGT TGCAAACCAT
TTAACAAAAG AGTTGCAGAA TAAAGGAAAA ATTGATATCG ACAATGAAAG TCCTGCGACA
GCAGTTTTAG TTGAAGGTGC AGCAAATGGA AATAACGTCC TTGTAATTGC AGGAGGGGAT
AGATACAGTA CGGAAAGTGC AGTATTGTCC ATCATGAATC TCATTTAA
 
Protein sequence
MRRMGEILKS ILKKIGCILF GSTLLSASVT AVYALEKYGN VNDFLDDDLV KNGNPDVYIV 
VGENSATPDV TSANKISAKI GTLTYTEKVV EDTSTTGYQI IGESESINLL DGTDKLDSAG
TKKSWILVIG ADDSYSDYFD DDEGNSFSYS EFSEADKTRS LGELGYLLKL YDIDPKNHFE
SDDDASELVF VRITDSDKNA NSQTYNIEKD MVYVSIVYPN QISAFKLAKE LEEGYEIPFL
GDKYRIVKID EDDDIIYLGT ASYEGTIAHN EYISSGEYQV VLGEILENED EYRVEISVLR
NGNMIEEHTE TLSSDKSFSF IAGKIGVTVH DVWLNTAADT GYADITICKS ITELELGEEY
IENWEIRAVV NNDGNIDFLK TYEDKTVGIA LVYNGNDIER IKDGDRIEIA DYVNLVFDDE
DDLDKMIAEF TAEKTVTTGS TGGTLVTTSG KVPEVVLDSE IELDETDKNL ILIGGPVANH
LTKELQNKGK IDIDNESPAT AVLVEGAANG NNVLVIAGGD RYSTESAVLS IMNLI