Gene MmarC5_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmarC5_1071 
Symbol 
ID4929072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcus maripaludis C5 
KingdomArchaea 
Replicon accessionNC_009135 
Strand
Start bp1027103 
End bp1028173 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content32% 
IMG OID640166566 
Producthypothetical protein 
Protein accessionYP_001097589 
Protein GI134046103 
COG category[L] Replication, recombination and repair
[S] Function unknown 
COG ID[COG0177] Predicted EndoIII-related endonuclease
[COG1833] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01083] endonuclease III 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.661198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATC TTGATACCCC ATTTATTAAA TTTTTAGATA TTTTAGGCGA AAATTTAAAA 
AAAGATGCAG TAGTTGACAA AATATCTAAA AATTCAGACG AAAATGAACG GGCTTTTAAA
ATATTAGTTT CTACAGTGAT AAGCGCACGA ACCAAAGATG AAACTACCGC AAAAGTATCA
AAAGAGCTAT TCAAAAAAGT AAAAACTCCA AAAGAGCTTT CAGAAATTTC TTTAGATAAC
CTTGAAAAGT TAGTTCACCC TGCAGGATTT TACAAAACTA AAGCTAAAAA TCTAAAAAAA
TTAGGCAAAA TTTTACTTGA AGAGTACGAT TCAAAAATTC CAAATTCAAT TGAAGAACTT
ATAACACTTC CAGGGGTTGG ACGAAAAACT GCAAACTTAG TAATGACTCT TGCATTTGAT
GAATACGCAA TCTGTGTTGA CACACACGTT CACAGAATTA CAAATCGTTG GAATTATGTT
GATACTGAGT TTCCTGAAAA CACAGAAATG GAACTTCGAA AAAAACTTCC GAAAGATTAC
TGGAAAAGAA TTAACAATCT ACTTGTTGTA TTTGGCCAAG AAATATGCAG CCCGATTCCA
AAATGCGATA AGTGTTTTTC TGAAATTCGA AAAATCTGCC CCCACTACAA TTCATTAAAA
GAACTCGAAA AAATTTATAA AGAGTTCAAC TTTAAAAAGA CCCCAAAAAC TAAAATCCCC
AAAGACAAAG GCACTTACGT CTTAAGAATA AAGATGAACG CTCCAAGAAC CATTCTCGTT
GGAAAAAGAG AGATTAAATT TAAAAAAGGA GATTATTTTT ACATCGGTTC TGCAATGGGT
AACAGCATGA ATTTATACAA TAGAATAAGC AGGCATCTGT CTGAAAATAA GAAAAAAAGA
TGGCATATTG ATTATTTACT GGAATTTTCA AATGTAAAAG AAGTAAACGT AACAATTGGA
CGATTTGAAT GTGATGTTTC GCAGAAGTTT AATATAGTGT TCGATTCTGT GGAATCTTTC
GGATGTTCGG ACTGCAAGTG TAAAAGTCAT CTCTATTACA TTAAACCATG A
 
Protein sequence
MNNLDTPFIK FLDILGENLK KDAVVDKISK NSDENERAFK ILVSTVISAR TKDETTAKVS 
KELFKKVKTP KELSEISLDN LEKLVHPAGF YKTKAKNLKK LGKILLEEYD SKIPNSIEEL
ITLPGVGRKT ANLVMTLAFD EYAICVDTHV HRITNRWNYV DTEFPENTEM ELRKKLPKDY
WKRINNLLVV FGQEICSPIP KCDKCFSEIR KICPHYNSLK ELEKIYKEFN FKKTPKTKIP
KDKGTYVLRI KMNAPRTILV GKREIKFKKG DYFYIGSAMG NSMNLYNRIS RHLSENKKKR
WHIDYLLEFS NVKEVNVTIG RFECDVSQKF NIVFDSVESF GCSDCKCKSH LYYIKP