Gene MmarC7_1555 details

Gene Information

Locus tag: MmarC7_1555
Symbol:
ID: 5329311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C7
Kingdom: Archaea
Replicon accession: NC_009637
Strand:
Start bp: 1517387
End bp: 1518457
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 32%
IMG OID: 640794109
Product: DNA-(apurinic or apyrimidinic site) lyase
Protein accession: YP_001330764
Protein GI: 150403470
COG category: [L] Replication, recombination and repair; [S] Function unknown
COG ID: [COG0177] Predicted EndoIII-related endonuclease; [COG1833] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01083] endonuclease III
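
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1517387, 1518457)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results agree with the Gene Length and Protein Length fields listed in the record.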


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.272014
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATA CTGATATCCC ATTTATTAAA TTTTTAGACG TCTTGGACGA AAATTTAAAA 
AAAGATGCCG TAGTTGACAA AATATCTAAA AATTCGAATG AAAATGAACG GGCTTTTAAA
ATATTAGTTT CTACTGTGAT AAGTGCGCGA ACTAAAGATG AAACTACTGC AAAAGTATCA
AAAGCGCTAT TTAAAAAAGT AAAAAGTCCA AAAGACCTTT CTGACATTTC TTTAGAAGAA
CTTGAAAAAT TAGTTCATCC TGCAGGATTT TACAAAACTA AGGCTAAAAA TTTAAAAAAA
TTAGGTAAAA TATTACTTGA AGAGTATGAT TCAAAAATTC CAAACTCGAT TGAAGAACTT
GTAACTCTTC CCGGAGTAGG GCGAAAAACT GCAAACTTAG TAATGACCCT TGCATTTGAT
GATTACGCAA TCTGTGTTGA TACACACGTT CACAGAATTA CAAATCGCTG GAATTATGTT
AATACCGAGT TTCCTGAAGA CACAGAAATG GAACTTAGAA AAAAACTTCC GAAAAATTAC
TGGAAAAGAA TTAACAATCT GCTTGTTGTA TTTGGGCAAG AAATATGCAG CCCGATTCCA
AAATGCGATA AGTGTTTTTC CGAAATTCGA GAAATCTGTC CGCACTACAA TTCATTAAAA
GAACTCGAAA AAATTTATAA AGATTTTAAC TTTAAAAAGA CTCCAAAAAC CAAAATTCCA
AAAGATAAAG GTACTTACGT CTTAAGAATA AAAATGAACG CTCCAAGAAC CATTCTCGTT
GGAAAAAGGG AAATTAAATT TAAAAAAGGA GATTACTTCT ACATTGGTTC TGCAATGGGG
GACAGCATGA ACCTTTACAA CAGGATAAAC AGACATCTGT CTGAAAATAA GAAAAAAAGA
TGGCATATTG ATTATTTACT TGAATTTTCA AATGTAAAAG AAGTAAACGT AACTCTTGGA
CGATTCGAAT GTGATGTTTC ACAAAGATTT AATTTAGTTT TCGATTCAGT AGAATCTTTC
GGATGCTCGG ACTGCAAGTG TAAAAGTCAC TTATATTACA TTAAACCCTG A
 
Protein sequence
MNNTDIPFIK FLDVLDENLK KDAVVDKISK NSNENERAFK ILVSTVISAR TKDETTAKVS 
KALFKKVKSP KDLSDISLEE LEKLVHPAGF YKTKAKNLKK LGKILLEEYD SKIPNSIEEL
VTLPGVGRKT ANLVMTLAFD DYAICVDTHV HRITNRWNYV NTEFPEDTEM ELRKKLPKNY
WKRINNLLVV FGQEICSPIP KCDKCFSEIR EICPHYNSLK ELEKIYKDFN FKKTPKTKIP
KDKGTYVLRI KMNAPRTILV GKREIKFKKG DYFYIGSAMG DSMNLYNRIN RHLSENKKKR
WHIDYLLEFS NVKEVNVTLG RFECDVSQRF NLVFDSVESF GCSDCKCKSH LYYIKP
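
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the helpers below strip the block formatting, compute GC content, and translate DNA to protein. The names `clean_sequence`, `gc_content`, and `translate` are hypothetical, not part of any database API. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs only in the set of permitted start codons).

```python
BASES = "TCAG"
# Standard genetic code amino acids, ordered by codon with bases in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGAACAATA CTGATATCCC ATTTATTAAA")
print(translate(first_blocks))  # MNNTDIPFIK
```

The translated fragment matches the first ten residues of the protein sequence shown above. Applying the same helpers to the full gene sequence would allow the GC content and Protein Length fields to be checked against the record.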