Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC7_1555 |
Symbol | |
ID | 5329311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C7 |
Kingdom | Archaea |
Replicon accession | NC_009637 |
Strand | - |
Start bp | 1517387 |
End bp | 1518457 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640794109 |
Product | DNA-(apurinic or apyrimidinic site) lyase |
Protein accession | YP_001330764 |
Protein GI | 150403470 |
COG category | [L] Replication, recombination and repair [S] Function unknown |
COG ID | [COG0177] Predicted EndoIII-related endonuclease [COG1833] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01083] endonuclease III |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.272014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA CTGATATCCC ATTTATTAAA TTTTTAGACG TCTTGGACGA AAATTTAAAA AAAGATGCCG TAGTTGACAA AATATCTAAA AATTCGAATG AAAATGAACG GGCTTTTAAA ATATTAGTTT CTACTGTGAT AAGTGCGCGA ACTAAAGATG AAACTACTGC AAAAGTATCA AAAGCGCTAT TTAAAAAAGT AAAAAGTCCA AAAGACCTTT CTGACATTTC TTTAGAAGAA CTTGAAAAAT TAGTTCATCC TGCAGGATTT TACAAAACTA AGGCTAAAAA TTTAAAAAAA TTAGGTAAAA TATTACTTGA AGAGTATGAT TCAAAAATTC CAAACTCGAT TGAAGAACTT GTAACTCTTC CCGGAGTAGG GCGAAAAACT GCAAACTTAG TAATGACCCT TGCATTTGAT GATTACGCAA TCTGTGTTGA TACACACGTT CACAGAATTA CAAATCGCTG GAATTATGTT AATACCGAGT TTCCTGAAGA CACAGAAATG GAACTTAGAA AAAAACTTCC GAAAAATTAC TGGAAAAGAA TTAACAATCT GCTTGTTGTA TTTGGGCAAG AAATATGCAG CCCGATTCCA AAATGCGATA AGTGTTTTTC CGAAATTCGA GAAATCTGTC CGCACTACAA TTCATTAAAA GAACTCGAAA AAATTTATAA AGATTTTAAC TTTAAAAAGA CTCCAAAAAC CAAAATTCCA AAAGATAAAG GTACTTACGT CTTAAGAATA AAAATGAACG CTCCAAGAAC CATTCTCGTT GGAAAAAGGG AAATTAAATT TAAAAAAGGA GATTACTTCT ACATTGGTTC TGCAATGGGG GACAGCATGA ACCTTTACAA CAGGATAAAC AGACATCTGT CTGAAAATAA GAAAAAAAGA TGGCATATTG ATTATTTACT TGAATTTTCA AATGTAAAAG AAGTAAACGT AACTCTTGGA CGATTCGAAT GTGATGTTTC ACAAAGATTT AATTTAGTTT TCGATTCAGT AGAATCTTTC GGATGCTCGG ACTGCAAGTG TAAAAGTCAC TTATATTACA TTAAACCCTG A
|
Protein sequence | MNNTDIPFIK FLDVLDENLK KDAVVDKISK NSNENERAFK ILVSTVISAR TKDETTAKVS KALFKKVKSP KDLSDISLEE LEKLVHPAGF YKTKAKNLKK LGKILLEEYD SKIPNSIEEL VTLPGVGRKT ANLVMTLAFD DYAICVDTHV HRITNRWNYV NTEFPEDTEM ELRKKLPKNY WKRINNLLVV FGQEICSPIP KCDKCFSEIR EICPHYNSLK ELEKIYKDFN FKKTPKTKIP KDKGTYVLRI KMNAPRTILV GKREIKFKKG DYFYIGSAMG DSMNLYNRIN RHLSENKKKR WHIDYLLEFS NVKEVNVTLG RFECDVSQRF NLVFDSVESF GCSDCKCKSH LYYIKP
|
| |