Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0023 |
Symbol | |
ID | 3705956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 19267 |
End bp | 20124 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637736547 |
Product | formylmethanofuran--tetrahydromethanopterin formyltransferase |
Protein accession | YP_342095 |
Protein GI | 77163570 |
COG category | [C] Energy production and conversion |
COG ID | [COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase |
TIGRFAM ID | [TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.701164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCA CTCGCATACT TATTACAGCA CAGGATCTAA AGTGGGCCTA TCATTCGGCC CAAACCATGA CCGGGTTCGC CACTTCGGTA ATTGCTTGTG GTTGCGAGGC AGGTATCGAA CGAGAATTAG ATCCTTCTGA GACACCCGAC GGACGCCCTG GAGTCGCCAT TTTACTATTC GCCATAGGGG GCAAAGGATT AGCTAAACAA CTTGAAGCCC GGGTTGGTCA GTGCGTACTC ACTTCACCCA CTTCCGCATT ATTTGCTGGA ATATATGAGG GTGAACTTAT CCCGTTAGGA AAGAATCTCC GCTTTTTTGG TGATGGCTTT CAAATATCGA AAATTATTAA TGGCCGCCGT TATTGGCGAA TTCCCGTCAT GGAGGGCGAA TTTCTTACCG AAGAGAAGGT GGGTATGATC CCAGCAATTG GCGGCGGCAA TTTTCTCGTT CTCGCCCAAT CTCAACCTCA AGCGTTAACG GCCTGCGAGG CGGCTATCGC AGCCATGAAA AAAATTCCTA ATGTTATAAT GCCTTTTCCT GGTGGCATCG TGCGCGCGGG TTCAAAGGTG GGTTCTAAAT ATCCAGGAAT AAAGGCATCC ACCAATGATG TTTTTTGCCC CACGCTTAAA GGGAAAACGC ATACTAATCT CTCGGCTGAG ATTGAATCCG TATTGGAAAT TGTTATTGAT GGTTTTTCTA AAGAAGATAT CCAAAAGGCT ATGTATGCTG GAATTTCTGC CGTCTGTAAC TTTGGAGCCC AAAACGGGAT ATACCGCATC AGTGCGGGCA ATTATGGAGG AAAACTAGGT CCGATTCATT TTCATCTCCA GGAGATCATG ATGGGACAAA TGGCTTAA
|
Protein sequence | MKATRILITA QDLKWAYHSA QTMTGFATSV IACGCEAGIE RELDPSETPD GRPGVAILLF AIGGKGLAKQ LEARVGQCVL TSPTSALFAG IYEGELIPLG KNLRFFGDGF QISKIINGRR YWRIPVMEGE FLTEEKVGMI PAIGGGNFLV LAQSQPQALT ACEAAIAAMK KIPNVIMPFP GGIVRAGSKV GSKYPGIKAS TNDVFCPTLK GKTHTNLSAE IESVLEIVID GFSKEDIQKA MYAGISAVCN FGAQNGIYRI SAGNYGGKLG PIHFHLQEIM MGQMA
|
| |