Gene Information |
Locus tag | Noc_0804 |
Symbol | |
ID | 3707070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 876266 |
End bp | 877225 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637737306 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_342847 |
Protein GI | 77164322 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.914964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATGAGT TCTCCCTGAT TGAAAATTTT TTTGCGGATT GTACTCAGAA ACGGGAAGAC GTTGCGCTGG CGGTGGGTGA CGATTGCGCC TTGATGACTG TCCCTCCAGG TTGTGAATTG GCGGTTTCTA TTGATACGTT AGTAGCCGGG GTGCACTTTA CTGCCGAGGT GGATTCCGCC GCTTTGGGGC ACAAAGCGCT GACGGTAGGA TTGAGCGATC TTGCTGCTAT GGGGGCAGAA CCGGCCTGGG CGACTTTGGC GTTGACTCTG CCAGAGCTCG ACAGAGCTTG GCTGGCTGGG TTTACTCAAG GGTTAAGCAA GCTTGCCAGA AGCTACGGTG TGCAATTGGT AGGGGGAGAT ACCACTCGGG GGCCGCTGGC GGTCACTATG CAGTTGCATG GTTTCGTGCC TCGGGGTAAA GCCCTGAGGC GTGATGGAGC ACGTCCCGGT GATGGAATTT ACGTAACGGG AACTTTGGGT GATTCTGGCC TTGCCCTTCA AGCGCGATTG GAAGGTCTCC AGTTATCCCA GGAGGCTTTA TGCTATGTTG AGCATCGCCT GGATTGGCCA CAGCCTCGGG TACATGAAGC CTTGGCGCTT CGTCCTCTCG CCCATGCTGC TATCGATATC TCAGATGGTC TCGCAGCAGA TTTGGGACAT ATCCTGAAAG GCAGCGGTGT TGGTGCGGCG GTTGAAGTAG AGGCTTTGCC GCTTTCAGAT TCCTTTCGTG CTTCTCTTGA GTTGGAGCAA GCCTGGGCCT TGGCGCTAAC CGCAGGCGAT GACTACGAAT TGTGTGTGAC CGCGCCTGCC GAATACCATG ACCGGATACA GGCGGTGCTC TCGGATCGGG GTTGTCCCTG CACCTTGATT GGAACGATTG AAGAGGAGCC AGGCTTCCGT TGCCGCCGCC GGAATGGAGC TTCATTTATT CCCCAACAGC AGGGTTACCG TCATTTTTAG
|
Protein sequence | MNEFSLIENF FADCTQKRED VALAVGDDCA LMTVPPGCEL AVSIDTLVAG VHFTAEVDSA ALGHKALTVG LSDLAAMGAE PAWATLALTL PELDRAWLAG FTQGLSKLAR SYGVQLVGGD TTRGPLAVTM QLHGFVPRGK ALRRDGARPG DGIYVTGTLG DSGLALQARL EGLQLSQEAL CYVEHRLDWP QPRVHEALAL RPLAHAAIDI SDGLAADLGH ILKGSGVGAA VEVEALPLSD SFRASLELEQ AWALALTAGD DYELCVTAPA EYHDRIQAVL SDRGCPCTLI GTIEEEPGFR CRRRNGASFI PQQQGYRHF
|
| |
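The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length, and that the Protein Length excludes the terminal stop codon (the gene sequence ends in TAG). The function names `gene_length`, `expected_protein_length`, and `gc_percent` are illustrative helpers and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 876266
END_BP = 877225


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 960
print(expected_protein_length(length_bp))  # 319
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared against the rounded GC content reported above. The leading GTG codon is read as methionine because translation table 11 permits GTG as an alternative start codon.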