Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0339 |
Symbol | |
ID | 3706510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 369158 |
End bp | 370654 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637736851 |
Product | hypothetical protein |
Protein accession | YP_342395 |
Protein GI | 77163870 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00480998 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATTC TCCCCCTTAA TTTATACACG GCTGCCCAAG TACGTGAATT GGACAGGTGC ACCATTGAAG AGTTTGGAAT ATCTGGCGCC ATGTTAATGG AGCGAGCCGG TAAGGCTTCC CTAGAGCAAC TACGCAAACA CTGGCCACAG GCACAACGCT TAGTGATTAT CTGTGGAGTC GGCAATAATG GTGGAGATGG CTATGTCCTG GCCCGCCTAG CCCGTAAAGC CGAAATGTAT GTATCCATCT ATCAACTAGG GGATAATAGC AAGCTTAGCA CCGATGCCCA AGCCGCGCGA CAAATGCTAT TAGACAGCGG CATGGAAATC CTTCCCTTCC AACCCCAAGT GCTCCGCGCC GCGGATGTGG TAATAGATGC TATTTTCGGC ACCGGACTCA GCCGGGGAGT CACTGGCCAG TGGGCCGATG CTATCGAAGC CATCAATACC TGCGGACAGC CGGTGTTTGC TATGGATATC CCCTCTGGGC TCCATGCCGA TACCGGGAAT ATCCTGGGTA TCGCCATCAA AGCCCAAGTG ACTGCCACCT TTGGGGGCCT GAAACAAGGG ATGTTCACCC ATCTGGGTCC GGATTACTGT GGCGAGATTG CTTTCGACTC CCTTAGTATT CCTCCCGAAG CCTATCACGG GGTGACGCCT TCGGCCCGCC GCATTACCCT TGAAGATCAC ATCACCAAAC TCCCCTCCCG CGCTAAGGCT GGCCATAAGG GCGATTACGG CCATGTTGTC ATCATTGGAG GCGAGAGAGG AATGCCCGGA GCCGCACGCA TGGCGGGGGA AGCAGCTTAT CGGGTAGGAG CGGGGCTGGT CAGTATTGCC ACCCGGGAGA AGCATGCTTC CCTGCTCAAC CTAGCCCGCC CAGAATTGAT GTGCTACGGG GTAGAGAGCG CCGAAGAGTT AAAACCATTG CTGAACCGGG CTACTACCCT TGTCATTGGT CCAGGCCTAG GCCAGGATCT TTGGGGCCAA ACAATGCTCG CTGAGGCACT CAACCATTCC CACCCTCTGG TGGTGGATGC GGATGCCCTC AATCTGCTAG CCAGTCAACC CCGCCAGCAT AACCGCTGGA TTATCACTCC CCACCCCGGT GAAGCATCGA GACTGCTGAA CATTAATATT GAAGAAATAC AAGCAGACCG TTTTGCCGCA GTCCAGGCTC TTCAACAGCG CTACGGCGGG GTTGCGGTGC TTAAAGGAAA TGGGTCTCTG GTATGCTCCA CGAATCACCC CCTAGGGCTA TGCACGGCAG GCAATCCAGG AATGGCCAGT GGCGGTATGG GGGACGTCCT CTCCGGAACT ATCGCCGGCC TGCTGGCTCA AGGACTTACC CTGAATAATG CGGCCCATTT GGGGGTAACA ATTCATGCCA TGGCTGGCGA TCGCGCTGCC CGTGAAGGGG GTGAGCGGGG ACTGCTGGCG AGCGATTTGA TGGAACATCT GCGACAACTG GCCAATCTTC AACAAATAGG CCCATGA
|
Protein sequence | MAILPLNLYT AAQVRELDRC TIEEFGISGA MLMERAGKAS LEQLRKHWPQ AQRLVIICGV GNNGGDGYVL ARLARKAEMY VSIYQLGDNS KLSTDAQAAR QMLLDSGMEI LPFQPQVLRA ADVVIDAIFG TGLSRGVTGQ WADAIEAINT CGQPVFAMDI PSGLHADTGN ILGIAIKAQV TATFGGLKQG MFTHLGPDYC GEIAFDSLSI PPEAYHGVTP SARRITLEDH ITKLPSRAKA GHKGDYGHVV IIGGERGMPG AARMAGEAAY RVGAGLVSIA TREKHASLLN LARPELMCYG VESAEELKPL LNRATTLVIG PGLGQDLWGQ TMLAEALNHS HPLVVDADAL NLLASQPRQH NRWIITPHPG EASRLLNINI EEIQADRFAA VQALQQRYGG VAVLKGNGSL VCSTNHPLGL CTAGNPGMAS GGMGDVLSGT IAGLLAQGLT LNNAAHLGVT IHAMAGDRAA REGGERGLLA SDLMEHLRQL ANLQQIGP
|
| |