Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0379 |
Symbol | |
ID | 3706550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 419630 |
End bp | 421213 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637736891 |
Product | hypothetical protein |
Protein accession | YP_342435 |
Protein GI | 77163910 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG0645] Predicted kinase [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGAC AGATAGACAC ACTGATTGAG CACTTGCAGC AGCCTGGGAT TTACCATCAT GCTGTGGAAA ATCTGACCAT GATTGAGACC CACATTTCCT GGGTAGTGCT TACTGGACCC TATGCCTACA AAATTAAAAA ACCCCTTGAT CTAGGTTTTC TTGATTTTTC TACCCTGGAC AAGCGCCGTC ACTATTGTGA TGAGGAGCTG CGAATCAACC GCCGGCTTGC TCCCGAAATT TACCTGGAGG TAGTGCCTAT CACCGGTAGT ACGGCCCAAC CCCATCTCGG GGGAACCGGT ATCCCGATAG AATACGCGGT CAAAATGATC CAATTTCCCC AGCAGACCCG CTTGGACTAC TGTTTGCAGC GGGGTGACTT ATCCCCGAAG CAGGTGGAAG ACTTAGCTGG TAAAGTCGCT GCATTTCATC AAAATGTTGC AATCGCTCCC CAGGACAGCC CCTATGGTGC GCCCAAAACC ATAGGACAGC CAGCCCTGGA AAACTTTCAA CAGATGGAAA TCTTCCTTAA AGAAGCGGAG GATCAAAAAA AATTGGCTCG CCTCCGGCAC TGGACGGAGA AAAAATGGCA GCAACTGCAA GCAGAATTTA CCGGTCGTAA AAAGGCGGGC TTTGTACGAG AATGTCACGG CGATTTACAT CTGGGGAATA TCGCCTTAAG AGAGGGTAAA TTCATCATCT TTGATGGGAT TGAGTTTAAT GAGAACCTGC GCTGGATCGA TGTCATGAGC GAGCTCGCCT TCCTGATCAT GGATCTGGAG AATCGGGGTC GCCCGGATTT AGCCCACCGT TGTTTGAATA GTTATCTGGA GCACAGTGGC GACTATCCAG GCTTGGCGGT GCTTGCTTAC TACCAGGTTT ATCGCGCCCT GGTCCGAGCC AAAGTCACTG CTATCCGCCT AGGACAAACG CCCCCAGAAA CCCAGGCTAT CAAAGAACGC TGCCGTGATT ATCTGAATCT GGCCTTACAT TATATCCAAC CGGCCAGATC TTTTTTGCTC ATTACCCATG GTCTCTCTGG TAGCGGCAAA ACCACCTTGA GCCAGCCCCT CATTGAACGC TTCGGTACCA TTCGTTTACG TTCGGATATT GAACGCAAGC GCCGCCATGG TTTGAAACCT CGGGAGCGGC TCAATAAGGG GATTGGTATC GGAATGTATT CCGCAGAGTC CAGCCATAAG ACTTATCAGC ACCTCCAACA GCTTGCTCAA ACGATCCTGA AAGCAGGATA TCCCCTCATT GTGGATGCGG CTTTTCTTAA GCAGCAACAA CGCCAAATAT TTCAGGATCT AGCTAATAAG CTCAACATAC CCTTTGCCAT TCTGGATTTT CATTGCGATC CCCAACAGCT ACAGCAACGG ATACGGGAGC GCCAACATAA AAATCAGGAT GCCTCCGACG CTGATCTCGC TGTTCTGGAA CATCAACAGG CCACGCAGGA GCCCCTGACC AAGGCAGAGC AGGCAATTAC CCTGGCAATC GATACTTCCC AGACTCAGGC CATGGAAACA GTCATCCAGC AGCTACAAGT CCTTACCGGG GAAAAAGCCC CTAGGGTCAA ATAG
|
Protein sequence | MAGQIDTLIE HLQQPGIYHH AVENLTMIET HISWVVLTGP YAYKIKKPLD LGFLDFSTLD KRRHYCDEEL RINRRLAPEI YLEVVPITGS TAQPHLGGTG IPIEYAVKMI QFPQQTRLDY CLQRGDLSPK QVEDLAGKVA AFHQNVAIAP QDSPYGAPKT IGQPALENFQ QMEIFLKEAE DQKKLARLRH WTEKKWQQLQ AEFTGRKKAG FVRECHGDLH LGNIALREGK FIIFDGIEFN ENLRWIDVMS ELAFLIMDLE NRGRPDLAHR CLNSYLEHSG DYPGLAVLAY YQVYRALVRA KVTAIRLGQT PPETQAIKER CRDYLNLALH YIQPARSFLL ITHGLSGSGK TTLSQPLIER FGTIRLRSDI ERKRRHGLKP RERLNKGIGI GMYSAESSHK TYQHLQQLAQ TILKAGYPLI VDAAFLKQQQ RQIFQDLANK LNIPFAILDF HCDPQQLQQR IRERQHKNQD ASDADLAVLE HQQATQEPLT KAEQAITLAI DTSQTQAMET VIQQLQVLTG EKAPRVK
|
| |