Gene Information |
Locus tag | Noc_2800 |
Symbol | |
ID | 3705280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3171324 |
End bp | 3173072 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637739276 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_344777 |
Protein GI | 77166252 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
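The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention, although both agree with the listed values. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3171324
END_BP = 3173072
GENE_LENGTH_BP = 1749
PROTEIN_LENGTH_AA = 582


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1749
print(expected_protein_length(GENE_LENGTH_BP))  # 582
```

Both results match the Gene Length and Protein Length rows, so the fields are internally consistent under these assumptions.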
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTTG AGCTGCACGG GATCGGCGTC TCCCGAGGCG TTGCTATCGG TAAAGTCCAC ATGCTTTCCC GCGATCAACC CGAGGTCAAT GAGTACATCC TGCCATCGAA TCTCATCGAA AGTGAAGTTC AACGGTACCA AGCGGCGCTC ATTACCGCTA AGGCGCAACT ACAGTCTATT CAAGAGCAAA TTCCCGATAA TATTTCAACG GATATAGCCG CTTTTATCAA CACCCACCTG CTAATGCTCG AAGACAAGGC ACTAGTTAGC GTGCCCGTGG AGCTTATACA GACCCGGCAC TGCAATGCTG AATGGGCGCT CAAACTACAG CGTGACGCTC TAGTTAGCGT TTTTGATGAA ATGGACGACT CTTATCTTCG CACTCGCAAA AATGATGTAG ACCACGTTGT TAATCGTATT CTACGCACAC TTACTAATCA GGAGAGCCCC TACCATGAAG CCCTTGGTAG CCGCCTTAAA GGATATATCT TACTTGCCGA CGACTTAACC CCGGCAGATA TTGTCTTGAT GCAGCAGCAA CAAGTGGGTG GATTTGCAAC CGAGCATGGC GGCGCTAATT CCCATACCAG TATTCTTGCC CGCAGCCTGG GTATTCCTGC TATCGTTGGC CTGCGCGGGG CTCGCCGCTA TGTCCAGGAT GATGAATTAC TCATTATTGA TGGTGGACAA GGAATCTTGC TTGCAGGAGT CGATGATCCT CTTATTAAGA AATTTCAGGA GAGGCGGGCC GATCAACAAC GCCGGCTGGC GGCTCTAGCA GCATTCCGGG GTCGGCCTGC GGTTACCCTT GAAGGACAAC TCATTACACT GGAGGCTAAC ATTGAATTAC CGGAAGATTT ACCTCTCGTA ACCGATTCTG GCGCCGAAGG AATCGGCCTC TATCGCACTG AATTTTTATA CATGAATCGT ACCGCCCCTC CCGACGAAGA GGAGCAGCTA AAAGCCTACA CGCGGGTGGT GGAAGCCCTG GGGGGTGCTC CGGTAACCAT TCGCACGGTC GATCTTGGTG CTGACAAGAC GGTAGACGGA AGCTATAGCA ATGCCTCAGT TGCCACTAAC CCCGCCTTAG GACTACGTGC TATTCGCCTA TGCCTACGAA AACCGGGGCT ATTTCGCCCT CAGTTACGGG CCATTCTCCG AGCCTCGGCA CGAGGTCCCG TGCGGTTGCT ATTGCCGATG ATCTGCACCC TCCAGGAACT AACCCAGGTT ATGGCATTGT TAGAGGACTG CAAACGGGAA CTGAAACGCC AAGGACTGAA ATACGACCCC GCCTTGCCGG TGGGTGCTAT GATTGAAGTT CCCGCTACGG CTATCTGTGC TGAGGTTTTT GCTCGCCATC TGGATTTCCT TTCCATTGGC ACCAATGATC TCACTCAATA TACACTTGCT ATTGACCGTA TTGATGATGA GGTGAACTAC CTCTATAGTC CCTTACACCC AGCAGTCTTA CATCTCATTC ATCGCACTAT CGCTGCCGGA GAGAAAACAG GCACGGCAGT CGCCATGTGT GGGGAAATGG CGGGAGATAT TCACTATACT CGGCTGCTGT TGGCGTTGGG GCTGCGGGAT TTTAGTATGC ACCCCGCTTC CCTACTAGAA GTCAAACAGG TGATCACCGA AAGCGATCAG CGAAAGCTAA CGGGATTAGC TCAGCAGATT TTAGAGTGCT GCGATCTTTC GGAAATGCAA GCGCTATTAG AACAAATTAA TGAAGGTCTA CCCCATTAA
|
Protein sequence | MTLELHGIGV SRGVAIGKVH MLSRDQPEVN EYILPSNLIE SEVQRYQAAL ITAKAQLQSI QEQIPDNIST DIAAFINTHL LMLEDKALVS VPVELIQTRH CNAEWALKLQ RDALVSVFDE MDDSYLRTRK NDVDHVVNRI LRTLTNQESP YHEALGSRLK GYILLADDLT PADIVLMQQQ QVGGFATEHG GANSHTSILA RSLGIPAIVG LRGARRYVQD DELLIIDGGQ GILLAGVDDP LIKKFQERRA DQQRRLAALA AFRGRPAVTL EGQLITLEAN IELPEDLPLV TDSGAEGIGL YRTEFLYMNR TAPPDEEEQL KAYTRVVEAL GGAPVTIRTV DLGADKTVDG SYSNASVATN PALGLRAIRL CLRKPGLFRP QLRAILRASA RGPVRLLLPM ICTLQELTQV MALLEDCKRE LKRQGLKYDP ALPVGAMIEV PATAICAEVF ARHLDFLSIG TNDLTQYTLA IDRIDDEVNY LYSPLHPAVL HLIHRTIAAG EKTGTAVAMC GEMAGDIHYT RLLLALGLRD FSMHPASLLE VKQVITESDQ RKLTGLAQQI LECCDLSEMQ ALLEQINEGL PH
|
| |
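The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following helpers strip that spacing, compute the GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in the start codons it permits, which this sketch does not model. It also assumes the input contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 9 bases of the gene sequence listed above.
print(translate("ATGACCCTTG AGCTGCAC"))  # MTLELH
print(translate("CCCCATTAA"))            # PH
```

The two outputs agree with the first six and last two residues of the listed protein sequence. Passing the full Gene sequence field to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.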