Gene Noc_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2800 
Symbol 
ID3705280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3171324 
End bp3173072 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content51% 
IMG OID637739276 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_344777 
Protein GI77166252 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTTG AGCTGCACGG GATCGGCGTC TCCCGAGGCG TTGCTATCGG TAAAGTCCAC 
ATGCTTTCCC GCGATCAACC CGAGGTCAAT GAGTACATCC TGCCATCGAA TCTCATCGAA
AGTGAAGTTC AACGGTACCA AGCGGCGCTC ATTACCGCTA AGGCGCAACT ACAGTCTATT
CAAGAGCAAA TTCCCGATAA TATTTCAACG GATATAGCCG CTTTTATCAA CACCCACCTG
CTAATGCTCG AAGACAAGGC ACTAGTTAGC GTGCCCGTGG AGCTTATACA GACCCGGCAC
TGCAATGCTG AATGGGCGCT CAAACTACAG CGTGACGCTC TAGTTAGCGT TTTTGATGAA
ATGGACGACT CTTATCTTCG CACTCGCAAA AATGATGTAG ACCACGTTGT TAATCGTATT
CTACGCACAC TTACTAATCA GGAGAGCCCC TACCATGAAG CCCTTGGTAG CCGCCTTAAA
GGATATATCT TACTTGCCGA CGACTTAACC CCGGCAGATA TTGTCTTGAT GCAGCAGCAA
CAAGTGGGTG GATTTGCAAC CGAGCATGGC GGCGCTAATT CCCATACCAG TATTCTTGCC
CGCAGCCTGG GTATTCCTGC TATCGTTGGC CTGCGCGGGG CTCGCCGCTA TGTCCAGGAT
GATGAATTAC TCATTATTGA TGGTGGACAA GGAATCTTGC TTGCAGGAGT CGATGATCCT
CTTATTAAGA AATTTCAGGA GAGGCGGGCC GATCAACAAC GCCGGCTGGC GGCTCTAGCA
GCATTCCGGG GTCGGCCTGC GGTTACCCTT GAAGGACAAC TCATTACACT GGAGGCTAAC
ATTGAATTAC CGGAAGATTT ACCTCTCGTA ACCGATTCTG GCGCCGAAGG AATCGGCCTC
TATCGCACTG AATTTTTATA CATGAATCGT ACCGCCCCTC CCGACGAAGA GGAGCAGCTA
AAAGCCTACA CGCGGGTGGT GGAAGCCCTG GGGGGTGCTC CGGTAACCAT TCGCACGGTC
GATCTTGGTG CTGACAAGAC GGTAGACGGA AGCTATAGCA ATGCCTCAGT TGCCACTAAC
CCCGCCTTAG GACTACGTGC TATTCGCCTA TGCCTACGAA AACCGGGGCT ATTTCGCCCT
CAGTTACGGG CCATTCTCCG AGCCTCGGCA CGAGGTCCCG TGCGGTTGCT ATTGCCGATG
ATCTGCACCC TCCAGGAACT AACCCAGGTT ATGGCATTGT TAGAGGACTG CAAACGGGAA
CTGAAACGCC AAGGACTGAA ATACGACCCC GCCTTGCCGG TGGGTGCTAT GATTGAAGTT
CCCGCTACGG CTATCTGTGC TGAGGTTTTT GCTCGCCATC TGGATTTCCT TTCCATTGGC
ACCAATGATC TCACTCAATA TACACTTGCT ATTGACCGTA TTGATGATGA GGTGAACTAC
CTCTATAGTC CCTTACACCC AGCAGTCTTA CATCTCATTC ATCGCACTAT CGCTGCCGGA
GAGAAAACAG GCACGGCAGT CGCCATGTGT GGGGAAATGG CGGGAGATAT TCACTATACT
CGGCTGCTGT TGGCGTTGGG GCTGCGGGAT TTTAGTATGC ACCCCGCTTC CCTACTAGAA
GTCAAACAGG TGATCACCGA AAGCGATCAG CGAAAGCTAA CGGGATTAGC TCAGCAGATT
TTAGAGTGCT GCGATCTTTC GGAAATGCAA GCGCTATTAG AACAAATTAA TGAAGGTCTA
CCCCATTAA
 
Protein sequence
MTLELHGIGV SRGVAIGKVH MLSRDQPEVN EYILPSNLIE SEVQRYQAAL ITAKAQLQSI 
QEQIPDNIST DIAAFINTHL LMLEDKALVS VPVELIQTRH CNAEWALKLQ RDALVSVFDE
MDDSYLRTRK NDVDHVVNRI LRTLTNQESP YHEALGSRLK GYILLADDLT PADIVLMQQQ
QVGGFATEHG GANSHTSILA RSLGIPAIVG LRGARRYVQD DELLIIDGGQ GILLAGVDDP
LIKKFQERRA DQQRRLAALA AFRGRPAVTL EGQLITLEAN IELPEDLPLV TDSGAEGIGL
YRTEFLYMNR TAPPDEEEQL KAYTRVVEAL GGAPVTIRTV DLGADKTVDG SYSNASVATN
PALGLRAIRL CLRKPGLFRP QLRAILRASA RGPVRLLLPM ICTLQELTQV MALLEDCKRE
LKRQGLKYDP ALPVGAMIEV PATAICAEVF ARHLDFLSIG TNDLTQYTLA IDRIDDEVNY
LYSPLHPAVL HLIHRTIAAG EKTGTAVAMC GEMAGDIHYT RLLLALGLRD FSMHPASLLE
VKQVITESDQ RKLTGLAQQI LECCDLSEMQ ALLEQINEGL PH