Gene Noc_1552 details

Gene Information

Locus tag: Noc_1552
Symbol:
ID: 3705810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1721790
End bp: 1724132
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table: 11
GC content: 52%
IMG OID: 637738036
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_343565
Protein GI: 77165040
COG category: [T] Signal transduction mechanisms
COG ID: [COG3605] Signal transduction protein containing GAF and PtsI domains
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
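
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Noc_1552.
length_bp = gene_length(1721790, 1724132)
print(length_bp)                  # 2343
print(protein_length(length_bp))  # 780
```

Both results agree with the Gene Length and Protein Length fields of the record.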


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAAG ATCAGGCTGA ATTATCGGAT TTTACCATCG GACCTACAAG TTCCCGTAGT 
CTAGGCTCCC TGAGCCGGAT TGTTCGGGAA GTTAGCCTTG CGGCTAGCCT CGAAGAGGTA
TTGCAGGTCA TTGTGGTGCA AACCCGTAAG ATGATGGCGG TGGATGTGTG CTCTGTTTAT
CTTACCGAGA GCAATGGCAG TCATGTGCTG ATGGCAACCC AAGGATTGCA TCCGGAAGCA
GTGGGGCAGG TACGGCTTAT TCCGAGAGAA GGATTGGTGG GTTTGGTGGC GGAACGGGCT
GAGCCGGTCA ATTTGGAAAA CGCAGCTGCT CATCCTCATT TTAAGTTTAT TCCTGGCTCG
GGCGAAGAAC CATTTCAGGC ATTTTTAGGC GTACCTGTTC TCCATCAGCG AAAATTATTA
GGTGTGCTGG TGGTGCAACA GAAAATAGCC CGCCAATTCG ATGAAGTTGA CGTTTCTTTT
TTATTTACAT TAGCCGCTCA ACTGGGTGGC GTCATTGCCC ATGCTAGGGC CAGTGGTGTT
CTACAAAAAC CTATGGGTAG CCAGAATGAG AATGGGCCTG AGCGCTACCT TACCGGTATT
GTCGGGGCTC CCGGAGTAGC CCTAGGAAAA GGGGTTGTGG TTTATTCCGC TACTGGTTTA
GATACAGTGC CTGAGCGTCA GGTAGCCAAT ATCAAGGAAG AGGAGAGAAT CTTCCGGCTA
GCCGTAGTTC ATGTCTCCCA GGAGATTCAG TCTCTAGGAA ATCACCTAGA GCTATCATTG
GCGAATGAAT ACCAGCCTCT GTTTAAAGCT TATGCCATGC TTACAGGAAG CCAAAGTTTG
GTGGAAGCTA CGGTAGAACG TATCCGGGCC GGTGAGTGGG CGCCGAGCGC CTTAAGTACG
GTTATCAAGG AACAAGCCCG GCGTCTTGAA GCCCAGGAAG ATCTTTATTT GCGGGAGCGG
GGCAATGACC TGCGAGAGAT TGGCCGGCGT ATTTTGGGCT ATCTTCAGAA TGTGGCGCCT
ATTAACTTGG AATATCCAGA GAATACCATC TTGATAGGTG AAAACTTGAG CGCCATGGAT
CTTGCCGAGG TACCCATGGG GCGTCTGGCA GGGGTAATTT CGGCCCATGG ATCGGGTTTC
TCCCATGTGG CGATTCTAGC CCACGCTATG GGTATTCCCG CCATTATGGG AATTAGTAAG
GCGAACCTTG GTCAGTTGGA CCAGCGGGAA TTGATCCTGG ATGGTTATCA AGGCCGGGTA
CATTTAGAGC CAAGCAGGCT AGTACGCCAA GAGTTTGCCC GCCTCGCCCG GCAGGAGCAA
CAGCTTACGG AAGAACTCAA GGGCTTGCGG GATTTGCCTG CCGAAACGCC AGATGGTTTT
CGGGTTCATT TATACGCCAA CATTGGCCTG TTGGCGGATA TCGAGCTTTC TCTTGCTGCG
GGTACTGAGG GTGTGGGGCT TTACCGGACC GAGTTGCTTT TTATGGTGAG GGATCAGTTT
CCCACTGAAG AAGAGCAATA TGCCGTGTAT CGGAAACTTC TCCAGGCTTT TACCCCTTCT
CCCGTCGTTT TACGCGTACT TGATGTGGGT GGTGATAAAT TCCTGCCCTA TTTCTCGATT
GAGGAGGCCA ATCCATTTTT AGGATGGCGA GGAATTCGCG TTATTCTGGA TCATCCAGAA
ATATTTTTGA CTCAAGTACG AGCGTCACTC CGGGCAGCCG AAGGGTTAAG TAACCTGAAT
TTGCTGTTTC CCATGATTAG CGCGGTTTCT GAGTTGGAAG AAGCTCTACA TATAGTGCGG
CGGGCCTATG AAGGACTGGT AGAGGAAGGT GTTCGGGTTA CTTGGCCTCG GGTAGGGGTC
ATGATTGAGG TGCCTGCCGC GGTCTATCAG GTGGAAGCGT TGGCGCGGCG AGTGGATTTT
CTTTCCATTG GCACTAACGA TCTTGCTCAA TATCTGCTGG CGGTTGATCG CAGTAATGAG
CGGGTGGCGG AATTATATCA TTCCCTCCAT CCGGCTGTTT TGGCCGCCAT TCTCACAGTA
GTGAAGGCGG CCCGCCGGCA CCATAAGCCC GTTAGCGTGT GCGGCGAGAT GGCGGGTGAG
GCCACGGCGG CTATATTATT GCTGGGTATG GGGATAGATA ACCTCAGTCT GACCGCTGGC
GATTTACCCC GAATCAAGTG GATAATAAGG AATTTTAGCC AGCAGTATGC GAGGGAATTG
CTCGCTCGGG CCTTGCGGGA AGAAAAACCC GAGCCTATCC ATAAAATGTT ATGCGAGGCC
CTGGATAACT TTGGGCTAGG GGAGTTGATA CGGGGGGGGA AAACAAGTTC CCCCCTGCTT
TGA
 
Protein sequence
MKQDQAELSD FTIGPTSSRS LGSLSRIVRE VSLAASLEEV LQVIVVQTRK MMAVDVCSVY 
LTESNGSHVL MATQGLHPEA VGQVRLIPRE GLVGLVAERA EPVNLENAAA HPHFKFIPGS
GEEPFQAFLG VPVLHQRKLL GVLVVQQKIA RQFDEVDVSF LFTLAAQLGG VIAHARASGV
LQKPMGSQNE NGPERYLTGI VGAPGVALGK GVVVYSATGL DTVPERQVAN IKEEERIFRL
AVVHVSQEIQ SLGNHLELSL ANEYQPLFKA YAMLTGSQSL VEATVERIRA GEWAPSALST
VIKEQARRLE AQEDLYLRER GNDLREIGRR ILGYLQNVAP INLEYPENTI LIGENLSAMD
LAEVPMGRLA GVISAHGSGF SHVAILAHAM GIPAIMGISK ANLGQLDQRE LILDGYQGRV
HLEPSRLVRQ EFARLARQEQ QLTEELKGLR DLPAETPDGF RVHLYANIGL LADIELSLAA
GTEGVGLYRT ELLFMVRDQF PTEEEQYAVY RKLLQAFTPS PVVLRVLDVG GDKFLPYFSI
EEANPFLGWR GIRVILDHPE IFLTQVRASL RAAEGLSNLN LLFPMISAVS ELEEALHIVR
RAYEGLVEEG VRVTWPRVGV MIEVPAAVYQ VEALARRVDF LSIGTNDLAQ YLLAVDRSNE
RVAELYHSLH PAVLAAILTV VKAARRHHKP VSVCGEMAGE ATAAILLLGM GIDNLSLTAG
DLPRIKWIIR NFSQQYAREL LARALREEKP EPIHKMLCEA LDNFGLGELI RGGKTSSPLL
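
The sequences above are printed in space-separated groups of 10 characters. The sketch below shows one possible way to strip that grouping, compute GC content, and translate a nucleotide sequence. It assumes the elongation codon assignments of NCBI translation table 11, which are the same as those of the standard code; the helper names are hypothetical. Applied to the full gene sequence, `gc_content` could be compared with the GC content field of the record.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order, as listed for NCBI
# translation table 11 (assumed identical to the standard code here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "ATGAAGCAAG ATCAGGCTGA ATTATCGGAT TTTACCATCG GACCTACAAG TTCCCGTAGT"
print(translate(first_row))  # MKQDQAELSDFTIGPTSSRS
```

The translated fragment matches the first 20 residues of the protein sequence. The sketch does not handle the alternative start codons that table 11 permits, which is not an issue for this gene because it begins with ATG.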