Gene Nwi_0378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0378 
Symbol 
ID3676963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp423105 
End bp425372 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content67% 
IMG OID637711918 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_316997 
Protein GI75674576 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.223846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.989273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGCG CGTCGAGCGG CCCCCGCGTC TTGTTGAGAC GCCTCCGCGA AGTCATGGCG 
GAGCCGGTCA GCGCGCAGGA GCGTCTCGAC AAGATCGTGG TGCTGATCGC CGCCAACATG
GTGGCGGAAG TCTGCTCGAC CTATGTGCTG CGCGTCGACA ATACGCTGGA ACTCTACGCG
ACCGAGGGCC TCAATCGCGA CGCCGTGCAC CGCACCGTGC TGACCGCGCA TGAAGGCCTG
GTCGGCCTGG TCGCCAGCGA AGCGACGCCC CTCAATCTGT CGGATGCGCG CAGTCACCCG
GCATTCTCGT TCCGCCCGGA AACCGGCGAA GAAATCTACA ACTCGTTCCT CGGCGTGCCG
ATCCTGCGCG CGGGCAACAC CCTCGGCGTG CTGGTGGTGC AGAACCGCGC CCGGCGCACC
TATGTCGAGG AAGAGGTCGA GGCGCTTCAG ACCACGGCGA TGGTCCTCGC CGAGATGATA
GCTTCCGGCG AACTGGCCGC GCTCGCGCAG CCCGGTCTCG AACCCGCCGC CCGCCACACC
ATCCACCAGA TCGGCGCGAT CCTGTCCGAC GGCATCGCGC TCGGCCATGT CGTGCTGCAC
GAACCGCGCA TCGTCATCAC CAACTACATC GCCGACGACG TGCCAAAGGA AATCCGGCGG
CTGGAAACGG CGCTCACCAA GCTGCGCACG GACCTCGACC GCCTGCTGGA GCGCGGCGAC
GTCGCCGAGG GCGGCGAGCA TCGCGACGTG CTCGAAGCCT ACCGGATGTT CGCCAATGAT
CACGGCTGGT CGCACAGGCT GCAGGAGGCG GCCGCGACCG GATTGACCGC CGAGGCCGCT
GTCGAGCGCG TGCAATCAGA CACGCGCGCG CGAATGTTGC GCTCGACCGA TCCTTATTTG
CGCGATCGCC TTCATGACCT TGAGGACCTC GGCCACCGCC TGATGCGGCA ACTGGTCGGA
CAGGATCACG CGCCTTCCCG CGAACAGCTC CCGGATAACG CGATCCTGAT CGCGCGCGCC
ATGGGACCCG CGGCCCTGCT GGACTATGAC CGCAGGCGGC TGCGCGGACT GGTGCTGGAG
GAAGGCACGG CGACCTCGCA TGTCGCCATC GTCGCGCGCG CGCTGGGTAT TCCCGCCGTC
GGCGAGGTGC CGAACGCGGT CGGAATCGCC GATCCCGGCG ACGCCATCAT CGTCGACGGC
ACCTCCGGCT CGATTTACCT GCGGCCATCG GCGGAGATTG AATCCGCCTA TGCCGAGCGC
GTGCGCTTTC GCGCCCGGCG GCAGGCGCAG TATATCGCAT TGCGCGACAA GCCCTGCGTC
ACCAGGGACG GCGAGCCGGT CGATCTCATG ATCAATGCGG GCCTCGTGAT CGACCTGCCG
CATCTGGAGG ACACCAATTG CTCCGGCATC GGCCTGTTCC GGACCGAACT GCAATTCATG
CTCGGACAGA GTCTGCCGCG CGCAAGCGAG CAGCTTGCGC TGTACCGCAA GGTGCTGGAC
GCCGCCGGCG ACAAGCCGGT GACCTTCCGC ACCCTCGATA TCGGCGGCGA CAAGGCGCTT
CCCTACATGG AAGCGGTGGC AGAGGAAAAT CCCGCGCTGG GCTGGCGCGC CATCAGGCTC
GGGCTCGATC GCCCCGGCCT GCTGCGCGGC CAGATCCGCG CGCTGCTGCG CGCCGGAAGC
GGCCGCGCGC TGCGCATCAT GTTCCCCATG ATCTCGGAAG TCGCCGAATT CGATCAGGCC
AAAGCCATCG TCGAGCGCGA ACTCACTTAT CTGCGCCAGC ATGACCACAC GCTCCCGGAA
CGGGTGGACG TCGGCAGCAT GGTGGAAGTC CCTGCGCTGT TGTACCAGCT CGACGAATTG
TTCGCGAAGG CCGACTTCGT GTCCGTCGGA TCGAACGACC TGTTTCAGTT CATGTTCGCG
GTCGATCGCG GCAACAGCAA GGTATCGAAC CGCTTCGACA CGCTCTCTCC CCCGATCCTG
CGGGCGCTGC GCGACATCGT TGTCAAGGCG CGGGCGGCGG GCAAGAGCGC CTCGCTGTGC
GGGGAAATGG CGTCGCAACC GCTGGGCGCG CTGGCGCTGA TCGCGCTCGG CTACCGCTCG
CTCTCGCTTT CGGCCACCGC GCACGGTCCG GTCAAGGCCC TGATCCTTGA TCTCGATACG
AAAAAGGCGG AAGCGGCGAT CCTGCCGCTG CTCGAAGCGC CCTCCGGCAG CATTTCCGTC
CGCGATAAGC TCAAGGACTT CGCCGACGCC GAGGGGCTAC CGCTATGA
 
Protein sequence
MRSASSGPRV LLRRLREVMA EPVSAQERLD KIVVLIAANM VAEVCSTYVL RVDNTLELYA 
TEGLNRDAVH RTVLTAHEGL VGLVASEATP LNLSDARSHP AFSFRPETGE EIYNSFLGVP
ILRAGNTLGV LVVQNRARRT YVEEEVEALQ TTAMVLAEMI ASGELAALAQ PGLEPAARHT
IHQIGAILSD GIALGHVVLH EPRIVITNYI ADDVPKEIRR LETALTKLRT DLDRLLERGD
VAEGGEHRDV LEAYRMFAND HGWSHRLQEA AATGLTAEAA VERVQSDTRA RMLRSTDPYL
RDRLHDLEDL GHRLMRQLVG QDHAPSREQL PDNAILIARA MGPAALLDYD RRRLRGLVLE
EGTATSHVAI VARALGIPAV GEVPNAVGIA DPGDAIIVDG TSGSIYLRPS AEIESAYAER
VRFRARRQAQ YIALRDKPCV TRDGEPVDLM INAGLVIDLP HLEDTNCSGI GLFRTELQFM
LGQSLPRASE QLALYRKVLD AAGDKPVTFR TLDIGGDKAL PYMEAVAEEN PALGWRAIRL
GLDRPGLLRG QIRALLRAGS GRALRIMFPM ISEVAEFDQA KAIVERELTY LRQHDHTLPE
RVDVGSMVEV PALLYQLDEL FAKADFVSVG SNDLFQFMFA VDRGNSKVSN RFDTLSPPIL
RALRDIVVKA RAAGKSASLC GEMASQPLGA LALIALGYRS LSLSATAHGP VKALILDLDT
KKAEAAILPL LEAPSGSISV RDKLKDFADA EGLPL