Gene Nmul_A0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0220 
Symbol 
ID3784597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp233680 
End bp235407 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content56% 
IMG OID637810292 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_410920 
Protein GI82701354 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTG TACTGCATGG GGTGGGCGTT TCGGAAGGTA TCGCCATAGG CCATGCTCAT 
CTTGCGTCTC ATGCCGCGCT GGAAGTCGCC CACCACGTCG TGCCCGAGGA TCAGGTCACC
AATGAAATCT CCCGGCTGGA CACCGCTTTC ACAACCGTTC GCAAAGAGCT TCAGGCATTG
CACGACACGG TCGTCAGCGG TCCGGCGGCT GCCGAATACG AAGCATTTCT CGATCTGCAC
CGGATGATTC TGGATGATCC CACGCTCTCG ACTGCGGCAA AAGCGTATAT TGCCCAGAAT
CAGTGCAATG CCGAATGGGC CATTACCCAG CAGATGGGGG TATTGATGGC GCAGTTCGAG
GAAATAGAAG ATCCCTATCT GCGCGAACGT AAAACGGACG TCATCCAGGT GGTGGAGCGC
GTTCTCAAGG TATTGCTGGG GCATCCCGGA TATATTCCCC CTTCGCAAAA GCGGGATGGC
GACAGTGTCC TGGTAGCGCA CGACTTGAGT CCGGCCGATG TGCTTCAATA TAAGCAGTAT
TCGTTTACTG CATTCCTGAC GGATCTTGGC GGCCTGACTT CGCATACCGC CATTGTCGCC
CGCAGCCTGA ACATCCCCTC CGTCGTCGCG CTTCACCATG CCCGCCGCCT CATCCGGGAA
AATGACATTC TGATCGTGGA TGGCAACCAG GGTGTGGTCA TCGTGGACCC GGACGAGCAT
GTGCTCGCGG AATATCGGCT GCGCCAGAGC GAGCTGGAGC TGGAAAAACA GAAACTCAAG
CGCTTGAGGA CGACGGTCGC GGCGACGCTG GATGGCACGG TAGTGGAACT GTATGCAAAT
ATCGAGTTGC CGCAGGACGT CGATCAGGTC AAAGAAAATG GAGCGACCGG CGTCGGCCTC
TTTCGGAGCG AATTCCTGTT CCTCAATCGG GACAGCTTGC CTGATGAGGA AGAGCAATTC
GAAGCTTATC GAACCGTGGC GCGAAAAATG CGGGGAATGC CGGTAACGGT GCGCACGTTC
GATCTTGGCG CGGACAAGAA CCTGGATCAT GCCAAACGGG TGGCTGCGAA TCCCGCCCTG
GGCCTGCGAG CCATCCGGCT GAGCCTGGCC GAACCCCAGA TGTTCAATAT CCAGTTGCGC
GCTATTTTAC GTGCTTCGCG CTATGGGCAG ATCCGGATTC TGGTGCCGAT GCTTTCCAAT
GTTGCTGAGA TAACGCAGAC GCTGCATTTG ATCCAAAGCG CCAAACAAAG CCTGCGTAAC
GAAAAGATTC CTTTTGACGA AAAAGTGCAG GTGGGAGGAA TGATAGAAAT TCCAGCCGCT
GCGCTGAGTC TTGACATTTT CATGCGCAAG CTCGATTTCC TGTCAATCGG CACCAATGAT
CTCATCCAGT ATACGCTGGC GATAGACCGC GCCGATGACA CTGTCGCGCA CCTCTATGAT
TCGCTGCACC CGGCGGTGCT GCGGTTGGTC GCGCATGTCA TACGCAGCGC GAACCGTGCA
AGCATACCGG TGTCCGTGTG CGGCGAGATC GCAGGGGATG TGGTGTTTAC CCGTCTGCTG
CTCGGTTTCG GCTTACGCGT GTTTTCCATG CACCCCGTCC AACTGCTGAC CGTAAAACGT
GAAGTCTTGC GGGCAAATCT GCCCGACCTC ATTCCGATCA CTCAAAAAAT ACTGAAAACC
GCTGATCCGG AGAAAATTCA CGCACTGCTG GCGAAGCTCA ACGCTTGA
 
Protein sequence
MSFVLHGVGV SEGIAIGHAH LASHAALEVA HHVVPEDQVT NEISRLDTAF TTVRKELQAL 
HDTVVSGPAA AEYEAFLDLH RMILDDPTLS TAAKAYIAQN QCNAEWAITQ QMGVLMAQFE
EIEDPYLRER KTDVIQVVER VLKVLLGHPG YIPPSQKRDG DSVLVAHDLS PADVLQYKQY
SFTAFLTDLG GLTSHTAIVA RSLNIPSVVA LHHARRLIRE NDILIVDGNQ GVVIVDPDEH
VLAEYRLRQS ELELEKQKLK RLRTTVAATL DGTVVELYAN IELPQDVDQV KENGATGVGL
FRSEFLFLNR DSLPDEEEQF EAYRTVARKM RGMPVTVRTF DLGADKNLDH AKRVAANPAL
GLRAIRLSLA EPQMFNIQLR AILRASRYGQ IRILVPMLSN VAEITQTLHL IQSAKQSLRN
EKIPFDEKVQ VGGMIEIPAA ALSLDIFMRK LDFLSIGTND LIQYTLAIDR ADDTVAHLYD
SLHPAVLRLV AHVIRSANRA SIPVSVCGEI AGDVVFTRLL LGFGLRVFSM HPVQLLTVKR
EVLRANLPDL IPITQKILKT ADPEKIHALL AKLNA