Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0220 |
Symbol | |
ID | 3784597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 233680 |
End bp | 235407 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810292 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_410920 |
Protein GI | 82701354 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTG TACTGCATGG GGTGGGCGTT TCGGAAGGTA TCGCCATAGG CCATGCTCAT CTTGCGTCTC ATGCCGCGCT GGAAGTCGCC CACCACGTCG TGCCCGAGGA TCAGGTCACC AATGAAATCT CCCGGCTGGA CACCGCTTTC ACAACCGTTC GCAAAGAGCT TCAGGCATTG CACGACACGG TCGTCAGCGG TCCGGCGGCT GCCGAATACG AAGCATTTCT CGATCTGCAC CGGATGATTC TGGATGATCC CACGCTCTCG ACTGCGGCAA AAGCGTATAT TGCCCAGAAT CAGTGCAATG CCGAATGGGC CATTACCCAG CAGATGGGGG TATTGATGGC GCAGTTCGAG GAAATAGAAG ATCCCTATCT GCGCGAACGT AAAACGGACG TCATCCAGGT GGTGGAGCGC GTTCTCAAGG TATTGCTGGG GCATCCCGGA TATATTCCCC CTTCGCAAAA GCGGGATGGC GACAGTGTCC TGGTAGCGCA CGACTTGAGT CCGGCCGATG TGCTTCAATA TAAGCAGTAT TCGTTTACTG CATTCCTGAC GGATCTTGGC GGCCTGACTT CGCATACCGC CATTGTCGCC CGCAGCCTGA ACATCCCCTC CGTCGTCGCG CTTCACCATG CCCGCCGCCT CATCCGGGAA AATGACATTC TGATCGTGGA TGGCAACCAG GGTGTGGTCA TCGTGGACCC GGACGAGCAT GTGCTCGCGG AATATCGGCT GCGCCAGAGC GAGCTGGAGC TGGAAAAACA GAAACTCAAG CGCTTGAGGA CGACGGTCGC GGCGACGCTG GATGGCACGG TAGTGGAACT GTATGCAAAT ATCGAGTTGC CGCAGGACGT CGATCAGGTC AAAGAAAATG GAGCGACCGG CGTCGGCCTC TTTCGGAGCG AATTCCTGTT CCTCAATCGG GACAGCTTGC CTGATGAGGA AGAGCAATTC GAAGCTTATC GAACCGTGGC GCGAAAAATG CGGGGAATGC CGGTAACGGT GCGCACGTTC GATCTTGGCG CGGACAAGAA CCTGGATCAT GCCAAACGGG TGGCTGCGAA TCCCGCCCTG GGCCTGCGAG CCATCCGGCT GAGCCTGGCC GAACCCCAGA TGTTCAATAT CCAGTTGCGC GCTATTTTAC GTGCTTCGCG CTATGGGCAG ATCCGGATTC TGGTGCCGAT GCTTTCCAAT GTTGCTGAGA TAACGCAGAC GCTGCATTTG ATCCAAAGCG CCAAACAAAG CCTGCGTAAC GAAAAGATTC CTTTTGACGA AAAAGTGCAG GTGGGAGGAA TGATAGAAAT TCCAGCCGCT GCGCTGAGTC TTGACATTTT CATGCGCAAG CTCGATTTCC TGTCAATCGG CACCAATGAT CTCATCCAGT ATACGCTGGC GATAGACCGC GCCGATGACA CTGTCGCGCA CCTCTATGAT TCGCTGCACC CGGCGGTGCT GCGGTTGGTC GCGCATGTCA TACGCAGCGC GAACCGTGCA AGCATACCGG TGTCCGTGTG CGGCGAGATC GCAGGGGATG TGGTGTTTAC CCGTCTGCTG CTCGGTTTCG GCTTACGCGT GTTTTCCATG CACCCCGTCC AACTGCTGAC CGTAAAACGT GAAGTCTTGC GGGCAAATCT GCCCGACCTC ATTCCGATCA CTCAAAAAAT ACTGAAAACC GCTGATCCGG AGAAAATTCA CGCACTGCTG GCGAAGCTCA ACGCTTGA
|
Protein sequence | MSFVLHGVGV SEGIAIGHAH LASHAALEVA HHVVPEDQVT NEISRLDTAF TTVRKELQAL HDTVVSGPAA AEYEAFLDLH RMILDDPTLS TAAKAYIAQN QCNAEWAITQ QMGVLMAQFE EIEDPYLRER KTDVIQVVER VLKVLLGHPG YIPPSQKRDG DSVLVAHDLS PADVLQYKQY SFTAFLTDLG GLTSHTAIVA RSLNIPSVVA LHHARRLIRE NDILIVDGNQ GVVIVDPDEH VLAEYRLRQS ELELEKQKLK RLRTTVAATL DGTVVELYAN IELPQDVDQV KENGATGVGL FRSEFLFLNR DSLPDEEEQF EAYRTVARKM RGMPVTVRTF DLGADKNLDH AKRVAANPAL GLRAIRLSLA EPQMFNIQLR AILRASRYGQ IRILVPMLSN VAEITQTLHL IQSAKQSLRN EKIPFDEKVQ VGGMIEIPAA ALSLDIFMRK LDFLSIGTND LIQYTLAIDR ADDTVAHLYD SLHPAVLRLV AHVIRSANRA SIPVSVCGEI AGDVVFTRLL LGFGLRVFSM HPVQLLTVKR EVLRANLPDL IPITQKILKT ADPEKIHALL AKLNA
|
| |