Gene Information |
Locus tag | Sbal223_1997 |
Symbol | |
ID | 7086831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 2356854 |
End bp | 2358557 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643460900 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_002357924 |
Protein GI | 217973173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
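As a consistency check on the coordinates above, the following minimal sketch recomputes the gene and protein lengths from the Start bp and End bp fields. It assumes inclusive 1-based coordinates and a single terminal stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2356854, 2358557)
print(length, protein_length(length))  # 1704 567
```

Both values agree with the Gene Length and Protein Length fields of this record.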
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0479663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000161829 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAATAA CGGGGATTAT AGTGTCATCG GGAATAGCCT TTGGTCAGGC ACTTCACCTT ATTCACACCG AACACCACCT CGATTATCGC CCTATTCCCC TGTCGAAAAT TCCCCAACAA CAGGGCAAGT TTGCCAAAGC CTTGCAAGAG CTGCAGGCAC AATTAACCCA CAGCCAAGCC GCACTCGATA GCGATTCAGA AAATTATCAG CTGATCGAAG CCGACCTATT GTTATTGGAA GACGATGAAT TAATCGAGCA AGTGAACGAT GCGATTCGTA CCTTACAACT GTCCGCAAGT GTGGCGGTTG AACGCATATT TGCCCATCAA GCCAACGAGT TGCAATCCCT AGATGATCCC TATTTAGCCA ATAGAGCCCA AGATGTGCGC TGTTTAGGCC AACGTGTTGT CGCCGCGATC AATGGCCATT TAAACCAAGG GCTCGACAAA CTCGATAAGC CCACCATCTT GTTAGCGCAA GATTTAACCC CCGCCGAATT TGCCTTACTG CCGAGGGAAA ACCTCTGCGG TATTGTGCTC AAAACTGGCG GTTTAACCAG TCATACGGCG ATTTTAGCCC GAGCTGCCGG CATTCCAGCC ATCTTAAGTT GTCAGTTTGA TGCCGATTCG ATCCCCAACG GCACGCCCTT AGTACTCGAT GCGCTCAATG GTGAGCTTTG CGTTAATCCC AATCCAGATC AACAGGCAAG ACTCACAGTC ACCTTTCACC ACGAACAGGC AAGACGGGCA GCGCTGCAAA CCTATAAGGA TGGCCCCGCG CAAACGCAAG ATGGCCATAT CGTGGGGCTT ATGGCTAACG TCGGCAATCT CAACGACATC ACCCATGTCA GCGATGTTGG CGCCGATGGT ATAGGTTTGT TTCGCACCGA ATTTATGCTG ATGAACGTCA GCACCCTGCC CGATGAGAAA GCCCAATACA GCTTATATTG CGATGCATTG CACGCTCTGG GCGGTAAGAC CTTTACCATC CGCACCTTAG ATATCGGTGC CGACAAAGAA CTGCCTTGCC TGTGCCAAGA AATAGAAGAT AATCCCGCCT TAGGGCTGCG CGGCATTCGC TACACCTTGG CACACCCCGA CTTATTTAAA ACCCAATTGA GGGCTATTTT GCGCGCCGCA AACCACGGTC CGATCCGCTT GATGTTCCCT ATGGTTAATC AAGTCGAAGA ATTGGATGAA GTGTTTGCAC TGATTGCCCA GTGCCAAGAT GCCCTGGAAG AAGAAGAGAA AGGTTACGGT GAACTCAGCT ACGGTATCGT TGTCGAAACC CCCGCAGCGG TATTTAACCT CAATGCTATG CTGCCACGAC TCGACTTTGT CAGCATTGGC ACCAATGATT TAACCCAATA TGCAATGGCA GCCGATAGGA CCAACCCGCA GCTTACCCGC GACTATCCGA GCCTTTCGCC TGCCATTTTA GCGTTAATTA ACATGACAAT AGTCCAAGCA AAAGCGGCCA ATGTGAAAGT GTCGCTGTGC GGCGAACTGG CCAGTTCACC ACAAATTGCA CCGCTGTTAA TCGGCATGGG GCTGGACGAA CTCAGTGTTA ACTTAAGCTC ACTGTTAGAA GTCAAAGCTG CCATTTGCCA AGGCAACATC CAACAATTTT CGGCGCTGGC GCACACTGCA TTACAACAAG ATAGAATTGC AGGTCTACAG CAGTGTATAA CAAGCTATAA ATAG
|
Protein sequence | MSITGIIVSS GIAFGQALHL IHTEHHLDYR PIPLSKIPQQ QGKFAKALQE LQAQLTHSQA ALDSDSENYQ LIEADLLLLE DDELIEQVND AIRTLQLSAS VAVERIFAHQ ANELQSLDDP YLANRAQDVR CLGQRVVAAI NGHLNQGLDK LDKPTILLAQ DLTPAEFALL PRENLCGIVL KTGGLTSHTA ILARAAGIPA ILSCQFDADS IPNGTPLVLD ALNGELCVNP NPDQQARLTV TFHHEQARRA ALQTYKDGPA QTQDGHIVGL MANVGNLNDI THVSDVGADG IGLFRTEFML MNVSTLPDEK AQYSLYCDAL HALGGKTFTI RTLDIGADKE LPCLCQEIED NPALGLRGIR YTLAHPDLFK TQLRAILRAA NHGPIRLMFP MVNQVEELDE VFALIAQCQD ALEEEEKGYG ELSYGIVVET PAAVFNLNAM LPRLDFVSIG TNDLTQYAMA ADRTNPQLTR DYPSLSPAIL ALINMTIVQA KAANVKVSLC GELASSPQIA PLLIGMGLDE LSVNLSSLLE VKAAICQGNI QQFSALAHTA LQQDRIAGLQ QCITSYK
|
| |
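The relationship between the gene and protein sequences above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG), and it uses the standard codon assignments, which translation table 11 shares for elongation. Alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCAATAA CGGGGATTAT AGTGTCATCG"))  # MSITGIIVSS
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should reproduce the complete protein in the same way.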