Gene Sbal223_1997 details

Gene Information

Locus tag: Sbal223_1997
Symbol:
ID: 7086831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2356854
End bp: 2358557
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 50%
IMG OID: 643460900
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_002357924
Protein GI: 217973173
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
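
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function name `expected_lengths` is a hypothetical helper, not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from this record.
print(expected_lengths(2356854, 2358557))  # (1704, 567)
```

Under these assumptions the result agrees with the listed Gene Length (1704 bp) and Protein Length (567 aa).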


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0479663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000161829
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAATAA CGGGGATTAT AGTGTCATCG GGAATAGCCT TTGGTCAGGC ACTTCACCTT 
ATTCACACCG AACACCACCT CGATTATCGC CCTATTCCCC TGTCGAAAAT TCCCCAACAA
CAGGGCAAGT TTGCCAAAGC CTTGCAAGAG CTGCAGGCAC AATTAACCCA CAGCCAAGCC
GCACTCGATA GCGATTCAGA AAATTATCAG CTGATCGAAG CCGACCTATT GTTATTGGAA
GACGATGAAT TAATCGAGCA AGTGAACGAT GCGATTCGTA CCTTACAACT GTCCGCAAGT
GTGGCGGTTG AACGCATATT TGCCCATCAA GCCAACGAGT TGCAATCCCT AGATGATCCC
TATTTAGCCA ATAGAGCCCA AGATGTGCGC TGTTTAGGCC AACGTGTTGT CGCCGCGATC
AATGGCCATT TAAACCAAGG GCTCGACAAA CTCGATAAGC CCACCATCTT GTTAGCGCAA
GATTTAACCC CCGCCGAATT TGCCTTACTG CCGAGGGAAA ACCTCTGCGG TATTGTGCTC
AAAACTGGCG GTTTAACCAG TCATACGGCG ATTTTAGCCC GAGCTGCCGG CATTCCAGCC
ATCTTAAGTT GTCAGTTTGA TGCCGATTCG ATCCCCAACG GCACGCCCTT AGTACTCGAT
GCGCTCAATG GTGAGCTTTG CGTTAATCCC AATCCAGATC AACAGGCAAG ACTCACAGTC
ACCTTTCACC ACGAACAGGC AAGACGGGCA GCGCTGCAAA CCTATAAGGA TGGCCCCGCG
CAAACGCAAG ATGGCCATAT CGTGGGGCTT ATGGCTAACG TCGGCAATCT CAACGACATC
ACCCATGTCA GCGATGTTGG CGCCGATGGT ATAGGTTTGT TTCGCACCGA ATTTATGCTG
ATGAACGTCA GCACCCTGCC CGATGAGAAA GCCCAATACA GCTTATATTG CGATGCATTG
CACGCTCTGG GCGGTAAGAC CTTTACCATC CGCACCTTAG ATATCGGTGC CGACAAAGAA
CTGCCTTGCC TGTGCCAAGA AATAGAAGAT AATCCCGCCT TAGGGCTGCG CGGCATTCGC
TACACCTTGG CACACCCCGA CTTATTTAAA ACCCAATTGA GGGCTATTTT GCGCGCCGCA
AACCACGGTC CGATCCGCTT GATGTTCCCT ATGGTTAATC AAGTCGAAGA ATTGGATGAA
GTGTTTGCAC TGATTGCCCA GTGCCAAGAT GCCCTGGAAG AAGAAGAGAA AGGTTACGGT
GAACTCAGCT ACGGTATCGT TGTCGAAACC CCCGCAGCGG TATTTAACCT CAATGCTATG
CTGCCACGAC TCGACTTTGT CAGCATTGGC ACCAATGATT TAACCCAATA TGCAATGGCA
GCCGATAGGA CCAACCCGCA GCTTACCCGC GACTATCCGA GCCTTTCGCC TGCCATTTTA
GCGTTAATTA ACATGACAAT AGTCCAAGCA AAAGCGGCCA ATGTGAAAGT GTCGCTGTGC
GGCGAACTGG CCAGTTCACC ACAAATTGCA CCGCTGTTAA TCGGCATGGG GCTGGACGAA
CTCAGTGTTA ACTTAAGCTC ACTGTTAGAA GTCAAAGCTG CCATTTGCCA AGGCAACATC
CAACAATTTT CGGCGCTGGC GCACACTGCA TTACAACAAG ATAGAATTGC AGGTCTACAG
CAGTGTATAA CAAGCTATAA ATAG
 
Protein sequence
MSITGIIVSS GIAFGQALHL IHTEHHLDYR PIPLSKIPQQ QGKFAKALQE LQAQLTHSQA 
ALDSDSENYQ LIEADLLLLE DDELIEQVND AIRTLQLSAS VAVERIFAHQ ANELQSLDDP
YLANRAQDVR CLGQRVVAAI NGHLNQGLDK LDKPTILLAQ DLTPAEFALL PRENLCGIVL
KTGGLTSHTA ILARAAGIPA ILSCQFDADS IPNGTPLVLD ALNGELCVNP NPDQQARLTV
TFHHEQARRA ALQTYKDGPA QTQDGHIVGL MANVGNLNDI THVSDVGADG IGLFRTEFML
MNVSTLPDEK AQYSLYCDAL HALGGKTFTI RTLDIGADKE LPCLCQEIED NPALGLRGIR
YTLAHPDLFK TQLRAILRAA NHGPIRLMFP MVNQVEELDE VFALIAQCQD ALEEEEKGYG
ELSYGIVVET PAAVFNLNAM LPRLDFVSIG TNDLTQYAMA ADRTNPQLTR DYPSLSPAIL
ALINMTIVQA KAANVKVSLC GELASSPQIA PLLIGMGLDE LSVNLSSLLE VKAAICQGNI
QQFSALAHTA LQQDRIAGLQ QCITSYK
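
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and compared follows. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Assumption: table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean_sequence(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean_sequence(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_row = "ATGTCAATAA CGGGGATTAT AGTGTCATCG GGAATAGCCT TTGGTCAGGC ACTTCACCTT"
last_row = "CAGTGTATAA CAAGCTATAA ATAG"
print(translate(first_row))  # MSITGIIVSSGIAFGQALHL
print(translate(last_row))   # QCITSYK
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the listed protein sequence and GC content.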