Gene EcSMS35_3710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3710 
SymbolglgP 
ID6145870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3774520 
End bp3776967 
Gene Length2448 bp 
Protein Length815 aa 
Translation table11 
GC content52% 
IMG OID641618536 
Productglycogen phosphorylase 
Protein accessionYP_001745676 
Protein GI170682143 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.30951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTC CGTTTACATA TTCATCGCCC ACGCTTAGCG TAGAAGCTCT TAAGCATTCT 
ATCGCTTACA AGCTGATGTT TACGATTGGC AAGGACCCGG TCGTCGCCAA TAAACATGAA
TGGCTGAACG CAACATTATT TGCTGTGCGC GATCGTCTCG TGGAGCGCTG GTTACGTTCA
AACCGTGCCC AGTTGTCGCA AGAAACTCGT CAGGTTTACT ACCTGTCGAT GGAGTTTTTG
ATTGGCCGTA CGCTCTCCAA CGCCATGTTG TCGCTAGGAA TTTACGAAGA TGTACAGGGC
GCACTGGAAG CGATGGGGTT AAATCTCGAA GAGCTGATTG ATGAAGAAAA TGACCCGGGC
CTCGGTAACG GTGGCCTGGG ACGTCTGGCG GCTTGCTTCC TCGATTCTCT GGCGACGTTA
GGGTTGCCGG GGCGCGGTTA CGGCATCCGC TATGACTACG GTATGTTCAA GCAGAACATC
GTTAACGGTA GCCAGAAAGA GTCGCCTGAC TACTGGCTGG AATACGGTAA CCCGTGGGAA
TTCAAACGCC ACAACACGCG CTATAAAGTC CGTTTTGGCG GTCGCATTCA GCAGGAAGGT
AAAAAAACGC GCTGGATTGA AACCGAAGAG ATTCTGGGAG TAGCTTACGA TCAGATAATC
CCAGGTTACG ACACCGACGC GACCAACACG TTGCGTTTGT GGAGTGCCCA AGCCAGTAGC
GAAATTAACC TCGGTAAATT CAACCAGGGC GACTACTTCG CGGCAGTGGA AGATAAAAAC
CACTCCGAGA ACGTATCTCG CGTACTGTAT CCGGACGACT CCACCTACTC CGGGCGTGAG
CTGCGACTGC GTCAGGAATA CTTCCTGGTT TCCTCGACCA TTCAGGACAT TTTAAGCCGC
CATTATCAGT TGCATAAAAC CTACGATAAC CTGGCGGATA AAATTGCGAT TCACCTCAAC
GATACCCATC CGGTGCTGTC GATTCCTGAG ATGATGCGTC TGCTGATCGA TGAGCACCAA
TTTAGCTGGG ACGACGCGTT TGAAGTGTGT TGTCAGGTCT TCTCCTACAC CAACCACACG
CTAATGAGCG AGGCGCTGGA AACCTGGCCA GTCGATATGC TGGGTAAAAT TCTGCCGCGT
CACCTGCAGA TCATCTTTGA AATCAACGAC TATTTCCTGA AAACTTTGCA GGAACAGTAT
CCGAACGATA CCGATCTGCT GGGACGGGCG TCGATCATCG ATGAATCCAA CGGTCGTCGT
GTGCGTATGG CCTGGCTGGC GGTTGTGGTG AGCCACAAAG TTAACGGTGT ATCGGAGCTG
CACTCTAACC TGATGGTGCA ATCGCTATTT GCCGACTTTG CGAAAATCTT CCCGGGTCGT
TTCACCAACG TCACCAACGG TGTGACGCCG CGTCGCTGGC TGGCGGTAGC GAACCCATCG
CTTTCAGCTG TGTTGGACGA ACACCTGGGC CGCAACTGGC GCACCGACCT TAGTCTGCTT
AATGAGCTGC AACAACACTG TGATTTCCCA ATGGTTAATC ACGCGGTGCA TCAGGCGAAG
CTGGAGAACA AAAAGCGTCT GGCGGAGTAT ATCGCCCAGC AGCTGAATGT GGTGGTAAAT
CCGAAAGCGC TGTTCGATGT GCAAATCAAA CGTATTCACG AATACAAACG TCAATTGATG
AATGTGTTGC ACGTGATCAC CCGCTATAAC CGCATCAAGG CCGACCCGGA TGCGAAGTGG
GTACCGCGCG TGAATATTTT TGGCGGTAAG GCGGCTTCGG CCTATTACAT GGCGAAGCAC
ATTATTCATT TGATCAATGA CGTAGCGAAA GTGATCAACA ACGATCCGCA GATTGGCGAC
AAGCTGAAAG TCGTGTTCAT CCCGAACTAC AGTGTTAGCC TGGCGCAGTT GATCATTCCG
GCGGCCGATC TATCTGAACA GATTTCGCTG GCGGGGACGG AAGCTTCCGG CACCAGTAAC
ATGAAGTTTG CGCTTAACGG CGCGCTGACT ATCGGTACGC TGGACGGTGC GAATGTCGAG
ATGCTGGATC ATGTCGGTGC TGACAATATC TTTATCTTTG GTAACACAGC GGAAGAAGTG
GAAGAACTGC GTCGTCAGGG CTACAAACCG CGTGAATACT ACGAGAAAGA TGAGGAGCTG
CATCAGGTGC TGACGCAAAT CGGCAGCGGT GTATTCAGTC CGGAAGATCC GGGTCGCTAT
CGCGATCTGG TCGATTCGCT GATCAACTTC GGCGATCACT ACCAGGTACT GGCGGATTAT
CGCAGCTATG TCGATTGTCA GGATAAAGTC GACGAACTCT ACGAGCGTCA GGAAGAGTGG
ACCGCAAAAG CGATGCTGAA CATTGCCAAT ATGGGCTACT TCTCTTCTGA CCGTACGATC
AAAGAGTACG CCGATCATAT CTGGCATATT GATCCGGTGA GATTGTAA
 
Protein sequence
MNAPFTYSSP TLSVEALKHS IAYKLMFTIG KDPVVANKHE WLNATLFAVR DRLVERWLRS 
NRAQLSQETR QVYYLSMEFL IGRTLSNAML SLGIYEDVQG ALEAMGLNLE ELIDEENDPG
LGNGGLGRLA ACFLDSLATL GLPGRGYGIR YDYGMFKQNI VNGSQKESPD YWLEYGNPWE
FKRHNTRYKV RFGGRIQQEG KKTRWIETEE ILGVAYDQII PGYDTDATNT LRLWSAQASS
EINLGKFNQG DYFAAVEDKN HSENVSRVLY PDDSTYSGRE LRLRQEYFLV SSTIQDILSR
HYQLHKTYDN LADKIAIHLN DTHPVLSIPE MMRLLIDEHQ FSWDDAFEVC CQVFSYTNHT
LMSEALETWP VDMLGKILPR HLQIIFEIND YFLKTLQEQY PNDTDLLGRA SIIDESNGRR
VRMAWLAVVV SHKVNGVSEL HSNLMVQSLF ADFAKIFPGR FTNVTNGVTP RRWLAVANPS
LSAVLDEHLG RNWRTDLSLL NELQQHCDFP MVNHAVHQAK LENKKRLAEY IAQQLNVVVN
PKALFDVQIK RIHEYKRQLM NVLHVITRYN RIKADPDAKW VPRVNIFGGK AASAYYMAKH
IIHLINDVAK VINNDPQIGD KLKVVFIPNY SVSLAQLIIP AADLSEQISL AGTEASGTSN
MKFALNGALT IGTLDGANVE MLDHVGADNI FIFGNTAEEV EELRRQGYKP REYYEKDEEL
HQVLTQIGSG VFSPEDPGRY RDLVDSLINF GDHYQVLADY RSYVDCQDKV DELYERQEEW
TAKAMLNIAN MGYFSSDRTI KEYADHIWHI DPVRL