Gene EcSMS35_4365 details


Gene Information

Locus tag: EcSMS35_4365
Symbol: glpX
ID: 6147330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4451997
End bp: 4453007
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 56%
IMG OID: 641619186
Product: fructose 1,6-bisphosphatase II
Protein accession: YP_001746310
Protein GI: 170681842
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1494] Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase and related proteins
TIGRFAM ID: [TIGR00330] fructose-1,6-bisphosphatase, class II
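
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
# Minimal consistency check for the coordinate and length fields of this record.
# Assumptions (not stated in the record): coordinates are 1-based and inclusive,
# and the reported gene length includes the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4451997, 4453007)
print(length)                  # 1011
print(protein_length(length))  # 336
```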


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.732468
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGAG AACTTGCCAT CGAATTTTCC CGCGTCACCG AATCTGCGGC GCTGGCTGGC 
TACAAATGGC TAGGACGCGG CGATAAAAAC ACCGCGGACG GCGCAGCGGT AAACGCCATG
CGTATTATGC TCAACCAGGT CAACATTGAC GGCACCATCG TTATTGGTGA AGGTGAAATC
GACGAAGCAC CGATGCTCTA CATTGGTGAA AAAGTCGGTA CTGGTCGCGG CGACGCGGTA
GATATTGCTG TTGATCCGAT TGAAGGCACG CGCATGACGG CGATGGGCCA GGCTAACGCG
CTGGCGGTGC TGGCTGTTGG CGATAAAGGC TGCTTCCTCA ATGCGCCAGA TATGTATATG
GAGAAGCTGA TCGTCGGACC GGGAGCCAAA GGCACCATTG ATCTGAACCT GCCGCTGGCG
GATAACCTGC GCAATGTTGC GGCGGCGCTC GGTAAACCGT TGAGCGAACT GACGGTAACA
ATTCTGGCTA AACCACGCCA CGATGCCGTT ATCGCTGAAA TGCAGCAACT CGGCGTACGC
GTATTTGCTA TTCCGGACGG CGACGTTGCG GCCTCAATTC TCACCTGTAT GCCAGACAGC
GAAGTTGACG TGCTGTACGG TATTGGTGGC GCGCCGGAAG GCGTGGTTTC TGCAGCGGTG
ATCCGCGCAT TAGATGGTGA CATGAACGGT CGTCTGCTGG CGCGTCATGA CGTCAAAGGC
GACAACGAAG AGAATCGTCG CATTGGCGAG CAGGAGCTGG CACGCTGCAA AGCAATGGGC
ATCGAAGCCG GTAAAGTATT GCGTCTGGGC GATATGGCGC GCAGCGATAA CGTCATCTTC
TCTGCCACCG GTATTACCAA AGGCGATCTG CTGGAAGGCA TTAGCCGCAA AGGCAATATC
GCGACTACCG AAACGCTGCT GATCCGCGGC AAGTCACGCA CTATTCGCCG CATTCAGTCC
ATCCACTATC TGGATCGCAA AGACCCGGAA ATGCAGGTGC ACATTCTCTG A
 
Protein sequence
MRRELAIEFS RVTESAALAG YKWLGRGDKN TADGAAVNAM RIMLNQVNID GTIVIGEGEI 
DEAPMLYIGE KVGTGRGDAV DIAVDPIEGT RMTAMGQANA LAVLAVGDKG CFLNAPDMYM
EKLIVGPGAK GTIDLNLPLA DNLRNVAAAL GKPLSELTVT ILAKPRHDAV IAEMQQLGVR
VFAIPDGDVA ASILTCMPDS EVDVLYGIGG APEGVVSAAV IRALDGDMNG RLLARHDVKG
DNEENRRIGE QELARCKAMG IEAGKVLRLG DMARSDNVIF SATGITKGDL LEGISRKGNI
ATTETLLIRG KSRTIRRIQS IHYLDRKDPE MQVHIL
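
The gene and protein sequences above are shown in blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares with the standard genetic code (table 11 differs in the set of permitted start codons), and it assumes an unambiguous A/C/G/T sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGAGACGAG AACTTGCCAT CGAATTTTCC CGCGTCACCG AATCTGCGGC GCTGGCTGGC"
print(translate(first_row))  # MRRELAIEFSRVTESAALAG
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field of this record.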