Gene EcHS_A4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4156 
SymbolglpX 
ID5591720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4142578 
End bp4143588 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content57% 
IMG OID640923258 
Productfructose 1,6-bisphosphatase II 
Protein accessionYP_001460717 
Protein GI157163399 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1494] Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase and related proteins 
TIGRFAM ID[TIGR00330] fructose-1,6-bisphosphatase, class II 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGAG AACTTGCCAT CGAATTTTCC CGCGTCACCG AATCAGCGGC GCTGGCTGGC 
TACAAATGGT TAGGACGCGG CGATAAAAAC ACCGCGGACG GCGCGGCGGT AAACGCCATG
CGTATTATGC TCAACCAGGT CAACATTGAC GGCACCATCG TCATTGGTGA AGGTGAAATC
GACGAAGCAC CGATGCTCTA CATTGGTGAA AAAGTCGGTA CTGGTCGCGG CGACGCGGTA
GATATTGCTG TTGATCCGAT TGAAGGCACG CGCATGACGG CGATGGGCCA GGCTAACGCG
CTGGCGGTGC TGGCAGTAGG CGATAAAGGC TGCTTCCTCA ATGCGCCGGA TATGTATATG
GAGAAGCTGA TTGTCGGGCC GGGAGCCAAA GGCACCATTG ATCTGAACCT GCCGCTGGCG
GATAACCTGC GCAATGTAGC GGCGGCGCTC GGCAAACCGT TGAGCGAACT GACGGTAACG
ATTCTGGCTA AACCACGCCA CGATGCCGTT ATCGCTGAAA TGCAGCAACT CGGCGTACGC
GTATTTGCTA TTCCGGACGG CGACGTTGCG GCCTCAATTC TCACCTGTAT GCCAGACAGC
GAAGTTGACG TGCTGTACGG TATTGGTGGC GCGCCGGAAG GCGTAGTTTC TGCGGCGGTG
ATCCGCGCAT TAGATGGCGA CATGAACGGT CGTCTGCTGG CGCGTCATGA CGTCAAAGGC
GACAACGAAG AGAATCGTCG CATTGGCGAG CAGGAGCTGG CACGCTGCAA AGCGATGGGC
ATCGAAGCCG GTAAAGTATT GCGCCTGGGC GATATGGCGC GCAGCGATAA CGTCATCTTC
TCTGCCACCG GTATTACCAA AGGCGATCTG CTGGAAGGCA TTAGCCGCAA AGGCAATATC
GCGACTACCG AAACGCTGCT GATCCGCGGC AAGTCACGCA CCATTCGCCG CATTCAGTCC
ATCCACTATC TGGATCGCAA AGACCCGGAA ATGCAGGTGC ACATCCTCTG A
 
Protein sequence
MRRELAIEFS RVTESAALAG YKWLGRGDKN TADGAAVNAM RIMLNQVNID GTIVIGEGEI 
DEAPMLYIGE KVGTGRGDAV DIAVDPIEGT RMTAMGQANA LAVLAVGDKG CFLNAPDMYM
EKLIVGPGAK GTIDLNLPLA DNLRNVAAAL GKPLSELTVT ILAKPRHDAV IAEMQQLGVR
VFAIPDGDVA ASILTCMPDS EVDVLYGIGG APEGVVSAAV IRALDGDMNG RLLARHDVKG
DNEENRRIGE QELARCKAMG IEAGKVLRLG DMARSDNVIF SATGITKGDL LEGISRKGNI
ATTETLLIRG KSRTIRRIQS IHYLDRKDPE MQVHIL