Gene EcolC_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0100 
SymbolgpsA 
ID6068343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp104918 
End bp105937 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content57% 
IMG OID641599504 
ProductNAD(P)H-dependent glycerol-3-phosphate dehydrogenase 
Protein accessionYP_001723113 
Protein GI170018159 
COG category[C] Energy production and conversion 
COG ID[COG0240] Glycerol-3-phosphate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAAC GTAATGCTTC AATGACTGTG ATCGGTGCCG GCTCGTACGG CACCGCTCTT 
GCCATTACCC TGGCAAGAAA TGGCCACGAG GTTGTCCTCT GGGGCCATGA CCCTGAACAT
ATCGCAACGC TTGAACGCGA CCGCTGTAAC GCCGCGTTTC TCCCCGATGT GCCTTTTCCC
GATACGCTCC ATCTTGAAAG CGATCTCGCC ACTGCGCTGG CAGCCAGCCG TAATATTCTC
GTCGTCGTAC CCAGCCATGT CTTTGGTGAA GTGCTGCGCC AGATTAAACC GCTGATGCGT
CCTGATGCGC GTCTGGTGTG GGCGACCAAA GGGCTGGAAG CGGAAACCGG GCGTCTGTTA
CAGGACGTGG CCCGCGAGGC GTTAGGCGAT CAAATTCCGC TGGCGGTTAT CTCTGGCCCA
ACGTTTGCGA AAGAACTGGC GGCAGGTTTA CCGACAGCTA TTTCGCTGGC CTCTACCGAC
CAGACCTTTG CCGATGATCT CCAACAATTG CTGCACTGTG GCAAAAGTTT CCGCGTTTAC
AGCAACCCGG ATTTCATTGG CGTGCAGCTT GGCGGTGCGG TGAAAAACGT CATTGCCATT
GGCGCGGGGA TGTCCGACGG TATCGGTTTT GGTGCGAATG CGCGTACGGC ACTGATCACC
CGTGGGCTGG CTGAAATGTC GCGTCTTGGC GCGGCGCTGG GTGCCGATCC TGCCACCTTT
ATGGGCATGG CGGGGCTGGG CGATCTTGTG CTTACCTGTA CCGACAACCA GTCGCGTAAC
CGCCGTTTTG GCATGATGCT CGGTCAGGGC ATGGATGTAC AAAGCGCGCA GGAGAAGATT
GGTCAGGTGG TGGAAGGCTA CCGCAATACG AAAGAAGTCC GCGAACTGGC GTATCGCTTC
GGCGTAGAAA TGCCAATAAC CGAGGAAATT TATCAAGTAT TATATTGCGG AAAAAACGCG
CGCGAGGCAG CATTGACGTT ATTAGGTCGT GCACGCAAGG ACGAGCGCAG CAGCCACTAA
 
Protein sequence
MNQRNASMTV IGAGSYGTAL AITLARNGHE VVLWGHDPEH IATLERDRCN AAFLPDVPFP 
DTLHLESDLA TALAASRNIL VVVPSHVFGE VLRQIKPLMR PDARLVWATK GLEAETGRLL
QDVAREALGD QIPLAVISGP TFAKELAAGL PTAISLASTD QTFADDLQQL LHCGKSFRVY
SNPDFIGVQL GGAVKNVIAI GAGMSDGIGF GANARTALIT RGLAEMSRLG AALGADPATF
MGMAGLGDLV LTCTDNQSRN RRFGMMLGQG MDVQSAQEKI GQVVEGYRNT KEVRELAYRF
GVEMPITEEI YQVLYCGKNA REAALTLLGR ARKDERSSH