Gene EcHS_A2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2384 
SymbolglpC 
ID5591521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2396503 
End bp2397693 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID640921511 
Productsn-glycerol-3-phosphate dehydrogenase subunit C 
Protein accessionYP_001459045 
Protein GI157161727 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID[TIGR03379] glycerol-3-phosphate dehydrogenase, anaerobic, C subunit 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA CCAGCTTCGA AAACTGCATT AAGTGCACCG TCTGCACCAC CGCCTGCCCG 
GTGAGCCGGG TGAATCCCGG TTATCCAGGG CCAAAACAAG CCGGGCCGGA TGGCGAGCGT
CTGCGTTTGA AAGATGGCGC ACTGTATGAC GAGGCGCTGA AATATTGCAT CAACTGCAAA
CGTTGTGAAG TCGCCTGCCC GTCCGATGTG AAGATTGGCG ATATTATCCA GCGCGCGCGG
GCAAAATATG ACACCACGCG CCCGTCGCTG CGTAATTTTG TGTTGAGTCA TACCGACCTG
ATGGGTAGCG TTTCCACGCC GTTCGCACCA ATCGTCAACA CCGCTACCTC GCTGAAACCG
GTGCGGCAGC TGCTTGATGC GGCGTTAAAA ATCGATCATC GTCGCACGCT ACCGAAATAC
TCCTTCGGCA CGTTCCGTCG CTGGTATCGC AGCATAGCGG CTCAGCAAGC ACAATATAAA
GACCAGGTCG CTTTCTTTCA CGGATGCTTC GTTAACTACA ACCATCCGCA GTTAGGTAAA
GATTTAATTA AAGTGCTCAA CGCAATGGGT ACCGGTGTAC AACTGCTCAG CAAAGAAAAA
TGCTGCGGCG TACCGCTAAT CGCCAACGGC TTTACCGATA AAGCACGCAA ACAGGCAATT
ACGAATGTAG AGTCGATCCG CGAAGCTGTG GGAGTAAAAG GCATTCCGGT GATTGCCACC
TCCTCAACCT GTACATTTGC CCTGCGCGAC GAATACCCGG AAGTGCTGAA TGTCGACAAC
AAAGGCTTGC GCGATCATAT CGAACTGGCA ACCCGCTGGC TGTGGCGCAA GCTGGACGAA
GGCAAAACGT TACCGCTGAA ACCGCTGCCG CTGAAAGTGG TTTATCACAC TCCGTGCCAT
ATGGAAAAAA TGGGCTGGAC GCTCTACACC CTGGAGCTGT TGCGTAAAAT CCCGGGGCTT
GAGTTAACGG TGCTGGATTC CCAGTGCTGC GGTATTGCGG GTACTTACGG TTTCAAAAAA
GAGAACTACC CCACCTCACA AGCCATCGGC GCACCACTGT TCCGCCAGAT AGAAGAAAGC
GGCGCAGATC TGGTGGTCAC CGACTGCGAA ACCTGTAAAT GGCAGATTGA GATGTCCACA
AGTCTTCGCT GCGAACATCC GATTACGCTA CTGGCCCAGG CGCTGGCTTA A
 
Protein sequence
MNDTSFENCI KCTVCTTACP VSRVNPGYPG PKQAGPDGER LRLKDGALYD EALKYCINCK 
RCEVACPSDV KIGDIIQRAR AKYDTTRPSL RNFVLSHTDL MGSVSTPFAP IVNTATSLKP
VRQLLDAALK IDHRRTLPKY SFGTFRRWYR SIAAQQAQYK DQVAFFHGCF VNYNHPQLGK
DLIKVLNAMG TGVQLLSKEK CCGVPLIANG FTDKARKQAI TNVESIREAV GVKGIPVIAT
SSTCTFALRD EYPEVLNVDN KGLRDHIELA TRWLWRKLDE GKTLPLKPLP LKVVYHTPCH
MEKMGWTLYT LELLRKIPGL ELTVLDSQCC GIAGTYGFKK ENYPTSQAIG APLFRQIEES
GADLVVTDCE TCKWQIEMST SLRCEHPITL LAQALA