Gene EcSMS35_2395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2395 
SymbolglpC 
ID6145649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2444841 
End bp2446031 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID641617268 
Productsn-glycerol-3-phosphate dehydrogenase subunit C 
Protein accessionYP_001744440 
Protein GI170683880 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID[TIGR03379] glycerol-3-phosphate dehydrogenase, anaerobic, C subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA CCAGCTTCGA AAACTGCATT AAGTGCACCG TCTGCACCAC CGCCTGCCCG 
GTGAGCCGGG TGAATCCGGG ATATCCGGGG CCAAAACAAG CCGGGCCGGA TGGCGAGCGT
CTGCGTTTGA AAGATGGCGC ACTGTATGAC GAGGCACTGA AATATTGCAT CAACTGCAAA
CGTTGCGAAG TCGCCTGCCC GTCCGATGTG AAGATTGGCG ATATCATCCA GCGCGCGCGG
GCGAAATATG ACACCACACG CCCGTCGCTG CGCAATTTTG TGTTGAGTCA TACCGACCTG
ATGGGTAGCG TTTCCACGCC ATTCGCACCA ATCGTCAATA CCGCTACCTC GCTGAAACCA
GTGCGGCAGC TGCTTGATGC GGCGTTAAAA ATCGATCATC GCCGCACGCT ACCGAAATAC
TCCTTCGGTA CGTTCCGTCG CTGGTATCGC AGCGTGGCGG CTCAGCAAGC GCAGTATAAA
GACCAGGTCG CTTTCTTTCA CGGCTGCTTC GTTAACTACA ACCATCCGCA GTTAGGTAAA
GATTTAATTA AAGTGCTCAA CGCAATGGGT ACCGGTGTAC AACTGCTCAG CAAAGAAAAA
TGCTGCGGCG TACCGCTGAT CGCCAACGGC TTTACCGATA AAGCACGTAA ACAGGCAATT
ACGAATGTAG AGTCGATCCG CGAAGCTGTG GGAGTAAAAG GCATTCCGGT GATTGCCACC
TCCTCAACTT GTACATTTGC CCTGCGCGAC GAATACCCAG AAGTGCTGAA TGTAGACAAC
AAAGGCTTGC GCGATCATAT CGAACTGGCG ACCCGCTGGC TGTGGCGCAA GCTGGACGAA
GGCAAAACGT TACCGCTGAA AACGCTGCCG CTGAAAGTGG TTTATCACAC TCCGTGCCAT
ATGGAAAAAA TGGGCTGGAC GCTCTACACC CTGGAGCTGT TGCGTAAAAT CCCGGGGCTT
GAGTTAACGG TGCTGGATTC CCAGTGCTGC GGTATTGCGG GAACTTACGG TTTCAAAAAA
GAGAACTACC CCACTTCACA AGCCATCGGC GCACCACTGT TCCGCCAGAT AGAAGAGAGC
GGCGCAGATC TGGTGGTCAC CGACTGCGAA ACCTGTAAAT GGCAGATTGA GATGTCCACA
AGTCTTCGCT GCGAACATCC GATTACGCTG CTGGCCCAGG CACTGGCTTA A
 
Protein sequence
MNDTSFENCI KCTVCTTACP VSRVNPGYPG PKQAGPDGER LRLKDGALYD EALKYCINCK 
RCEVACPSDV KIGDIIQRAR AKYDTTRPSL RNFVLSHTDL MGSVSTPFAP IVNTATSLKP
VRQLLDAALK IDHRRTLPKY SFGTFRRWYR SVAAQQAQYK DQVAFFHGCF VNYNHPQLGK
DLIKVLNAMG TGVQLLSKEK CCGVPLIANG FTDKARKQAI TNVESIREAV GVKGIPVIAT
SSTCTFALRD EYPEVLNVDN KGLRDHIELA TRWLWRKLDE GKTLPLKTLP LKVVYHTPCH
MEKMGWTLYT LELLRKIPGL ELTVLDSQCC GIAGTYGFKK ENYPTSQAIG APLFRQIEES
GADLVVTDCE TCKWQIEMST SLRCEHPITL LAQALA