Gene EcSMS35_4861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4861 
SymbolcglE 
ID6146491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4969010 
End bp4970374 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content52% 
IMG OID641619665 
Productdihydrolipoamide dehydrogenase CglE 
Protein accessionYP_001746772 
Protein GI170681610 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.371121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACA TGATTAATGA AAGTGCACGG CAAACGCCAG TCATTGCACA AACGGACGTT 
CTGGTTATCG GGGGCGGTCC GGCAGGATTA ACGGCGGCGA TAGCGGCGGG GCGGCTAGGT
GCCAGAACCA TGATTGTTGA GCGTTACGGC TCACTAGGCG GCGTATTGAC GCAGGTTGGT
GTGGAAAGTT TTGCCTGGTA TCGCCATCCG GGGACGGAAG ATTGTGAAGG GATTTGTCGT
GAATATGAAG GCCGCGCGCG AGCATTGGGC TTCACGCGAC CAGAACCTCA GTCAATTAGC
GAAGTTATAG ATACTGAAGG ATTTAAAGTT GTTGCCGATC AGATGATAAC GGAAGCTGGC
GTTGAGCCGT TATATCACTC CTGGGTTGTG GATGTGATCA AGGAAGGGGA TACGTTATGC
GGTGTTATTG TCGAGAACAA ATCAGGACGA GGGGCAATTC TGGCGAAAAG AATCGTCGAT
TGCACGGGGG ATGCTGATAT TGCTGCTCGT GCAGGCGCGC CCTGGACGAA ACGGGGAAAG
GACCAACTGA TGGGCGTCAC CGTGATGTTC AGTTGCGCAG GTGTCGATGT GGCGCGCTTT
AACCGTTTTG TTGCGGAAGA ACTTAAGCCG ACCTACGCGG ATTGGGGAAA AAACTGGACG
ATTCAAACCA CGGGCAAAGA AGACCAGATG TTTAGCCCTT ATATGGAGGA TATTTTTACC
CGGGCGCAGC AGGATGGTGT GATTCCAGGC GACGCCCAGG CGATTGCCGG AACCTGGTCA
ACCTTTTCTG AAAGCGGCGA AGCGTTTCAG ATGAATATGG TGTACGCCTT TGGTTTTGAC
TGTACCGATG TCTTCGATTT AACCAAAGCA GAAATCGCCG GAAGGCAGCA AGCATTATGG
GCAATTGACG CTCTACGCCA TTATGTTCCG GGCTTTGAAA ATGTACGGTT ACGCAATTTT
GGCGCCACGC TGGGTACACG CGAATCACGG CTTATTGAGG GGGAAATACG TATTGCTGAT
GATTACGTCC TTAATCAGGG GCGTTGTTCG GACAGTGTAG GGATTTTCCC GGAATTTATT
GATGGTTCCG GTTATCTCAT TTTGCCAACG ACCGGGCGTT TCTTTCAGAT CCCCTATGGC
TGTCTGGTGC CACAAAAAGT GGAGAACCTT TTGGTCGCCG GTCGCTGTAT TTCCGCAGGC
GTAGTTGCAC ATACTTCTAT GCGCAACATG ATGTGTTGTG CCGTTACCGG TGAGGCCGCA
GGCACTGCCG CCGTGGTTTC GCTACAGCAA CATTGCACCG TGCGTCAGGT TGCTATTCCT
GATTTGCAAA ACACGCTGCA ACAGCAGGGC GTACGTCTGG CATAA
 
Protein sequence
MVDMINESAR QTPVIAQTDV LVIGGGPAGL TAAIAAGRLG ARTMIVERYG SLGGVLTQVG 
VESFAWYRHP GTEDCEGICR EYEGRARALG FTRPEPQSIS EVIDTEGFKV VADQMITEAG
VEPLYHSWVV DVIKEGDTLC GVIVENKSGR GAILAKRIVD CTGDADIAAR AGAPWTKRGK
DQLMGVTVMF SCAGVDVARF NRFVAEELKP TYADWGKNWT IQTTGKEDQM FSPYMEDIFT
RAQQDGVIPG DAQAIAGTWS TFSESGEAFQ MNMVYAFGFD CTDVFDLTKA EIAGRQQALW
AIDALRHYVP GFENVRLRNF GATLGTRESR LIEGEIRIAD DYVLNQGRCS DSVGIFPEFI
DGSGYLILPT TGRFFQIPYG CLVPQKVENL LVAGRCISAG VVAHTSMRNM MCCAVTGEAA
GTAAVVSLQQ HCTVRQVAIP DLQNTLQQQG VRLA