Gene EcSMS35_0736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0736 
SymbolsdhA 
ID6143566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp740732 
End bp742498 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content56% 
IMG OID641615625 
Productsuccinate dehydrogenase flavoprotein subunit 
Protein accessionYP_001742824 
Protein GI170681511 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID[TIGR01812] succinate dehydrogenase or fumarate reductase, flavoprotein subunitGram-negative/mitochondrial subgroup
[TIGR01816] succinate dehydrogenase, flavoprotein subunit, E. coli/mitochondrial subgroup 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGC CAGTCAGAGA ATTTGATGCA GTTGTGATTG GTGCCGGTGG CGCAGGTATG 
CGCGCGGCGC TGCAAATTTC CCAGAGCGGT CAGACCTGTG CGCTGCTCTC TAAAGTCTTC
CCGACCCGTT CCCATACCGT TTCTGCGCAA GGCGGCATTA CCGTTGCGCT GGGTAATACC
CATGAAGATA ACTGGGAATG GCATATGTAC GATACCGTAA AAGGGTCGGA CTATATCGGT
GACCAGGACG CGATTGAATA TATGTGTAAA ACCGGGCCGG AAGCGATTCT GGAACTCGAA
CACATGGGCC TGCCGTTTTC GCGTCTTGAT GATGGTCGTA TCTATCAACG TCCGTTTGGC
GGTCAGTCGA AAAACTTCGG CGGCGAGCAG GCGGCACGTA CTGCGGCGGC TGCTGACCGT
ACCGGTCACG CACTGTTGCA CACCCTTTAT CAGCAAAACC TGAAAAACCA CACCACCATT
TTCTCCGAGT GGTATGCGCT GGATCTGGTG AAAAACCAGG ATGGCGCGGT GGTCGGTTGT
ACCGCACTGT GCATCGAAAC TGGTGAAGTG GTTTACTTCA AAGCTCGCGC TACCGTGCTG
GCGACTGGCG GGGCAGGGCG TATTTATCAG TCCACCACCA ACGCCCACAT TAACACTGGC
GACGGTGTCG GCATGGCTAT CCGTGCAGGC GTACCGGTAC AGGATATGGA AATGTGGCAG
TTCCACCCGA CCGGTATTGC CGGTGCGGGC GTACTGGTCA CCGAAGGTTG CCGTGGTGAA
GGCGGTTATC TGCTGAACAA ACATGGCGAA CGCTTTATGG AGCGTTATGC GCCGAACGCC
AAAGACCTGG CGGGCCGTGA TGTGGTGGCG CGTTCCATCA TGATCGAAAT CCGTGAAGGT
CGCGGCTGTG ATGGTCCGTG GGGGCCACAC GCGAAACTGA AACTCGATCA CCTGGGTAAA
GAAGTTCTCG AATCCCGTCT GCCGGGCATC CTGGAGCTTT CCCGTACCTT CGCTCACGTT
GATCCGGTGA AAGAGCCGAT TCCGGTTATC CCAACCTGTC ACTACATGAT GGGCGGTATT
CCGACCAAAG TGACCGGTCA GGCGCTGACG GTGAACGAGA AAGGCGAAGA TGTGGTTGTT
CCGGGGCTGT TTGCCGTTGG TGAAATCGCT TGTGTATCGG TACACGGCGC TAACCGTCTG
GGCGGCAACT CGCTGCTGGA CCTGGTGGTC TTTGGTCGCG CAGCGGGTCT GCATCTGCAA
GAGTCTATCG CCGAGCAGGG CGCACTGCGC GATGCCAGCG AGTCTGATGT AGAAGCGTCT
CTGGATCGCC TGAACCGCTG GAACAACAAC CGTAACGGTG AAGATCCGGT GGCGATCCGT
AAAGCACTGC AAGAATGTAT GCAGCATAAC TTCTCGGTCT TCCGTGAAGG TGATGCGATG
GCGAAAGGTC TTGAGCAGTT GAAAGTTATC CGCGAGCGTC TGAAAAATGC CCGTCTGGAT
GACACTTCCA GCGAGTTCAA CACCCAGCGC GTTGAGTGCC TGGAACTGGA TAACCTGATG
GAAACGGCGT ATGCAACGGC TGTTTCTGCC AACTTCCGTA CCGAAAGCCG TGGCGCGCAT
AGCCGCTTCG ACTTCCCGGA TCGTGATGAT GAAAACTGGC TGTGCCACTC CCTGTATCTG
CCTGAGTCGG AATCCATGAC GCGCCGAAGC GTCAACATGG AACCGAAACT GCGCCCGGCA
TTCCCGCCGA AGATTCGTAC TTACTAA
 
Protein sequence
MKLPVREFDA VVIGAGGAGM RAALQISQSG QTCALLSKVF PTRSHTVSAQ GGITVALGNT 
HEDNWEWHMY DTVKGSDYIG DQDAIEYMCK TGPEAILELE HMGLPFSRLD DGRIYQRPFG
GQSKNFGGEQ AARTAAAADR TGHALLHTLY QQNLKNHTTI FSEWYALDLV KNQDGAVVGC
TALCIETGEV VYFKARATVL ATGGAGRIYQ STTNAHINTG DGVGMAIRAG VPVQDMEMWQ
FHPTGIAGAG VLVTEGCRGE GGYLLNKHGE RFMERYAPNA KDLAGRDVVA RSIMIEIREG
RGCDGPWGPH AKLKLDHLGK EVLESRLPGI LELSRTFAHV DPVKEPIPVI PTCHYMMGGI
PTKVTGQALT VNEKGEDVVV PGLFAVGEIA CVSVHGANRL GGNSLLDLVV FGRAAGLHLQ
ESIAEQGALR DASESDVEAS LDRLNRWNNN RNGEDPVAIR KALQECMQHN FSVFREGDAM
AKGLEQLKVI RERLKNARLD DTSSEFNTQR VECLELDNLM ETAYATAVSA NFRTESRGAH
SRFDFPDRDD ENWLCHSLYL PESESMTRRS VNMEPKLRPA FPPKIRTY