Gene EcolC_3856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3856 
Symbol 
ID6067558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4214758 
End bp4216566 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content56% 
IMG OID641603271 
Productfumarate reductase flavoprotein subunit 
Protein accessionYP_001726787 
Protein GI170021833 
COG category[C] Energy production and conversion 
COG ID[COG1053] Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 
TIGRFAM ID[TIGR01176] fumarate reductase, flavoprotein subunit
[TIGR01812] succinate dehydrogenase or fumarate reductase, flavoprotein subunitGram-negative/mitochondrial subgroup 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00198988 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCAAACCT TTCAAGCCGA TCTTGCCATT GTAGGCGCCG GTGGCGCGGG ATTACGTGCT 
GCAATTGCTG CCGCGCAGGC AAATCCAAAT GCAAAAATCG CACTAATCTC AAAAGTATAC
CCGATGCGTA GCCATACCGT TGCTGCAGAA GGGGGCTCCG CCGCTGTCGC GCAGGATCAT
GACAGCTTCG AATATCACTT TCACGATACA GTAGCGGGTG GCGACTGGTT GTGTGAGCAG
GATGTCGTGG ATTATTTCGT CCACCACTGC CCAACCGAAA TGACCCAACT GGAACTGTGG
GGGTGCCCAT GGAGCCGTCG CCCGGATGGT AGCGTCAACG TACGTCGCTT CGGCGGCATG
AAAATCGAGC GTACCTGGTT CGCCGCCGAT AAGACCGGCT TCCATATGCT GCACACGCTG
TTCCAGACCT CTCTGCAATT CCCGCAGATC CAGCGTTTTG ACGAACATTT CGTGCTGGAT
ATTCTGGTTG ATGATGGTCA TGTTCGCGGC CTGGTAGCAA TGAACATGAT GGAAGGCACG
CTGGTGCAGA TCCGTGCTAA CGCGGTCGTT ATGGCTACCG GCGGTGCGGG TCGCGTTTAT
CGTTACAACA CCAACGGCGG CATCGTTACC GGTGACGGTA TGGGTATGGC GCTAAGCCAC
GGCGTTCCGC TGCGTGACAT GGAATTCGTT CAGTATCACC CAACCGGTCT GCCAGGTTCC
GGTATCCTGA TGACCGAAGG CTGCCGTGGT GAAGGCGGTA TTCTGGTCAA CAAAAATGGC
TACCGTTATC TGCAAGATTA CGGCATGGGC CCGGAAACTC CGCTGGGCGA GCCGAAAAAC
AAATATATGG AACTGGGTCC ACGCGACAAA GTTTCTCAGG CCTTCTGGCA CGAATGGCGT
AAAGGCAACA CCATCTCCAC GCCGCGTGGT GATGTGGTTT ATCTCGACCT GCGTCACCTC
GGCGAGAAAA AACTGCATGA ACGTCTGCCG TTCATCTGCG AACTGGCGAA AGCGTACGTT
GGCGTCGATC CGGTTAAAGA ACCGATTCCG GTACGTCCGA CCGCACACTA CACCATGGGC
GGTATCGAAA CCGATCAGAA CTGTGAAACC CGCATTAAAG GTCTGTTCGC CGTGGGTGAA
TGTTCCTCTG TTGGTCTGCA CGGTGCAAAC CGTCTGGGCT CCAACTCCCT GGCGGAACTG
GTGGTCTTCG GCCGTCTGGC CGGTGAACAA GCGACAGAGC GTGCAGCAAC TGCCGGTAAT
GGCAACGAAG CGGCAATTGA AGCGCAGGCA GCTGGCGTTG AACAACGTCT GAAAGATCTG
GTTAACCAGG ATGGCGGCGA AAACTGGGCG AAGATCCGCG ACGAAATGGG CCTGGCAATG
GAAGAAGGCT GCGGTATCTA CCGTACGCCG GAACTGATGC AGAAAACCAT CGACAAGCTG
GCTGAGCTGC AGGAACGCTT CAAGCGCGTG CGCATCACCG ACACTTCCAG CGTGTTCAAC
ACCGACCTGC TCTACACCAT TGAACTGGGC CACGGTCTGA ACGTTGCTGA ATGTATGGCG
CACTCCGCAA TGGCACGTAA AGAGTCCCGC GGCGCACACC AGCGTCTGGA CGAAGGTTGC
ACCGAGCGTG ACGACGTCAA CTTCCTCAAA CACACCCTCG CCTTCCGCGA TGCTGATGGC
ACGACTCGCC TGGAGTACAG CGACGTGAAG ATTACTACGC TGCCGCCAGC TAAACGCGTT
TACGGTGGCG AAGCGGATGC AGCCGATAAG GCGGAAGCAG CCAATAAGAA GGAGAAGGCG
AATGGCTGA
 
Protein sequence
MQTFQADLAI VGAGGAGLRA AIAAAQANPN AKIALISKVY PMRSHTVAAE GGSAAVAQDH 
DSFEYHFHDT VAGGDWLCEQ DVVDYFVHHC PTEMTQLELW GCPWSRRPDG SVNVRRFGGM
KIERTWFAAD KTGFHMLHTL FQTSLQFPQI QRFDEHFVLD ILVDDGHVRG LVAMNMMEGT
LVQIRANAVV MATGGAGRVY RYNTNGGIVT GDGMGMALSH GVPLRDMEFV QYHPTGLPGS
GILMTEGCRG EGGILVNKNG YRYLQDYGMG PETPLGEPKN KYMELGPRDK VSQAFWHEWR
KGNTISTPRG DVVYLDLRHL GEKKLHERLP FICELAKAYV GVDPVKEPIP VRPTAHYTMG
GIETDQNCET RIKGLFAVGE CSSVGLHGAN RLGSNSLAEL VVFGRLAGEQ ATERAATAGN
GNEAAIEAQA AGVEQRLKDL VNQDGGENWA KIRDEMGLAM EEGCGIYRTP ELMQKTIDKL
AELQERFKRV RITDTSSVFN TDLLYTIELG HGLNVAECMA HSAMARKESR GAHQRLDEGC
TERDDVNFLK HTLAFRDADG TTRLEYSDVK ITTLPPAKRV YGGEADAADK AEAANKKEKA
NG