Gene EcSMS35_4882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4882 
Symbol 
ID6145977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4997804 
End bp4998955 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID641619686 
Productputative 2-hydroxyglutaryl-CoA dehydratase, D-component 
Protein accessionYP_001746793 
Protein GI170680225 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.30993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.992487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTTG TCACCGATCT ACCCGCTATT TTCGATCAGT TCTCTGAAGC TCGCCAGAAA 
GGCTTTCTCA CCGTCATGGA TCTCAAGGAG CGCGGCATTC CGCTGGTTGG CACCTACTGC
ACCTTTATGC CGCAAGAGAT CCCGATGGCA GCCGGTGCAG TTGTGGTTTC GCTCTGTTCC
ACCTCCGATG AAACCATTGA AGAAGCCGAG AAAGATCTGC CGCGCAATCT CTGCCCGCTG
ATTAAAAGCA GCTACGGCTT CGGCAAAACC GATAAATGCC CCTACTTCTA CTTTTCCGAT
CTGGTGGTGG GTGAAACCAC CTGCGACGGC AAAAAGAAAA TGTATGAATA CATGGCAGAG
TTTAAGCCCG TGCATGTGAT GCAATTGCCC AACAGCGTTA AGGACGATGC CTCGCGTGCA
TTATGGAAAG CTGAGATGCT GCGCTTACAA AAAGCGGTAG AGGAACGTTT TGGACACGAG
ATTAGCGAAG ATGCTCTGCG CGATGCCATT GCGCTGAAAA ACCGCGAACG TCGCGCACTG
GCTAATTTTT ATCATCTTGG GCAGTTAAAT CCTCCGGCGC TTAGCGGCAG CGACATTCTG
AAAGTGGTTT ACGGCGCAAC CTTCCGGTTC GATAAAGAGG CGTTGGTTGA CGAACTGGAT
GCAATGACCG CCCGCGTTCG TCAGCAGTGG GAAGAAGGCC AGCGACTGGA CCCGCGTCCA
CGCATTTTAA TCACCGGCTG CCCAATTGGC GGCGCAGCAG AGAAAGTGGT GCGCGCGATT
GAAGAGAATG GCGGCTGGGT TGTCGGTTAT GAAAACTGCA CCGGGGCGAA AGCCACCGAG
CAATGCGTGG CGGAAACGGG TGATGTGTAC GACGCGCTGG CGGATAAATA TCTGGCGATT
GGCTGCTCCT GTGTTTCGCC GAACGATCAG CGCCTGCAAA TGCTCAGCCA GATGGTTGAA
GAATATCAGG TCGATGGCGT AGTTGATGTG ATTTTGCAGG CGTGCCATAC CTACGCGGTG
GAATCGCTGG CGATTAAACG TCATGTGCGT CAGCAGCACA ACATTCCTTA TATCGCTATT
GAAACAGACT ACTCCACCTC AGATGTCGGG CAGCTCAGCA CCCGTGTCGC GGCCTTTATT
GAGATGCTGT AA
 
Protein sequence
MSLVTDLPAI FDQFSEARQK GFLTVMDLKE RGIPLVGTYC TFMPQEIPMA AGAVVVSLCS 
TSDETIEEAE KDLPRNLCPL IKSSYGFGKT DKCPYFYFSD LVVGETTCDG KKKMYEYMAE
FKPVHVMQLP NSVKDDASRA LWKAEMLRLQ KAVEERFGHE ISEDALRDAI ALKNRERRAL
ANFYHLGQLN PPALSGSDIL KVVYGATFRF DKEALVDELD AMTARVRQQW EEGQRLDPRP
RILITGCPIG GAAEKVVRAI EENGGWVVGY ENCTGAKATE QCVAETGDVY DALADKYLAI
GCSCVSPNDQ RLQMLSQMVE EYQVDGVVDV ILQACHTYAV ESLAIKRHVR QQHNIPYIAI
ETDYSTSDVG QLSTRVAAFI EML