Gene EcolC_3728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3728 
Symbol 
ID6065654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4081205 
End bp4082356 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID641603145 
Product2-hydroxyglutaryl-CoA dehydratase D-component 
Protein accessionYP_001726665 
Protein GI170021711 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTTA TCACCGATCT ACCCGCCATT TTCGATCAAT TCTCTGAAGC TCGCCAGAAA 
GGTTTTCTCG CCGTTATGGA TTTCAAAGAG CGCGGCATTC CGCTGGTTGG CACCTACTGC
ACGTTCATGC CGCAAGAGAT CCCGATGGCT GCCGGTGCGG TTGTGGTTTC ACTCTGCTCC
ACCTCCGATG AAACCATCGA AGAAGCCGAG AAAGACCTGC CGCGCAATCT CTGCCCGCTG
ATCAAAAGCA GCTACGGCTT TGGCAAAACC GATAAATGCC CCTACTTCTA CTTTTCCGAT
CTGGTGGTTG GTGAAACTAC CTGCGACGGC AAAAAGAAAA TGTATGAATA CATGGCGGAG
TTTAAGCCCG TTCATGTGAT GCAATTGCCT AACAGCGTGC AGGATGACGC CTCACGCGCG
TTGTGGAAAG CCGAGATGCT GCGCCTGCAA ACAGCGGTAG AAGAGCGTTT TGGCAAAGAG
ATAACCGAAG AGGCGCTGCG CGATGCCATT GCACTGAAAA ACCGCGAACG CCGTGCGCTG
GCCAATTTTT ATCATCTTGG GCAGTTAAAT CCTCCGGCGT TAAGCGGCAG CGACATTCTG
AAAGTGGTTT ACGGCGCAAC ATTCCGTTTT GACAAAGAAG CGCTGATCGA CGAACTCGAT
GCAATGACCG CCCGCGTTCG CCAGCAGTGG GAAGAGGGTC AGCGGCTGGA CACGCGTCCA
CGCATTCTGA TAACTGGCTG CCCGATTGGC GGTGCGGCAG AGAAAGTGGT GCGTGCCATC
GAAGAGAATG GTGGCTGGGT TGTCGGTTAT GAAAACTGCA CCGGGGCGAA AGCGACAGAG
CAATGCGTGG CGGAAACGGG CGATGTGTAC GACGCACTGG CTGATAAGTA TCTGGCAATC
GGCTGCTCCT GTGTTTCGCC GAACGATCAG CGCCTGAAAA TGCTCAGCCA GATGGTGGAA
GAATATCAGG TCGATGGCGT AGTTGATGTG ATTTTGCAGG CGTGCCATAC CTACGCGGTG
GAATCGCTGG CGATTAAACG TCATGTGCGT CAGCAGCACA ACATTCCTTA TATCGCTATT
GAAACAGACT ACTCCACCTC AGATGTCGGG CAGCTCAGTA CCCGTGTCGC GGCCTTTATT
GAGATGCTGT AA
 
Protein sequence
MSLITDLPAI FDQFSEARQK GFLAVMDFKE RGIPLVGTYC TFMPQEIPMA AGAVVVSLCS 
TSDETIEEAE KDLPRNLCPL IKSSYGFGKT DKCPYFYFSD LVVGETTCDG KKKMYEYMAE
FKPVHVMQLP NSVQDDASRA LWKAEMLRLQ TAVEERFGKE ITEEALRDAI ALKNRERRAL
ANFYHLGQLN PPALSGSDIL KVVYGATFRF DKEALIDELD AMTARVRQQW EEGQRLDTRP
RILITGCPIG GAAEKVVRAI EENGGWVVGY ENCTGAKATE QCVAETGDVY DALADKYLAI
GCSCVSPNDQ RLKMLSQMVE EYQVDGVVDV ILQACHTYAV ESLAIKRHVR QQHNIPYIAI
ETDYSTSDVG QLSTRVAAFI EML