Gene EcHS_A4563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4563 
Symbol 
ID5595311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4569707 
End bp4570858 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID640923659 
Productputative 2-hydroxyglutaryl-CoA dehydratase, D-component 
Protein accessionYP_001461099 
Protein GI157163781 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTTA TCACCGATCT ACCCGCCATT TTCGATCAGT TCTCTGAAGC TCGCCAGAAA 
GGCTTTCTCA CCGTCATGGA TCTCAAGGAG CGCGGCATTC CGCTGGTTGG CACTTACTGC
ACCTTTATGC CGCAAGAGAT CCCGATGGCA GCCGGTGCGG TTGTGGTTTC GCTCTGTTCT
ACCTCTGATG AAACCATTGA AGAAGCGGAG AAAGATCTGC CGCGCAACCT CTGCCCGCTG
ATTAAAAGCA GCTACGGCTT CGGCAAAACC GATAAATGCC CCTACTTCTA CTTTTCGGAT
CTGGTGGTCG GTGAAACCAC CTGCGACGGC AAAAAGAAAA TGTATGAATA CATGGCGGAG
TTTAAGCCCG TTCATGTGAT GCAGTTGCCG AACAGCGTTA AGGACGATGC CTCGCGTGCG
TTATGGAAAG CCGAGATGCT GCGCTTGCAA AAAACGATAG AAGAACGTTT TGGGCACGAG
ATTAGCGAAG ATGCTCTGCG CGATGCCATT GCGCTGAAAA ACCGCGAACG TCGCGCACTG
GCCAATTTTT ATCATCTTGG GCAGTTAAAT CCTCCGGCGC TTAGCGGCAG CGACATTCTG
AAAGTGGTTT ACGGCGCAAC CTTCCGGTTC GATAAAGAGG CGTTGATCAA TGAACTGGAC
GCGATGACAG CCCGCGTTCG TCAGCAGTGG GAAGAAGGCC AGCGGCTGGC CCTGCGTCCA
CGCATTTTAA TCACCGGCTG CCCGATTGGC GGCGCAGCAG AAAAAGTGGT GCGCGCGATT
GAAGAGAATG GCGGCTGGGT TGTCGGTTAT GAAAACTGCA CCGGGGCGAA AGCGACCGAG
CAATGCGTGG CAGAAACGGG CGATGTCTAC GACGCACTGG CGGATAAATA TCTGGCGATT
GGCTGCTCCT GTGTTTCGCC GAACGATCAG CGCCTGCAAA TGCTCAGCCA GATGGTGGAA
GAATATCAGG TCGATGGCGT AGTTGATGTG ATTTTGCAGG CGTGCCATAC CTACGCGGTG
GAATCGCTGG CGATTAAACG TCATGTGCGT CAGCAGCACA ACATTCCTTA TATCGCTATT
GAAACAGACT ACTCCACCTC AGATGTCGGG CAGCTCAGTA CCCGTGTCGC GGCCTTTATT
GAGATGCTGT AA
 
Protein sequence
MSLITDLPAI FDQFSEARQK GFLTVMDLKE RGIPLVGTYC TFMPQEIPMA AGAVVVSLCS 
TSDETIEEAE KDLPRNLCPL IKSSYGFGKT DKCPYFYFSD LVVGETTCDG KKKMYEYMAE
FKPVHVMQLP NSVKDDASRA LWKAEMLRLQ KTIEERFGHE ISEDALRDAI ALKNRERRAL
ANFYHLGQLN PPALSGSDIL KVVYGATFRF DKEALINELD AMTARVRQQW EEGQRLALRP
RILITGCPIG GAAEKVVRAI EENGGWVVGY ENCTGAKATE QCVAETGDVY DALADKYLAI
GCSCVSPNDQ RLQMLSQMVE EYQVDGVVDV ILQACHTYAV ESLAIKRHVR QQHNIPYIAI
ETDYSTSDVG QLSTRVAAFI EML