Gene EcHS_A0400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0400 
SymbolprpE 
ID5595153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp418302 
End bp420188 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content58% 
IMG OID640919585 
Productpropionyl-CoA synthetase 
Protein accessionYP_001457170 
Protein GI157159852 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02316] propionate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones66 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTTA GCGAATTTTA TCAGCGTTCG ATTAACGAAC CGGAGCAGTT CTGGGCCGAG 
CAGGCCCGGC GTATTGACTG GCAGACGCCC TTTACGCAAA CGCTCGATCA CAGCAATCCG
CCGTTTGCCC GTTGGTTTTG TGAAGGCCGA ACCAACTTGT GCCACAACGC TATCGACCGC
TGGCTGGAGA AACAACCAGA GGCGCTGGCG CTGATTGCCG TCTCTTCGGA AACAGAAGAA
GAGCGCACCT TTACCTTTCG TCAGCTTCAT GACGAAGTAA ACGCGGTGGC GTCAATGCTG
CGCTCGCTGG GCGTGCAGCG TGGCGATCGG GTGCTGGTGT ATATGCCGAT GATTGCCGAA
GCGCATATTA CTCTGCTGGC CTGCGCGCGC ATTGGCGCTA TTCATTCGGT GGTGTTTGGG
GGATTTGCCT CGCACAGCGT GGCGGCGCGA ATTGATGATG CTAAACCGGT GCTGATTGTC
TCGGCTGATG CCGGAGCGCG CGGTGGCAAA ATCATTCCCT ATAAAAAATT GCTCGACGAT
GCGATAAGTC AGGCGCAGCA CCAGCCACGC CATGTTTTGC TGGTGGATCG CGGGCTGGCG
AAAATGGCAC GCGTCAGCGG GCGGGATGTC GATTTCGTGT CGTTGCGCCA TCAACACATC
GGCGCGCGGG TACCGGTGGC GTGGCTGGAA TCCAACGAAA CCTCCTGCAT TCTCTACACC
TCCGGCACGA CCGGCAAACC TAAAGGCGTG CAGCGTGACG TCGGCAGATA TGCGGTGGCG
CTGGCGACCT CGATGGACAC CATTTTTGGC GGCAAAGCGG GCGGTGTGTT CTTTTGTGCT
TCGGATATCG GCTGGGTGGT GGGGCATTCG TATATCGTCT ACGCGCCGCT GCTGGCGGGG
ATGGCGACTA TCGTTTACGA AGGATTACCG ACCTGGCCGG ACTGCGGCGT GTGGTGGAAA
ATCGTCGAGA AATATCAGGT TAGCCGGATG TTCTCAGCGC CGACCGCCAT TCGCGTGCTG
AAAAAATTCC CTACCGCTGA AATTCGCAAA CACGATCTCT CGTCGCTGGA AGTGCTCTAT
CTGGCTGGAG AACCGCTGGA CGAGCCGACC GCCAGTTGGG TGAGCAATAC GCTGGATGTG
CCGGTCATCG ACAACTACTG GCAGACCGAA TCCGGCTGGC CGATTATGGC GATTGCTCGC
GGTCTGGACG ACAGGCCGAC GCGTCTGGGA AGCCCCGGTG TGCCGATGTA TGGCTATAAC
GTGCAGTTGC TTAATGAAGT CACCGGCGAA CCGTGTGGCG TCAACGAGAA AGGGATGCTG
GTGGTGGAAG GGCCGCTGCC GCCGGGGTGT ATTCAGACCA TCTGGGGCGA CGACGGCCGC
TTTGTGAAGA CTTACTGGTC GCTGTTTTCC CGCCCGGTGT ACGCCACCTT TGACTGGGGC
ATCCGTGACG CTGACGGTTA TCACTTTATT CTCGGGCGCA CTGACGATGT AATTAACGTT
GCCGGGCATC GGCTGGGGAC GCGCGAGATT GAAGAGAGTA TCTCCAGCCA TCCGGGCGTT
GCCGAAGTGG CGGTGGTTGG GGTGAAAGAT GCGCTGAAAG GGCAGGTGGC GGTGGCGTTT
GTCATTCCGA AAGAGAGCGA CAGTCTGGAA GATCGTGATG TGGCGCACTC GCAAGAGAAG
GCGATTATGG CGCTGGTGGA CAGCCAGATT GGCAACTTTG GCCGCCCGGC GCACGTCTGG
TTTGTCTCGC AATTGCCAAA AACGCGATCC GGAAAAATGC TGCGCCGCAC GATCCAGGCG
ATTTGCGAAG GACGCGATCC TGGGGATCTG ACGACCATTG ATGATCCTGC GTCGTTGGAT
CAGATCCGCC AGGCGATGGA AGAGTAA
 
Protein sequence
MSFSEFYQRS INEPEQFWAE QARRIDWQTP FTQTLDHSNP PFARWFCEGR TNLCHNAIDR 
WLEKQPEALA LIAVSSETEE ERTFTFRQLH DEVNAVASML RSLGVQRGDR VLVYMPMIAE
AHITLLACAR IGAIHSVVFG GFASHSVAAR IDDAKPVLIV SADAGARGGK IIPYKKLLDD
AISQAQHQPR HVLLVDRGLA KMARVSGRDV DFVSLRHQHI GARVPVAWLE SNETSCILYT
SGTTGKPKGV QRDVGRYAVA LATSMDTIFG GKAGGVFFCA SDIGWVVGHS YIVYAPLLAG
MATIVYEGLP TWPDCGVWWK IVEKYQVSRM FSAPTAIRVL KKFPTAEIRK HDLSSLEVLY
LAGEPLDEPT ASWVSNTLDV PVIDNYWQTE SGWPIMAIAR GLDDRPTRLG SPGVPMYGYN
VQLLNEVTGE PCGVNEKGML VVEGPLPPGC IQTIWGDDGR FVKTYWSLFS RPVYATFDWG
IRDADGYHFI LGRTDDVINV AGHRLGTREI EESISSHPGV AEVAVVGVKD ALKGQVAVAF
VIPKESDSLE DRDVAHSQEK AIMALVDSQI GNFGRPAHVW FVSQLPKTRS GKMLRRTIQA
ICEGRDPGDL TTIDDPASLD QIRQAMEE