Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0400 |
Symbol | prpE |
ID | 5595153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 418302 |
End bp | 420188 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640919585 |
Product | propionyl-CoA synthetase |
Protein accession | YP_001457170 |
Protein GI | 157159852 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02316] propionate--CoA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTA GCGAATTTTA TCAGCGTTCG ATTAACGAAC CGGAGCAGTT CTGGGCCGAG CAGGCCCGGC GTATTGACTG GCAGACGCCC TTTACGCAAA CGCTCGATCA CAGCAATCCG CCGTTTGCCC GTTGGTTTTG TGAAGGCCGA ACCAACTTGT GCCACAACGC TATCGACCGC TGGCTGGAGA AACAACCAGA GGCGCTGGCG CTGATTGCCG TCTCTTCGGA AACAGAAGAA GAGCGCACCT TTACCTTTCG TCAGCTTCAT GACGAAGTAA ACGCGGTGGC GTCAATGCTG CGCTCGCTGG GCGTGCAGCG TGGCGATCGG GTGCTGGTGT ATATGCCGAT GATTGCCGAA GCGCATATTA CTCTGCTGGC CTGCGCGCGC ATTGGCGCTA TTCATTCGGT GGTGTTTGGG GGATTTGCCT CGCACAGCGT GGCGGCGCGA ATTGATGATG CTAAACCGGT GCTGATTGTC TCGGCTGATG CCGGAGCGCG CGGTGGCAAA ATCATTCCCT ATAAAAAATT GCTCGACGAT GCGATAAGTC AGGCGCAGCA CCAGCCACGC CATGTTTTGC TGGTGGATCG CGGGCTGGCG AAAATGGCAC GCGTCAGCGG GCGGGATGTC GATTTCGTGT CGTTGCGCCA TCAACACATC GGCGCGCGGG TACCGGTGGC GTGGCTGGAA TCCAACGAAA CCTCCTGCAT TCTCTACACC TCCGGCACGA CCGGCAAACC TAAAGGCGTG CAGCGTGACG TCGGCAGATA TGCGGTGGCG CTGGCGACCT CGATGGACAC CATTTTTGGC GGCAAAGCGG GCGGTGTGTT CTTTTGTGCT TCGGATATCG GCTGGGTGGT GGGGCATTCG TATATCGTCT ACGCGCCGCT GCTGGCGGGG ATGGCGACTA TCGTTTACGA AGGATTACCG ACCTGGCCGG ACTGCGGCGT GTGGTGGAAA ATCGTCGAGA AATATCAGGT TAGCCGGATG TTCTCAGCGC CGACCGCCAT TCGCGTGCTG AAAAAATTCC CTACCGCTGA AATTCGCAAA CACGATCTCT CGTCGCTGGA AGTGCTCTAT CTGGCTGGAG AACCGCTGGA CGAGCCGACC GCCAGTTGGG TGAGCAATAC GCTGGATGTG CCGGTCATCG ACAACTACTG GCAGACCGAA TCCGGCTGGC CGATTATGGC GATTGCTCGC GGTCTGGACG ACAGGCCGAC GCGTCTGGGA AGCCCCGGTG TGCCGATGTA TGGCTATAAC GTGCAGTTGC TTAATGAAGT CACCGGCGAA CCGTGTGGCG TCAACGAGAA AGGGATGCTG GTGGTGGAAG GGCCGCTGCC GCCGGGGTGT ATTCAGACCA TCTGGGGCGA CGACGGCCGC TTTGTGAAGA CTTACTGGTC GCTGTTTTCC CGCCCGGTGT ACGCCACCTT TGACTGGGGC ATCCGTGACG CTGACGGTTA TCACTTTATT CTCGGGCGCA CTGACGATGT AATTAACGTT GCCGGGCATC GGCTGGGGAC GCGCGAGATT GAAGAGAGTA TCTCCAGCCA TCCGGGCGTT GCCGAAGTGG CGGTGGTTGG GGTGAAAGAT GCGCTGAAAG GGCAGGTGGC GGTGGCGTTT GTCATTCCGA AAGAGAGCGA CAGTCTGGAA GATCGTGATG TGGCGCACTC GCAAGAGAAG GCGATTATGG CGCTGGTGGA CAGCCAGATT GGCAACTTTG GCCGCCCGGC GCACGTCTGG TTTGTCTCGC AATTGCCAAA AACGCGATCC GGAAAAATGC TGCGCCGCAC GATCCAGGCG ATTTGCGAAG GACGCGATCC TGGGGATCTG ACGACCATTG ATGATCCTGC GTCGTTGGAT CAGATCCGCC AGGCGATGGA AGAGTAA
|
Protein sequence | MSFSEFYQRS INEPEQFWAE QARRIDWQTP FTQTLDHSNP PFARWFCEGR TNLCHNAIDR WLEKQPEALA LIAVSSETEE ERTFTFRQLH DEVNAVASML RSLGVQRGDR VLVYMPMIAE AHITLLACAR IGAIHSVVFG GFASHSVAAR IDDAKPVLIV SADAGARGGK IIPYKKLLDD AISQAQHQPR HVLLVDRGLA KMARVSGRDV DFVSLRHQHI GARVPVAWLE SNETSCILYT SGTTGKPKGV QRDVGRYAVA LATSMDTIFG GKAGGVFFCA SDIGWVVGHS YIVYAPLLAG MATIVYEGLP TWPDCGVWWK IVEKYQVSRM FSAPTAIRVL KKFPTAEIRK HDLSSLEVLY LAGEPLDEPT ASWVSNTLDV PVIDNYWQTE SGWPIMAIAR GLDDRPTRLG SPGVPMYGYN VQLLNEVTGE PCGVNEKGML VVEGPLPPGC IQTIWGDDGR FVKTYWSLFS RPVYATFDWG IRDADGYHFI LGRTDDVINV AGHRLGTREI EESISSHPGV AEVAVVGVKD ALKGQVAVAF VIPKESDSLE DRDVAHSQEK AIMALVDSQI GNFGRPAHVW FVSQLPKTRS GKMLRRTIQA ICEGRDPGDL TTIDDPASLD QIRQAMEE
|
| |