Gene EcolC_2995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2995 
Symbol 
ID6065896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3273840 
End bp3275510 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content50% 
IMG OID641602412 
Product2-alkenal reductase 
Protein accessionYP_001725947 
Protein GI170020993 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATG CAGAACTCGC CATTGGTATC GATCTCGGTA CTACCAATAG TTTAATTGCC 
GTCTGGAAAG ACGGTGCCGC GCAATTAATT CCAAATAAGT TCGGTGAATA TTTAACACCA
TCCATAATTA GCATGGATGA AAATAATCAT ATTTTAGTCG GAAAACCGGC TGTATCACGG
CGTACTTCGC ATCCGGATAA AACGGCAGCG TTATTTAAAC GTGCAATGGG CAGTAATACC
AACTGGCGGT TAGGCAGCGA CACATTTAAC GCGCCAGAAC TGTCCTCTTT GGTATTACGC
TCATTAAAAG AAGATGCCGA AGAATTTCTG CAACGTCCGA TTAAAGATGT GGTGATCTCC
GTTCCGGCTT ATTTCAGCGA TGAACAACGC AAGCATACCC GTTTAGCAGC GGAGTTAGCC
GGGTTAAATG CGGTACGCTT AATTAATGAA CCCACAGCAG CTGCGATGGC GTATGGCCTG
CATACCCAAC AAAATACCCG TTCGCTGGTG TTTGATCTCG GTGGCGGCAC GTTTGACGTT
ACGGTGCTTG AGTACGCCAC GCCGGTGATT GAAGTTCACG CCTCCGCTGG CGACAACTTT
CTTGGTGGCG AAGATTTTAC CCATATGCTG GTCGATGAGG TTTTAAAACG CGCGGATGTC
GCCAGGACCA CGCTTAACGA GAGTGAACTG GCAGCCTTGT ACGCCTGTGT GGAAGCGGCA
AAATGTAGCA ATCAATCGCC ATTGCACATT CGCTGGCAGT ATCAGGAAGA AACGCGGGAA
TGCGAATTTT ACGAGAACGA ACTGGAAGAT TTGTGGTTGC CGCTGCTCAA TCGCTTGCGA
GTGCCGATTG AACAGGCGTT GCGCGATGCG CGTCTGAAGC CGAGTCAAAT CGACAGTCTG
GTGCTGGTTG GCGGCGCGTC ACAAATGCCG CTGGTGCAGC GAATCGCCGT GCGTCTGTTT
GGCAAATTAC CGTATCAAAG TTACGATCCG AGCACCATTG TCGCGCTGGG CGCAGCAATC
CAGGCCGCCT GCCGCTTACG CAGTGAAGAT ATTGAAGAGG TAATCCTCAC TGATATTTGC
CCTTACTCGT TGGGCGTTGA AGTTAACCGC CAGGGCGTTT CCGGCATTTT CTCGCCGATT
ATTGAACGAA ACACCACTGT GCCCGTGTCG CGTGTAGAAA CTTATTCAAC CATGCACCCG
GAACAGGATT CAATTACGGT TAACGTCTAT CAGGGAGAAA ACCACAAAGT TAAAAACAAC
ATTCTGGTGG AATCCTTCGA TGTGCCGTTG AAGAAAACCG GGGCTTATCA GTCGATTGAT
ATTCGCTTTA GTTATGATAT CAACGGGTTG CTTGAAGTTG ACGTGCTTCT GGAAGACGGC
AGCGTTAAGT CCAGAGTGAT TAACCACAGC CCGGTAACAT TGAGCGCGCA GCAGATTGAA
GAGAGTCGGA CGCGGTTATC CGCATTGAAA ATTTATCCGC GCGATATGCT CATCAATCGC
ACCTTTAAAG CCAAACAGGA AGAGTTGTGG GCGCGGGCGC TGGGTGACGA GCGAGAAGAG
ATCGGCCGGG TGATCACCGA TTTTGATGCG GCGTTGCAGT CAAACGATAT GGCCCGCGTG
GATGAAGTTC GGCGGCGGGC GAGCGATTAT TTAGCCATTG AGATCCCATA A
 
Protein sequence
MDNAELAIGI DLGTTNSLIA VWKDGAAQLI PNKFGEYLTP SIISMDENNH ILVGKPAVSR 
RTSHPDKTAA LFKRAMGSNT NWRLGSDTFN APELSSLVLR SLKEDAEEFL QRPIKDVVIS
VPAYFSDEQR KHTRLAAELA GLNAVRLINE PTAAAMAYGL HTQQNTRSLV FDLGGGTFDV
TVLEYATPVI EVHASAGDNF LGGEDFTHML VDEVLKRADV ARTTLNESEL AALYACVEAA
KCSNQSPLHI RWQYQEETRE CEFYENELED LWLPLLNRLR VPIEQALRDA RLKPSQIDSL
VLVGGASQMP LVQRIAVRLF GKLPYQSYDP STIVALGAAI QAACRLRSED IEEVILTDIC
PYSLGVEVNR QGVSGIFSPI IERNTTVPVS RVETYSTMHP EQDSITVNVY QGENHKVKNN
ILVESFDVPL KKTGAYQSID IRFSYDINGL LEVDVLLEDG SVKSRVINHS PVTLSAQQIE
ESRTRLSALK IYPRDMLINR TFKAKQEELW ARALGDEREE IGRVITDFDA ALQSNDMARV
DEVRRRASDY LAIEIP