Gene EcDH1_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2519 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2691499 
End bp2692620 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content54% 
IMG OID 
ProductCupin 4 family protein 
Protein accessionACX40155 
Protein GI260449733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0666495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATACC AACTCACTCT TAACTGGCCC GATTTTCTTG AACGTCACTG GCAGAAACGC 
CCGGTGGTGT TAAAACGCGG CTTTAATAAT TTTATTGACC CGATCTCTCC AGACGAGTTG
GCGGGTCTGG CGATGGAAAG CGAAGTTGAC AGTCGACTGG TCAGTCACCA GGATGGCAAA
TGGCAGGTCA GCCACGGCCC GTTCGAAAGC TACGATCATC TCGGTGAAAC CAACTGGTCA
TTACTGGTAC AGGCAGTGAA CCACTGGCAT GAGCCGACCG CCGCGCTGAT GCGACCGTTC
CGTGAACTAC CGGACTGGCG TATTGATGAT CTGATGATTT CTTTTTCTGT ACCCGGCGGC
GGCGTCGGCC CGCATCTCGA TCAGTACGAC GTGTTTATCA TTCAGGGTAC CGGACGTCGT
CGCTGGCGAG TGGGCGAAAA GCTGCAAATG AAACAGCACT GCCCACATCC GGATCTGTTA
CAGGTCGATC CGTTCGAAGC CATCATCGAT GAAGAGCTGG AGCCTGGTGA TATTCTTTAT
ATTCCGCCAG GATTCCCGCA TGAAGGCTAC GCGCTGGAAA ATGCGATGAA CTATTCCGTG
GGCTTTCGCG CGCCAAATAC GCGGGAACTG ATTAGTGGAT TTGCCGATTA TGTGCTGCAA
CGTGAACTGG GCGGCAACTA CTACAGCGAT CCGGATGTTC CACCTCGCGC TCATCCTGCG
GATGTTCTGC CGCAAGAGAT GGATAAACTG CGTGAGATGA TGCTCGAATT GATCAACCAG
CCGGAACACT TTAAGCAATG GTTTGGCGAG TTTATATCCC AGTCACGTCA TGAACTGGAT
ATCGCGCCGC CAGAGCCGCC TTATCAGCCG GATGAAATCT ACGATGCGCT GAAACAAGGT
GAAGTGCTGG TGCGCCTGGG TGGTCTGCGC GTATTGCGCA TTGGCGACGA CGTGTATGCC
AATGGTGAGA AGATCGATTC CCCGCACCGT CCGGCACTGG ATGCACTCGC CAGCAACATT
GCGCTGACTG CGGAGAATTT TGGCGATGCG CTGGAAGATC CGTCATTCCT CGCGATGCTC
GCGGCGCTGG TCAATAGCGG GTATTGGTTC TTCGAAGGGT AA
 
Protein sequence
MEYQLTLNWP DFLERHWQKR PVVLKRGFNN FIDPISPDEL AGLAMESEVD SRLVSHQDGK 
WQVSHGPFES YDHLGETNWS LLVQAVNHWH EPTAALMRPF RELPDWRIDD LMISFSVPGG
GVGPHLDQYD VFIIQGTGRR RWRVGEKLQM KQHCPHPDLL QVDPFEAIID EELEPGDILY
IPPGFPHEGY ALENAMNYSV GFRAPNTREL ISGFADYVLQ RELGGNYYSD PDVPPRAHPA
DVLPQEMDKL REMMLELINQ PEHFKQWFGE FISQSRHELD IAPPEPPYQP DEIYDALKQG
EVLVRLGGLR VLRIGDDVYA NGEKIDSPHR PALDALASNI ALTAENFGDA LEDPSFLAML
AALVNSGYWF FEG