Gene EcDH1_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3785 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4083305 
End bp4084717 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content53% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX41388 
Protein GI260450966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000255253 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGATC AGGTAAAAGT CGTTGCCGAT GATCAGGCTC CGGCTGAACA GTCGCTACGG 
CGCAATCTCA CAAACCGACA TATTCAGCTT ATTGCCATTG GCGGTGCCAT TGGTACGGGG
TTGTTTATGG GGTCTGGCAA AACGATTAGC CTTGCCGGGC CGTCGATCAT TTTCGTTTAT
ATGATCATTG GTTTTATGCT CTTTTTCGTG ATGCGGGCAA TGGGGGAATT GCTGCTTTCG
AATCTGGAAT ACAAATCTTT TAGTGACTTC GCTTCCGATT TACTCGGGCC GTGGGCAGGA
TATTTCACCG GCTGGACTTA CTGGTTCTGC TGGGTTGTAA CCGGTATGGC AGACGTGGTG
GCGATCACGG CTTATGCTCA GTTCTGGTTC CCCGATCTCT CCGACTGGGT CGCCTCGCTG
GCGGTGATAG TGCTGCTGCT GACGCTCAAT CTCGCCACCG TGAAAATGTT CGGTGAGATG
GAGTTCTGGT TTGCGATGAT CAAAATCGTC GCCATTGTGT CGCTGATTGT CGTCGGCCTG
GTCATGGTGG CGATGCACTT TCAGTCACCG ACTGGTGTGG AAGCGTCATT CGCGCATTTG
TGGAATGACG GCGGCTGGTT CCCGAAAGGT TTAAGTGGCT TCTTTGCCGG ATTCCAGATA
GCGGTTTTCG CTTTCGTGGG GATTGAGCTG GTAGGTACAA CAGCTGCGGA AACCAAAGAT
CCAGAGAAAT CACTGCCACG CGCGATTAAC TCCATTCCGA TCCGTATCAT TATGTTCTAC
GTCTTCGCGC TGATTGTGAT TATGTCCGTG ACGCCGTGGA GTTCGGTAGT CCCGGAGAAA
AGCCCGTTTG TTGAACTGTT CGTGTTGGTA GGGCTGCCTG CTGCCGCAAG CGTGATCAAC
TTTGTGGTGC TGACCTCTGC GGCGTCTTCC GCTAACAGCG GCGTCTTCTC TACCAGCCGT
ATGCTGTTTG GTCTGGCGCA GGAAGGTGTG GCACCGAAAG CGTTCGCTAA ACTTTCTAAG
CGCGCAGTAC CCGCGAAAGG GCTGACGTTC TCGTGTATCT GTCTGCTCGG CGGCGTGGTG
ATGTTGTATG TGAATCCTAG TGTGATTGGC GCGTTCACGA TGATTACAAC CGTTTCCGCG
ATTCTGTTTA TGTTCGTCTG GACGATTATC CTTTGCTCGT ACCTTGTGTA TCGCAAACAG
CGTCCTCATC TACATGAGAA GTCGATCTAC AAGATGCCGC TCGGCAAGCT GATGTGCTGG
GTATGTATGG CGTTCTTTGT GTTCGTGGTC GTGTTGCTGA CACTGGAAGA TGACACTCGC
CAGGCGCTGC TGGTTACCCC GCTGTGGTTT ATCGCGCTGG GGTTGGGCTG GCTGTTTATT
GGTAAGAAGC GGGCTGCTGA ACTGCGGAAA TAA
 
Protein sequence
MVDQVKVVAD DQAPAEQSLR RNLTNRHIQL IAIGGAIGTG LFMGSGKTIS LAGPSIIFVY 
MIIGFMLFFV MRAMGELLLS NLEYKSFSDF ASDLLGPWAG YFTGWTYWFC WVVTGMADVV
AITAYAQFWF PDLSDWVASL AVIVLLLTLN LATVKMFGEM EFWFAMIKIV AIVSLIVVGL
VMVAMHFQSP TGVEASFAHL WNDGGWFPKG LSGFFAGFQI AVFAFVGIEL VGTTAAETKD
PEKSLPRAIN SIPIRIIMFY VFALIVIMSV TPWSSVVPEK SPFVELFVLV GLPAAASVIN
FVVLTSAASS ANSGVFSTSR MLFGLAQEGV APKAFAKLSK RAVPAKGLTF SCICLLGGVV
MLYVNPSVIG AFTMITTVSA ILFMFVWTII LCSYLVYRKQ RPHLHEKSIY KMPLGKLMCW
VCMAFFVFVV VLLTLEDDTR QALLVTPLWF IALGLGWLFI GKKRAAELRK