Gene EcDH1_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3851 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4144169 
End bp4145425 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content52% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX41453 
Protein GI260451031 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGCCTGCT ATCGACGTCA 
TTATTAGGCA CTGGCGTGTT TGCCGTTCCT GCGTTAGCTG CGCTGGTAGC GGGCAATAAC
AGCCTGTGGG CGTGGCCCGT TTTGATTATC TTAGTGTTCC CGATTGCGAT TGTGTTTGCG
ATTCTGGGTC GCCACTATCC CAGCGCAGGC GGCGTCGCGC ACTTCGTCGG TATGGCGTTT
GGTTCGCGGC TTGAGCGAGT CACCGGCTGG CTGTTTTTAT CGGTCATTCC CGTGGGTTTG
CCTGCCGCAC TACAAATTGC CGCCGGGTTC GGCCAGGCGA TGTTTGGCTG GCATAGCTGG
CAACTGTTGT TGGCAGAACT CGGTACGCTG GCGCTGGTGT GGTATATCGG TACTCGCGGT
GCCAGTTCCA GTGCTAATCT ACAAACCGTT ATTGCCGGAC TTATCGTCGC GCTGATTGTC
GCTATCTGGT GGGCGGGCGA TATCAAACCT GCGAATATCC CCTTTCCGGC ACCTGGTAAT
ATCGAACTTA CCGGGTTATT TGCTGCGTTA TCAGTGATGT TCTGGTGTTT TGTCGGTCTG
GAGGCATTTG CCCATCTCGC CTCGGAATTT AAAAATCCAG AGCGTGATTT TCCTCGTGCT
TTGATGATTG GTCTGCTGCT GGCAGGATTA GTCTACTGGG GCTGTACGGT AGTCGTCTTA
CACTTCGACG CCTATGGTGA AAAAATGGCG GCGGCAGCAT CGCTTCCAAA AATTGTAGTG
CAGTTGTTCG GTGTAGGAGC GTTATGGATT GCCTGCGTGA TTGGCTATCT GGCCTGCTTT
GCCAGTCTCA ACATTTATAT ACAGAGCTTC GCCCGCCTGG TCTGGTCGCA GGCGCAACAT
AATCCTGACC ACTACCTGGC ACGCCTCTCT TCTCGCCATA TCCCGAATAA TGCCCTCAAT
GCGGTGCTCG GCTGCTGTGT GGTGAGCACT TTGGTGATTC ATGCTTTAGA GATCAATCTG
GACGCTCTTA TTATTTATGC CAATGGCATC TTTATTATGA TTTATCTGTT ATGCATGCTG
GCAGGCTGTA AATTATTGCA AGGACGTTAT CGACTACTGG CGGTGGTTGG CGGGCTGTTA
TGCGTTCTGT TACTGGCAAT GGTCGGCTGG AAAAGTCTCT ATGCGCTGAT CATGCTGGCG
GGGTTATGGC TGTTGCTGCC AAAACGAAAA ACGCCGGAAA ATGGCATAAC CACATAA
 
Protein sequence
MSGLKQELGL AQGIGLLSTS LLGTGVFAVP ALAALVAGNN SLWAWPVLII LVFPIAIVFA 
ILGRHYPSAG GVAHFVGMAF GSRLERVTGW LFLSVIPVGL PAALQIAAGF GQAMFGWHSW
QLLLAELGTL ALVWYIGTRG ASSSANLQTV IAGLIVALIV AIWWAGDIKP ANIPFPAPGN
IELTGLFAAL SVMFWCFVGL EAFAHLASEF KNPERDFPRA LMIGLLLAGL VYWGCTVVVL
HFDAYGEKMA AAASLPKIVV QLFGVGALWI ACVIGYLACF ASLNIYIQSF ARLVWSQAQH
NPDHYLARLS SRHIPNNALN AVLGCCVVST LVIHALEINL DALIIYANGI FIMIYLLCML
AGCKLLQGRY RLLAVVGGLL CVLLLAMVGW KSLYALIMLA GLWLLLPKRK TPENGITT