Gene EcDH1_1852 details

Gene Information

Locus tag: EcDH1_1852
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2001624
End bp: 2002805
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: cyanate transporter
Protein accession: ACX39510
Protein GI: 260449088
COG category:
COG ID:
TIGRFAM ID:
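
The coordinates and lengths listed above are consistent with one another. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2001624, 2002805)
print(length)                     # 1182
print(protein_length_aa(length))  # 393
```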


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.74691
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTGTT CAACTTCATT AAGCGGCAAA AACAGGATTG TCCTTATCGC TGGCATTCTG 
ATGATTGCCA CAACATTACG CGTCACCTTT ACCGGCGCAG CACCGTTACT GGATACGATT
CGTTCCGCTT ACTCGCTGAC GACAGCGCAA ACCGGCTTAT TGACCACCCT GCCATTATTG
GCCTTTGCGC TAATCTCACC TTTGGCTGCC CCGGTAGCGC GACGTTTTGG TATGGAACGT
AGCCTGTTTG CCGCGTTACT TTTGATCTGT GCTGGTATCG CAATTCGCTC TCTCCCTTCG
CCTTACTTAT TATTTGGCGG TACAGCGGTC ATTGGCGGTG GGATTGCATT AGGCAATGTC
TTACTGCCAG GATTAATTAA ACGCGATTTC CCTCATTCCG TCGCCAGACT TACCGGCGCA
TATTCCCTGA CAATGGGAGC TGCAGCGGCA CTGGGATCGG CTATGGTCGT GCCGCTGGCT
TTGAACGGTT TTGGCTGGCA AGGCGCGTTG CTCATGCTGA TGTGTTTTCC TCTGCTGGCT
CTTTTTTTAT GGCTGCCACA GTGGCGAAGT CAACAACATG CAAATTTGAG TACCTCGCGC
GCCTTACATA CTCGGGGTAT CTGGCGTTCA CCGCTTGCCT GGCAGGTCAC ATTGTTTCTT
GGGATCAACT CACTGGTCTA TTACGTGATT ATTGGCTGGC TTCCGGCGAT CCTCATCAGT
CACGGCTATA GCGAAGCACA GGCGGGTTCA CTGCATGGTT TGCTGCAACT AGCCACAGCA
GCACCCGGTT TGCTGATCCC ACTTTTCTTA CATCATGTGA AAGATCAGCG TGGTATTGCA
GCGTTCGTTG CCTTGATGTG CGCAGTGGGC GCGGTTGGGC TCTGCTTTAT GCCAGCGCAC
GCGATCACCT GGACTCTGCT TTTCGGTTTT GGTTCCGGCG CAACAATGAT ACTGGGGTTG
ACGTTCATTG GTCTGCGGGC TAGTTCTGCG CATCAGGCGG CGGCACTCTC GGGGATGGCA
CAATCCGTCG GGTATTTGTT GGCAGCCTGT GGGCCGCCGC TGATGGGTAA AATACACGAT
GCTAACGGTA ACTGGTCTGT ACCACTTATG GGTGTTGCCA TACTTTCACT ACTGATGGCG
ATTTTCGGAC TTTGCGCCGG GAGAGACAAA GAAATTCGCT AA
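
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch that assumes the sequence is pasted in the 10-base block layout used above; `gc_percent` is a hypothetical helper name, and the example call uses only the first block of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 4 of 10 bases are G or C.
print(gc_percent("ATGACCTGTT"))  # 40.0
```

Passing the full gene sequence instead of one block gives the per-gene value.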
 
Protein sequence
MTCSTSLSGK NRIVLIAGIL MIATTLRVTF TGAAPLLDTI RSAYSLTTAQ TGLLTTLPLL 
AFALISPLAA PVARRFGMER SLFAALLLIC AGIAIRSLPS PYLLFGGTAV IGGGIALGNV
LLPGLIKRDF PHSVARLTGA YSLTMGAAAA LGSAMVVPLA LNGFGWQGAL LMLMCFPLLA
LFLWLPQWRS QQHANLSTSR ALHTRGIWRS PLAWQVTLFL GINSLVYYVI IGWLPAILIS
HGYSEAQAGS LHGLLQLATA APGLLIPLFL HHVKDQRGIA AFVALMCAVG AVGLCFMPAH
AITWTLLFGF GSGATMILGL TFIGLRASSA HQAAALSGMA QSVGYLLAAC GPPLMGKIHD
ANGNWSVPLM GVAILSLLMA IFGLCAGRDK EIR
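
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python; it is not the database's own method. It builds the standard codon table, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted alternative start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino acids of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACCTGTT CAACTTCATT AAGCGGCAAA"))  # MTCSTSLSGK
```

The result matches the first block of the protein sequence; the final codons of the gene (GAA ATT CGC TAA) likewise give the terminal residues EIR followed by a stop.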