Gene EcDH1_1736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1736 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1887163 
End bp1888374 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID 
Productaromatic amino acid transporter 
Protein accessionACX39395 
Protein GI260448973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000018039 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACA GAACCCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC 
ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG
ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT
GTTCCGGCAG ATACCGGTCT GGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAA
TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC
GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC TATGTCGGCA
ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT CGGAACTTCA
CTGGTCGATT TATTTAACCG TTTTCTGTTC AGCGCCAAGA TTATTTTTCT GGTGGTAATG
CTGGTACTAC TGCTGCCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG
GGGCTGGCTC TGTCTGCAAT CCCGGTGATT TTTACGTCGT TTGGTTTTCA CGGTAGCGTG
CCGAGTATTG TCAGCTATAT GGATGGCAAC ATTCGTAAGC TACGCTGGGT GTTTATAATC
GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCGACGCT TGGCAGCATT
GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG
GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT
TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT
TTATTTCAGC GTTCAAATAC CGTTGGTGGA CGGTTGCAAA CTGGTGCAAT TACGTTTCTG
CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC
GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA
AAGCACAATC CTCAGGCGGG TTACCGGGTC AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG
TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA
GAAGTGGGGT GA
 
Protein sequence
MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH 
VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA
TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ
GLALSAIPVI FTSFGFHGSV PSIVSYMDGN IRKLRWVFII GSAIPLVAYI FWQVATLGSI
DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD
LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR
KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG