Gene EcDH1_3124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3124 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3357087 
End bp3358379 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content53% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX40750 
Protein GI260450328 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.554048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACA CGGAAGGTAA TAACGGTAAC AAACCTCTCG GTCTATGGAA CGTCGTTTCC 
ATCGGCATTG GGGCAATGGT GGGGGCGGGG ATCTTCGCGC TGCTGGGGCA GGCTGCATTG
CTAATGGAAG CCTCGACCTG GGTCGCCTTT GCTTTTGGCG GTATTGTGGC GATGTTTTCC
GGTTATGCCT ATGCGCGTCT GGGGGCGAGC TATCCCAGCA ATGGCGGCAT TATCGACTTC
TTTCGTCGCG GATTAGGCAA CGGCGTCTTT TCGCTGGCGC TCTCGTTACT GTACCTGTTG
ACGCTGGCGG TGAGCATCGC CATGGTCGCC CGTGCTTTTG GCGCTTATGC CGTGCAGTTT
TTGCATGAAG GCAGCCAGGA GGAGCACCTT ATTTTGCTCT ACGCGTTGGG GATCATTGCG
GTGATGACGC TTTTCAACTC CTTAAGCAAC CATGCGGTAG GGCGGCTGGA AGTGATCCTC
GTCGGCATTA AAATGATGAT CCTGTTATTG CTGATTATTG CCGGTGTCTG GTCGCTGCAA
CCGGCGCATA TTTCCGTCTC TGCGCCCCCC AGCTCCGGTG CGTTCTTCTC CTGTATTGGG
ATAACTTTCC TTGCCTATGC GGGCTTTGGC ATGATGGCGA ACGCGGCGGA TAAAGTGAAA
GATCCGCAGG TCATTATGCC ACGGGCGTTT CTGGTGGCGA TTGGCGTTAC CACGTTGCTT
TATATCTCGC TGGCACTGGT TTTGCTTAGC GATGTATCGG CATTAGAGTT AGAAAAATAT
GCCGATACCG CCGTAGCGCA GGCTGCTTCT CCGCTGCTCG GGCATGTGGG TTATGTGATC
GTCGTCATCG GCGCTTTACT GGCGACGGCT TCAGCCATTA ACGCGAACCT GTTCGCCGTG
TTTAACATCA TGGACAACAT GGGCAGCGAA CGCGAACTGC CGAAGCTAAT GAATAAATCC
CTGTGGCGGC AGAGTACCTG GGGCAACATT ATTGTCGTGG TGTTGATTAT GCTGATGACG
GCGGCACTGA ATTTAGGCTC ACTCGCCAGC GTTGCCAGCG CCACCTTTTT GATTTGCTAC
CTGGCGGTGT TTGTGGTGGC GATCCGCCTG CGTCATGATA TTCACGCCTC GTTGCCGATT
CTTATCGTTG GTACGTTGGT GATGTTGTTG GTGATCGTTG GCTTTATCTA CAGTCTGTGG
TCCCAGGGTA GCCGTGCGTT GATATGGATT ATTGGCTCAC TCTTACTCAG CCTTATTGTG
GCAATGGTCA TGAAGCGCAA TAAAACCGTA TAA
 
Protein sequence
MMNTEGNNGN KPLGLWNVVS IGIGAMVGAG IFALLGQAAL LMEASTWVAF AFGGIVAMFS 
GYAYARLGAS YPSNGGIIDF FRRGLGNGVF SLALSLLYLL TLAVSIAMVA RAFGAYAVQF
LHEGSQEEHL ILLYALGIIA VMTLFNSLSN HAVGRLEVIL VGIKMMILLL LIIAGVWSLQ
PAHISVSAPP SSGAFFSCIG ITFLAYAGFG MMANAADKVK DPQVIMPRAF LVAIGVTTLL
YISLALVLLS DVSALELEKY ADTAVAQAAS PLLGHVGYVI VVIGALLATA SAINANLFAV
FNIMDNMGSE RELPKLMNKS LWRQSTWGNI IVVVLIMLMT AALNLGSLAS VASATFLICY
LAVFVVAIRL RHDIHASLPI LIVGTLVMLL VIVGFIYSLW SQGSRALIWI IGSLLLSLIV
AMVMKRNKTV