Gene EcDH1_1743 details


Gene Information

Locus tag: EcDH1_1743
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1892986
End bp: 1894500
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: ABC transporter related protein
Protein accession: ACX39402
Protein GI: 260448980
COG category:
COG ID:
TIGRFAM ID:
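
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1892986, 1894500)
print(length)                  # 1515
print(protein_length(length))  # 504
```

Both values agree with the Gene Length and Protein Length fields of the record.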


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.50451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACGTT TCCCGGCGTT 
AAGGCGCTGA CGGATATTAG TTTTGACTGC TATGCCGGTC AGGTTCATGC GTTGATGGGT
GAAAATGGCG CAGGAAAATC AACTCTCTTA AAAATCCTCA GCGGCAACTA TGCGCCAACC
ACGGGTTCTG TAGTGATTAA TGGGCAGGAA ATGTCCTTTT CCGACACGAC CGCAGCACTT
AACGCGGGCG TGGCGATTAT TTACCAGGAA CTGCATCTCG TGCCGGAAAT GACCGTCGCG
GAAAACATCT ATCTCGGCCA GCTGCCGCAT AAAGGCGGCA TTGTGAATCG CTCATTGCTG
AATTATGAGG CGGGTTTACA ACTTAAACAT CTTGGTATGG ATATTGACCC GGACACGCCG
CTGAAATATC TCTCCATTGG TCAGTGGCAG ATGGTTGAAA TCGCCAAAGC GCTGGCGCGT
AACGCCAAAA TTATCGCCTT TGATGAGCCA ACCAGCTCCC TCTCTGCCCG TGAAATCGAC
AATCTTTTCC GCGTTATTCG TGAACTGCGA AAAGAGGGGC GGGTAATCTT ATACGTTTCT
CACCGTATGG AAGAAATATT TGCCCTCAGC GATGCCATTA CTGTCTTTAA AGATGGACGT
TATGTCAAAA CCTTTACCGA TATGCAGCAG GTTGACCACG ACGCGCTGGT GCAGGCGATG
GTCGGGCGCG ACATTGGCGA TATCTACGGC TGGCAACCGC GTAGTTATGG CGAGGAGCGC
CTACGTCTTG ATGCTGTGAA AGCACCAGGC GTGCGTACGC CAATAAGTCT GGCGGTTCGC
AGTGGTGAAA TTGTTGGGCT GTTTGGTCTG GTAGGGGCGG GGCGTAGCGA ATTAATGAAA
GGCATGTTTG GCGGGACGCA AATCACCGCC GGTCAGGTTT ATATCGACCA ACAGCCGATC
GATATTCGTA AACCGAGCCA CGCCATTGCC GCAGGCATGA TGCTCTGCCC GGAAGATCGC
AAAGCGGAAG GCATTATTCC CGTGCACTCC GTTCGCGACA ATATCAACAT CAGTGCCAGA
CGTAAACATG TGCTCGGCGG TTGTGTAATC AACAACGGTT GGGAAGAAAA CAATGCCGAT
CACCACATTC GTTCGCTCAA CATCAAAACG CCGGGCGCGG AGCAACTGAT CATGAATCTC
TCAGGCGGAA ATCAGCAAAA AGCCATTCTG GGCCGCTGGT TATCGGAAGA GATGAAGGTC
ATTTTGCTGG ATGAACCTAC GCGCGGCATT GATGTTGGCG CTAAGCACGA AATATATAAC
GTAATTTATG CGCTGGCGGC GCAGGGCGTG GCGGTGCTGT TTGCCTCCAG CGACTTACCT
GAAGTCCTCG GCGTTGCCGA CCGGATTGTG GTGATGCGGG AAGGTGAAAT CGCCGGTGAA
TTGTTACACG AGCAGGCAGA TGAGCGTCAG GCACTGAGCC TTGCGATGCC TAAAGTCAGC
CAGGCTGTTG CCTGA
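
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base, space-separated layout used above. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first row of the sequence is used here for brevity, so the printed percentage is not the whole-gene value.

```python
def normalize(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = normalize(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAACAGT CTACCCCGTA TCTCTCATTT CGCGGCATCG GTAAAACGTT TCCCGGCGTT"
print(len(normalize(first_row)))         # 60
print(f"{gc_fraction(first_row):.1%}")   # GC percentage of this row only
```

Passing the full 1515 bp sequence to `gc_fraction` gives the value to compare against the reported 52%.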
 
Protein sequence
MQQSTPYLSF RGIGKTFPGV KALTDISFDC YAGQVHALMG ENGAGKSTLL KILSGNYAPT 
TGSVVINGQE MSFSDTTAAL NAGVAIIYQE LHLVPEMTVA ENIYLGQLPH KGGIVNRSLL
NYEAGLQLKH LGMDIDPDTP LKYLSIGQWQ MVEIAKALAR NAKIIAFDEP TSSLSAREID
NLFRVIRELR KEGRVILYVS HRMEEIFALS DAITVFKDGR YVKTFTDMQQ VDHDALVQAM
VGRDIGDIYG WQPRSYGEER LRLDAVKAPG VRTPISLAVR SGEIVGLFGL VGAGRSELMK
GMFGGTQITA GQVYIDQQPI DIRKPSHAIA AGMMLCPEDR KAEGIIPVHS VRDNINISAR
RKHVLGGCVI NNGWEENNAD HHIRSLNIKT PGAEQLIMNL SGGNQQKAIL GRWLSEEMKV
ILLDEPTRGI DVGAKHEIYN VIYALAAQGV AVLFASSDLP EVLGVADRIV VMREGEIAGE
LLHEQADERQ ALSLAMPKVS QAVA
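
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact NCBI-style amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts; because this gene begins with ATG, no special start-codon handling is included, which is an assumption of this example. The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Raises KeyError for ambiguous bases such as N.
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCAACAGT CTACCCCGTA TCTCTCATTT"))  # MQQSTPYLSF
print(translate("CAGGCTGTTG CCTGA"))                  # QAVA
```

The two outputs match the first ten and last four residues of the protein sequence listed above.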