Gene EcDH1_2384 details


Gene Information

Locus tag: EcDH1_2384
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2556731
End bp: 2558293
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: anthranilate synthase component I
Protein accession: ACX40027
Protein GI: 260449605
COG category:
COG ID:
TIGRFAM ID:
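
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(2556731, 2558293)
print(length, protein_length_aa(length))  # prints "1563 520"
```

Both results agree with the Gene Length and Protein Length fields of the record.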


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00032032
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT 
CCCACCGCGC TTTTTCACCA GTTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC
GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTGCGC
ATTACAGTTT TAGGTGACAC TGTCACAATC CAGGCACTTT CCGGCAACGG CGAAGCCCTC
CTGGCACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA GTGAACAATC ACCAAACTGC
CGTGTGCTGC GCTTCCCCCC TGTCAGTCCA CTGCTGGATG AAGACGCCCG CTTATGCTCC
CTTTCGGTTT TTGACGCTTT CCGTTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA
CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG ATTTGAAGAT
TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG
CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGTATTC AGGCCAGCCT GTTTGCTCCG
AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG AACTACGTCA GCAACTGACC
GAAGCCGCGC CGCCGCTGCC AGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAATCAG
AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA
ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC
TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT
TTCACCCTAT TTGGCGCGTC GCCGGAAAGC TCGCTCAAGT ATGATGCCAC CAGCCGCCAG
ATTGAGATCT ACCCGATTGC CGGAACACGC CCACGCGGTC GTCGCGCCGA TGGTTCACTG
GACAGAGATC TCGACAGCCG TATTGAACTG GAAATGCGTA CCGATCATAA AGAGCTGTCT
GAACATCTGA TGCTGGTTGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC
AGCCGCTACG TCGCCGATCT CACCAAAGTT GACCGTTATT CCTATGTGAT GCACCTCGTC
TCTCGCGTAG TCGGCGAACT GCGTCACGAT CTTGACGCCC TGCACGCTTA TCGCGCCTGT
ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG
GCGGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC
GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCCACCGTG
CAAGCGGGTG CTGGTGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT
AACAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC
TGA
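
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC fraction like the one reported under GC content. The helper names are hypothetical, and only the first line of the sequence is used for illustration; the whole sequence would be needed to compare against the record's 54% figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied with its original spacing.
first_line = "ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```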
 
Protein sequence
MQTQKPTLEL LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITVLGDTVTI QALSGNGEAL LALLDNALPA GVESEQSPNC RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET
LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNELRQQLT EAAPPLPVVS VPHMRCECNQ
SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV
QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
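
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to generate this record. It builds the codon table from the usual compact 64-letter string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that is not an issue). All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT"
print(translate(first_line))  # prints "MQTQKPTLELLTCEGAYRDN"
```

The output matches the first two blocks of the protein sequence above.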