Gene EcDH1_2385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2385 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2558293 
End bp2559888 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content56% 
IMG OID 
Productanthranilate phosphoribosyltransferase 
Protein accessionACX40028 
Protein GI260449606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.101159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACA TTCTGCTGCT CGATAATATC GACTCTTTTA CGTACAACCT GGCAGATCAG 
TTGCGCAGCA ATGGGCATAA CGTGGTGATT TACCGCAACC ATATTCCGGC GCAAACCTTA
ATTGAACGCC TGGCGACCAT GAGCAATCCG GTGCTGATGC TTTCTCCTGG CCCCGGTGTG
CCGAGCGAAG CCGGTTGTAT GCCGGAACTC CTCACCCGCT TGCGTGGCAA GCTGCCCATT
ATTGGCATTT GCCTCGGACA TCAGGCGATT GTCGAAGCTT ACGGGGGCTA TGTCGGTCAG
GCGGGCGAAA TTCTCCACGG TAAAGCCTCC AGCATTGAAC ATGACGGTCA GGCGATGTTT
GCCGGATTAA CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGGTTGG CAGTAACATT
CCGGCCGGTT TAACCATCAA CGCCCATTTT AATGGCATGG TGATGGCAGT ACGTCACGAT
GCGGATCGCG TTTGTGGATT CCAGTTCCAT CCGGAATCCA TTCTCACCAC CCAGGGCGCT
CGCCTGCTGG AACAAACGCT GGCCTGGGCG CAGCAGAAAC TAGAGCCAGC CAACACGCTG
CAACCGATTC TGGAAAAACT GTATCAGGCG CAGACGCTTA GCCAACAAGA AAGCCACCAG
CTGTTTTCAG CGGTGGTGCG TGGCGAGCTG AAGCCGGAAC AACTGGCGGC GGCGCTGGTG
AGCATGAAAA TTCGCGGTGA GCACCCGAAC GAGATCGCCG GGGCAGCAAC CGCGCTACTG
GAAAACGCAG CGCCGTTCCC GCGCCCGGAT TATCTGTTTG CTGATATCGT CGGTACTGGC
GGTGACGGCA GCAACAGTAT CAATATTTCT ACCGCCAGTG CGTTTGTCGC CGCGGCCTGT
GGGCTGAAAG TGGCGAAACA CGGCAACCGT AGCGTCTCCA GTAAATCTGG TTCGTCCGAT
CTGCTGGCGG CGTTCGGTAT TAATCTTGAT ATGAACGCCG ATAAATCGCG CCAGGCGCTG
GATGAGTTAG GTGTATGTTT CCTCTTTGCG CCGAAGTATC ACACCGGATT CCGCCACGCG
ATGCCGGTTC GCCAGCAACT GAAAACCCGC ACCCTGTTCA ATGTGCTGGG GCCATTGATT
AACCCGGCGC ATCCGCCGCT GGCGTTAATT GGTGTTTATA GTCCGGAACT GGTGCTGCCG
ATTGCCGAAA CCTTGCGCGT GCTGGGGTAT CAACGCGCGG CGGTGGTGCA CAGCGGCGGG
ATGGATGAAG TTTCATTACA CGCGCCGACA ATCGTTGCCG AACTGCATGA CGGCGAAATT
AAAAGCTATC AGCTCACCGC AGAAGACTTT GGCCTGACAC CCTACCACCA GGAGCAACTG
GCAGGCGGAA CACCGGAAGA AAACCGTGAC ATTTTAACAC GTTTGTTACA AGGTAAAGGC
GACGCCGCCC ATGAAGCAGC CGTCGCTGCG AACGTCGCCA TGTTAATGCG CCTGCATGGC
CATGAAGATC TGCAAGCCAA TGCGCAAACC GTTCTTGAGG TACTGCGCAG TGGTTCCGCT
TACGACAGAG TCACCGCACT GGCGGCACGA GGGTAA
 
Protein sequence
MADILLLDNI DSFTYNLADQ LRSNGHNVVI YRNHIPAQTL IERLATMSNP VLMLSPGPGV 
PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF
AGLTNPLPVA RYHSLVGSNI PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA
RLLEQTLAWA QQKLEPANTL QPILEKLYQA QTLSQQESHQ LFSAVVRGEL KPEQLAAALV
SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC
GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHTGFRHA
MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG
MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQEQL AGGTPEENRD ILTRLLQGKG
DAAHEAAVAA NVAMLMRLHG HEDLQANAQT VLEVLRSGSA YDRVTALAAR G