Gene EcHS_A2302 details


Gene Information

Locus tag: EcHS_A2302
Symbol:
ID: 5594943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2299606
End bp: 2300544
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 49%
IMG OID: 640921428
Product: indigoidine synthase A like protein
Protein accession: YP_001458964
Protein GI: 157161646
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2313] Uncharacterized enzyme involved in pigment biosynthesis
TIGRFAM ID:
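
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the 939 bp include the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2299606
end_bp = 2300544

# Assumption: coordinates are 1-based and inclusive,
# so both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which does not encode an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 939 312
```

Under these assumptions the derived values match the reported Gene Length (939 bp) and Protein Length (312 aa).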


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value: 0.146223
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAAAT TAAAAATTTC CCCTGAATTA TTACAAATTT CCCCGGAAGT GCAGGAAGCT 
TTAAAAAACA AAAAACCGGT TGTGGCGCTG GAATCGACCA TTATTTCTCA CGGGATGCCG
TTCCCACAAA ATGCCCAGAC CGCAATTGAA GTAGAAGAAA CTATTCGTAA ACAGGGCGCA
GTACCTGCCA CTATCGCCAT TATTGGCGGC GTGATGAAAG TGGGTTTAAG TAAAGAAGAA
ATTGAATTAC TGGGTCGTGA AGGGCATAAC GTGACCAAAG TTAGTCGTCG CGATTTACCT
TTTGTTGTTG CCGCCGGAAA AAATGGCGCA ACCACTGTGG CTTCAACGAT GATTATTGCG
GCGCTTGCCG GAATTAAAGT ATTTGCCACC GGGGGAATTG GTGGTGTGCA TCGCGGGGCG
GAACATACCT TCGATATTTC TGCCGATTTG CAAGAACTGG CAAATACTAA TGTCACCGTT
GTTTGTGCCG GGGCGAAATC TATTCTCGAT TTAGGATTAA CCACTGAGTA TTTAGAAACC
TTCGGTGTGC CGTTAATTGG CTATCAGACT AAAGCGCTGC CTGCGTTTTT CTGCCGTACC
AGCTCGTTTG ACGTCAGCAT TCGTCTCGAC AGCGCCAGTG AAATTGCCCG TGCAATGGCG
GTGAAATGGC AAAGCGGGCT GAACGGTGGC CTCGTGGTAG CGAACCCGAT CCCGGAACAG
TTTGCGATGC CGGAGGAATC TATCAATGCA GCCATAGATC AAGCCGTCGC CGAAGCCGAA
GAGCAGGGCG TTATTGGTAA AGAAAGTACA CCGTTCCTGC TGGCGCGCGT TGCTGAACTG
ACCGGCGGTG ACAGCCTGAA ATCCAATATC CAGCTGGTGT TCAACAACGC CATTCTGGCG
AGTGAAATTG CCAAAGAATA CCAGCGTCTC GCGGGTTAA
 
Protein sequence
MSKLKISPEL LQISPEVQEA LKNKKPVVAL ESTIISHGMP FPQNAQTAIE VEETIRKQGA 
VPATIAIIGG VMKVGLSKEE IELLGREGHN VTKVSRRDLP FVVAAGKNGA TTVASTMIIA
ALAGIKVFAT GGIGGVHRGA EHTFDISADL QELANTNVTV VCAGAKSILD LGLTTEYLET
FGVPLIGYQT KALPAFFCRT SSFDVSIRLD SASEIARAMA VKWQSGLNGG LVVANPIPEQ
FAMPEESINA AIDQAVAEAE EQGVIGKEST PFLLARVAEL TGGDSLKSNI QLVFNNAILA
SEIAKEYQRL AG
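
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below is one way to do this in plain Python, not the method used to produce this record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper name `translate` and the constants are hypothetical. Only the first line and the final nine bases of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line and last nine bases of the gene sequence, copied from the record.
first_line = "ATGTCTAAAT TAAAAATTTC CCCTGAATTA TTACAAATTT CCCCGGAAGT GCAGGAAGCT"
last_codons = "GCGGGTTAA"

print(translate(first_line))   # MSKLKISPELLQISPEVQEA
print(translate(last_codons))  # AG
```

Both results agree with the start (MSKLKISPEL LQISPEVQEA) and end (AG) of the listed protein sequence; the final TAA codon is the stop and contributes no residue.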