Gene EcSMS35_2312 details

Gene Information

Locus tag: EcSMS35_2312
Symbol:
ID: 6146114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2343086
End bp: 2344024
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 49%
IMG OID: 641617186
Product: indigoidine synthase A like protein
Protein accession: YP_001744359
Protein GI: 170682601
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2313] Uncharacterized enzyme involved in pigment biosynthesis
TIGRFAM ID:
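
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2343086, 2344024)
print(length)                  # 939
print(protein_length(length))  # 312
```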


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.085961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.000230697
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCTGAAT TAAAAATTTC CCCTGAATTA TTACAAATCT CCCCGGAAGT GCAGGACGCT 
TTAAAAAACA AAAAACCGGT TGTGGCGCTG GAATCGACTA TTATTTCTCA CGGGATGCCG
TTCCCACAAA ATGCCCAGAC CGCAATTGAA GTTGAAGAAA CTATTCGTAA ACAGGGCGCT
GTACCTGCCA CTATCGCCAT TATTGGCGGC GTGATGAAAG TGGGTTTAAG CAAAGAAGAA
ATTGAATTGC TGGGTCGCGA AGGGCATAAC GTGACCAAAG TTAGTCGTCG CGATTTACCT
TTTGTCGTTG CCGCCGGAAA AAATGGCGCA ACCACTGTGG CTTCAACGAT GATTATTGCG
GCGCTTGCCG GAATTAAAGT ATTTGCCACC GGGGGAATTG GTGGTGTGCA TCGCGGGGCG
GAACATACCT TCGATATTTC TGCCGATTTG CAAGAACTGG CAAATACTAA TGTCACCGTT
GTTTGTGCCG GTGCAAAATC TATTCTCGAT TTAGGATTAA CAACAGAATA TTTAGAAACC
TTCGGTGTGC CGTTAATTGG CTATCAGACA AAAGCGCTGC CTGCGTTTTT CTGCCGTACC
AGCCCATTTG AAGTTAGTAT TCGCCTTGAT AGTGCCACGG AAATCGCTCG TGCAATGGCG
GTGAAATGGC AAAGCGGTCT GAACGGTGGC CTGGTGGTGG CAAATCCGAT CCCGGAACAG
TTTGCGATGC CGGAGGAATC TATCAATGCA GCCATAGATC AAGCCGTCGC CGAAGCCGAA
GAGCAGGGCG TTATTGGTAA AGAAAGCACA CCGTTCCTGC TGGCTCGTGT TGCTGAACTG
ACTGGCGGTG ACAGCCTGAA ATCCAACATC CAACTGGTGT TCAACAACGC CATTCTGGCG
AGCGAAATTG CCAAAGAATA CCAGCGTCTC GCGGGTTAA
 
Protein sequence
MSELKISPEL LQISPEVQDA LKNKKPVVAL ESTIISHGMP FPQNAQTAIE VEETIRKQGA 
VPATIAIIGG VMKVGLSKEE IELLGREGHN VTKVSRRDLP FVVAAGKNGA TTVASTMIIA
ALAGIKVFAT GGIGGVHRGA EHTFDISADL QELANTNVTV VCAGAKSILD LGLTTEYLET
FGVPLIGYQT KALPAFFCRT SPFEVSIRLD SATEIARAMA VKWQSGLNGG LVVANPIPEQ
FAMPEESINA AIDQAVAEAE EQGVIGKEST PFLLARVAEL TGGDSLKSNI QLVFNNAILA
SEIAKEYQRL AG
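
The relationship between the gene sequence, the protein sequence, and the GC content can be illustrated with a minimal sketch. It assumes the sequence is given in the 10-base blocks shown above, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGTCTGAAT TAAAAATTTC CCCTGAATTA"
print(translate(first_block))  # MSELKISPEL
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the reported GC content.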