Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2312 |
Symbol | |
ID | 6146114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2343086 |
End bp | 2344024 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617186 |
Product | indigoidine synthase A like protein |
Protein accession | YP_001744359 |
Protein GI | 170682601 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2313] Uncharacterized enzyme involved in pigment biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.085961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000230697 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCTGAAT TAAAAATTTC CCCTGAATTA TTACAAATCT CCCCGGAAGT GCAGGACGCT TTAAAAAACA AAAAACCGGT TGTGGCGCTG GAATCGACTA TTATTTCTCA CGGGATGCCG TTCCCACAAA ATGCCCAGAC CGCAATTGAA GTTGAAGAAA CTATTCGTAA ACAGGGCGCT GTACCTGCCA CTATCGCCAT TATTGGCGGC GTGATGAAAG TGGGTTTAAG CAAAGAAGAA ATTGAATTGC TGGGTCGCGA AGGGCATAAC GTGACCAAAG TTAGTCGTCG CGATTTACCT TTTGTCGTTG CCGCCGGAAA AAATGGCGCA ACCACTGTGG CTTCAACGAT GATTATTGCG GCGCTTGCCG GAATTAAAGT ATTTGCCACC GGGGGAATTG GTGGTGTGCA TCGCGGGGCG GAACATACCT TCGATATTTC TGCCGATTTG CAAGAACTGG CAAATACTAA TGTCACCGTT GTTTGTGCCG GTGCAAAATC TATTCTCGAT TTAGGATTAA CAACAGAATA TTTAGAAACC TTCGGTGTGC CGTTAATTGG CTATCAGACA AAAGCGCTGC CTGCGTTTTT CTGCCGTACC AGCCCATTTG AAGTTAGTAT TCGCCTTGAT AGTGCCACGG AAATCGCTCG TGCAATGGCG GTGAAATGGC AAAGCGGTCT GAACGGTGGC CTGGTGGTGG CAAATCCGAT CCCGGAACAG TTTGCGATGC CGGAGGAATC TATCAATGCA GCCATAGATC AAGCCGTCGC CGAAGCCGAA GAGCAGGGCG TTATTGGTAA AGAAAGCACA CCGTTCCTGC TGGCTCGTGT TGCTGAACTG ACTGGCGGTG ACAGCCTGAA ATCCAACATC CAACTGGTGT TCAACAACGC CATTCTGGCG AGCGAAATTG CCAAAGAATA CCAGCGTCTC GCGGGTTAA
|
Protein sequence | MSELKISPEL LQISPEVQDA LKNKKPVVAL ESTIISHGMP FPQNAQTAIE VEETIRKQGA VPATIAIIGG VMKVGLSKEE IELLGREGHN VTKVSRRDLP FVVAAGKNGA TTVASTMIIA ALAGIKVFAT GGIGGVHRGA EHTFDISADL QELANTNVTV VCAGAKSILD LGLTTEYLET FGVPLIGYQT KALPAFFCRT SPFEVSIRLD SATEIARAMA VKWQSGLNGG LVVANPIPEQ FAMPEESINA AIDQAVAEAE EQGVIGKEST PFLLARVAEL TGGDSLKSNI QLVFNNAILA SEIAKEYQRL AG
|
| |