Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1386 |
Symbol | |
ID | 6144608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1374296 |
End bp | 1375420 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616264 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001743444 |
Protein GI | 170679804 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.373829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC TGAGCCCTGA CTTTGTACTA CCCGAAAATT TTTGCGCTAA CCCGCAAGAG GCGTGGACCA TTCCTGCCCG TTTTTATACC GATCAGAACG CCTTTGAACA CGAAAAAGAG AGCGTCTTCG CCAAAAGCTG GATTTGCGTC GCTCACAGCA GCGAACTGGC GAATGCCAAT GATTATGTGA CGCGTGAGAT CATTGGCGAA AGCATTGTGC TGGTACGCGG TCGTGATAAG GTTTTGCGCG CGTTCTATAA CGTGTGTCCG CACCGTGGTC ATCAGTTGTT GAGCGGTGAA GGAAAAGCGA AAAATGTGAT TACCTGCCCA TATCACGCAT GGGCATTCAA ACTCGATGGC AACCTGGCCC ATGCACGTAA CTGCGAAAAC GTCGCCAATT TCGATAGCGA CAAAGCGCAA CTGGTTCCGG TGCGTCTGGA AGAATATGCC GGATTCGTCT TCATCAACAT GGACCCTAAC GCCACCAGCG TAGAAGATCA GTTACCCGGT CTGGGGGCGA AAGTGCTGGA AGCCTGCCCG GAAGTCCACG ACCTGAAACT GGCGGCCCGC TTTACCACCC GCACGCCTGC CAACTGGAAG AACATTGTCG ATAACTATCT CGAGTGCTAT CACTGTGGTC CGGCGCATCC AGGTTTCTCC GACTCCGTAC AGGTTGATCG TTACTGGCAC ACCATGCACG GTAACTGGAC GCTGCAATAC GGTTTCGCCA AACCGTCCGA ACAGTCTTTC AAATTCGAAG AGGGCACGGA TGCGGCATTC CACGGTTTCT GGCTGTGGCC GTGCACGATG CTGAACGTCA CCCCAATCAA AGGGATGATG ACGGCCATTT ATGAATTCCC AGTGGATTCT GAAACTACCC TGCAAAACTA CGATATTTAC TTCACCAATG AAGAGTTAAC CGACGAGCAA AAATCGCTGA TTGAGTGGTA TCGCGATGTG TTCCGTCCGG AAGATTTACG TCTGGTTGAA AGCGTACAGA AAGGGCTGAA ATCGCGTGGC TATCGTGGTC AGGGGCGCAT CATGGCCGAC AGTAGCGGTA GCGGAATTTC CGAACATGGT ATCGCCCATT TCCATAATCT GCTGGCGCAG GTGTTTAAGG ACTAA
|
Protein sequence | MSNLSPDFVL PENFCANPQE AWTIPARFYT DQNAFEHEKE SVFAKSWICV AHSSELANAN DYVTREIIGE SIVLVRGRDK VLRAFYNVCP HRGHQLLSGE GKAKNVITCP YHAWAFKLDG NLAHARNCEN VANFDSDKAQ LVPVRLEEYA GFVFINMDPN ATSVEDQLPG LGAKVLEACP EVHDLKLAAR FTTRTPANWK NIVDNYLECY HCGPAHPGFS DSVQVDRYWH TMHGNWTLQY GFAKPSEQSF KFEEGTDAAF HGFWLWPCTM LNVTPIKGMM TAIYEFPVDS ETTLQNYDIY FTNEELTDEQ KSLIEWYRDV FRPEDLRLVE SVQKGLKSRG YRGQGRIMAD SSGSGISEHG IAHFHNLLAQ VFKD
|
| |