Gene Information |
Locus tag | EcSMS35_3308 |
Symbol | sufI |
ID | 6146929 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3384035 |
End bp | 3385447 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618137 |
Product | repressor protein for FtsI |
Protein accession | YP_001745287 |
Protein GI | 170682156 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
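The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes one terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3384035, 3385447)
print(length, protein_length(length))  # 1413 470
```

Both values agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields in the record.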
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.468939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0130524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTCA GTCGGCGTCA GTTCATTCAG GCATCGGGGA TTGCACTTTG TGCAGGCGCT GTTCCCCTGA AGGCCAGCGC AGCCGGGCAA CAGCAACCGC TACCCGTTCC GCCGCTGCTT GAATCTCGCC GTGGGCAACC GCTGTTTATG ACTGTACAAC GTGCGCACTG GTCATTTACG CCAGGGACAC GCGCGTCGGT TTGGGGAATC AATGGTCGTT ACCTGGGGCC GACTATCCGC GTCTGGAAGG GCGACGATGT TAAGCTTATT TACAGCAACC GCCTGACAGA AAATGTCTCA ATGACGGTGG CCGGGCTACA GGTACCTGGC CCGCTGATGG GCGGTCCGGC ACGGATGATG TCGCCAAACG CTGACTGGGC ACCCGTACTG CCTATTCGCC AGAACGCAGC TACTCTGTGG TATCACGCCA ATACTCCCAA CCGCACGGCT CAGCAGGTCT ATAACGGCCT TGCCGGAATG TGGCTGGTGG AAGATGAAGT CAGCAAGTCG CTGCCTATCC CCAACCATTA TGGTGTGGAT GATTTTCCGG TCATTATTCA GGATAAACGG CTGGATAACT TTGGTACGCC AGAATACAAC GAACCGGGAA GCGGCGGCTT TGTTGGCGAT ACGCTGCTGG TTAACGGTGT ACAAAGCCCG TACGTTGAAG TCTCGCGTGG CTGGGTGCGC TTGCGTCTGT TGAACGCGTC GAACTCTCGT CGCTATCAAC TACAGATGAG CGATGGTCGC CCGTTACATG TGATTTCTGG CGATCAGGGA TTCCTGCCCG CTCCTGTATC GGTGAAGCAA CTTTCGCTGG CACCGGGCGA GCGTCGTGAG ATTCTGGTGG ATATGAGCAA CGGTGATGAA GTGTCGATCA CCTGTGGTGA AGCGGCGAGC ATTGTTGATC GTATTCGCGG CTTCTTTGAG CCATCCAGTA TTCTGGTTTC TACCCTGGTG CTAACGCTGC GCCCAACCGG CCTTCTGCCG CTGGTCACAG ACAGTCTTCC GATGCGCTTG CTGCCAACTG AAATTATGGC CGGTTCGCCG ATTCGAAGTC GTGATATCAG TCTGGGTGAT GACCCGGGGA TTAATGGGCA GTTGTGGGAC GTCAACCGTA TTGATGTTAC TGCGCAGCAA GGAACGTGGG AACGCTGGAC GGTACGCGCG GACGAGCCGC AAGCGTTCCA TATTGAAGGC GTGATGTTCC AGATCCGTAA CGTGAATGGT GCGATGCCGT TCCCGGAAGA CAGAGGCTGG AAAGATACCG TTTGGGTTGA CGGACAAGTG GAGCTGCTGG TTTATTTCGG TCAGCCTTCC TGGGCGCACT TCCCGTTCTA CTTCAACAGT CAGACGCTGG AAATGGCGGA CCGTGGTTCG ATTGGGCAAC TGTTGGTCAA TCCGGTACCG TAA
|
Protein sequence | MSLSRRQFIQ ASGIALCAGA VPLKASAAGQ QQPLPVPPLL ESRRGQPLFM TVQRAHWSFT PGTRASVWGI NGRYLGPTIR VWKGDDVKLI YSNRLTENVS MTVAGLQVPG PLMGGPARMM SPNADWAPVL PIRQNAATLW YHANTPNRTA QQVYNGLAGM WLVEDEVSKS LPIPNHYGVD DFPVIIQDKR LDNFGTPEYN EPGSGGFVGD TLLVNGVQSP YVEVSRGWVR LRLLNASNSR RYQLQMSDGR PLHVISGDQG FLPAPVSVKQ LSLAPGERRE ILVDMSNGDE VSITCGEAAS IVDRIRGFFE PSSILVSTLV LTLRPTGLLP LVTDSLPMRL LPTEIMAGSP IRSRDISLGD DPGINGQLWD VNRIDVTAQQ GTWERWTVRA DEPQAFHIEG VMFQIRNVNG AMPFPEDRGW KDTVWVDGQV ELLVYFGQPS WAHFPFYFNS QTLEMADRGS IGQLLVNPVP
|
| |
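The sequences above are printed in space-separated blocks of ten characters. Before any downstream use, the blocks usually need to be joined back into one string. A minimal sketch, assuming whitespace is the only separator; `normalize` and `gc_fraction` are hypothetical helper names, and the GC calculation is one plausible way to reproduce a field such as GC content, not necessarily the database's own method.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGTCACTCA GTCGGCGTCA GTTCATTCAG"
print(len(normalize(first_blocks)))  # 30
```

Applied to the full Gene sequence field, `len(normalize(...))` should match the Gene Length field if the record is internally consistent.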