Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1513 |
Symbol | sufB |
ID | 6147175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1498855 |
End bp | 1500342 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616391 |
Product | cysteine desulfurase activator complex subunit SufB |
Protein accession | YP_001743571 |
Protein GI | 170683248 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component |
TIGRFAM ID | [TIGR01980] FeS assembly protein SufB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000205188 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCGTA ATACTGAAGC AACTGACGAT GTCAAAACCT GGACCGGCGG CCCGCTGAAT TATAAAGAAG GATTCTTCAC CCAGTTAGCC ACCGATGAGC TGGCAAAGGG GATAAACGAA GAGGTGGTGC GCGCAATTTC GGCGAAGCGT AATGAGCCGG AGTGGATGCT GGAGTTTCGT CTCAACGCCT ATCGCGCATG GCTGGAGATG GAAGAACCGC ACTGGTTGAA AGCGCACTAC GACAAGCTGA ATTATCAGGA TTACAGCTAC TACTCAGCAC CATCGTGCGG TAATTGTGAC GACAACTGCG CGTCTGAACC CGGCGCGGTG CAGCAAACTG GTGCGAACGC CTTTTTAAGT AAAGAGGTGG AGGCGGCGTT TGAGCAGTTG GGCGTTCCCG TGCGGGAAGG CAAAGAGGTG GCGGTGGATG CCATTTTCGA CTCTGTTTCG GTTGCCACCA CTTATCGTGA AAAACTGGCG GAGCAGGGAA TTATTTTCTG TTCCTTTGGC GAGGCGATCC ACGATCACCC GGAACTTGTG CGTAAATATC TCGGCACCGT GGTGCCGGGG AATGACAACT TCTTTGCCGC ACTTAATGCG GCGGTAGCCT CTGACGGAAC GTTTATTTAT GTGCCTAAAG GCGTGCGCTG CCCGATGGAA CTTTCCACCT ATTTTCGCAT TAACGCGGAA AAAACAGGGC AGTTTGAGCG CACCATTCTG GTGGCCGACG AAGACAGCTA TGTCAGCTAC ATTGAAGGCT GTTCCGCTCC GGTGCGTGAC AGCTATCAGT TACACGCGGC GGTGGTTGAA GTCATCATCC ATAAAAACGC CGAGGTGAAA TATTCCACGG TACAAAACTG GTTCCCTGGC GATAACAACA CCGGCGGTAT TCTCAACTTC GTCACCAAGC GTGCTTTGTG CGAAGGCGAA AACAGCAAAA TGTCATGGAC GCAATCAGAA ACCGGGTCAG CGATTACGTG GAAATATCCC AGTTGCATTT TGCGCGGCGA TAACTCCATT GGTGAGTTTT ACTCAGTGGC ACTGACCAGC GGTCATCAGC AAGCGGATAC CGGCACCAAG ATGATCCACA TCGGTAAAAA CACCAAATCG ACCATTATCT CGAAAGGGGT CTCTGCCGGA CATAGTCAGA ACAGTTATCG CGGCTTAGTG AAAATCATGC CGACGGCAAC CAATGCGCGC AATTTCACTC AGTGCGACTC AATGCTGATT GGCGCTAATT GTGGGGCGCA TACCTTCCCG TATGTCGAAT GTCGCAATAA CAGCGCACAA CTGGAGCACG AGGCAACGAC ATCACGTATT GGTGAAGATC AACTGTTTTA CTGCCTGCAA CGCGGGATCA GCGAAGAAGA CGCCATCTCG ATGATTGTTA ACGGTTTCTG CAAAGACGTG TTCTCAGAAC TGCCGCTGGA ATTTGCCGTT GAAGCACAAA AACTCCTCGC CATCAGTCTT GAACACAGCG TCGGATAA
|
Protein sequence | MSRNTEATDD VKTWTGGPLN YKEGFFTQLA TDELAKGINE EVVRAISAKR NEPEWMLEFR LNAYRAWLEM EEPHWLKAHY DKLNYQDYSY YSAPSCGNCD DNCASEPGAV QQTGANAFLS KEVEAAFEQL GVPVREGKEV AVDAIFDSVS VATTYREKLA EQGIIFCSFG EAIHDHPELV RKYLGTVVPG NDNFFAALNA AVASDGTFIY VPKGVRCPME LSTYFRINAE KTGQFERTIL VADEDSYVSY IEGCSAPVRD SYQLHAAVVE VIIHKNAEVK YSTVQNWFPG DNNTGGILNF VTKRALCEGE NSKMSWTQSE TGSAITWKYP SCILRGDNSI GEFYSVALTS GHQQADTGTK MIHIGKNTKS TIISKGVSAG HSQNSYRGLV KIMPTATNAR NFTQCDSMLI GANCGAHTFP YVECRNNSAQ LEHEATTSRI GEDQLFYCLQ RGISEEDAIS MIVNGFCKDV FSELPLEFAV EAQKLLAISL EHSVG
|
| |