Gene EcolC_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1948 
Symbol 
ID6068473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2152682 
End bp2154169 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content52% 
IMG OID641601360 
Productcysteine desulfurase activator complex subunit SufB 
Protein accessionYP_001724921 
Protein GI170019967 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID[TIGR01980] FeS assembly protein SufB 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000269575 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTCGTA ATACTGAAGC AACTGACGAT GTCAAAACCT GGACCGGCGG CCCGCTGAAT 
TATAAAGAAG GATTCTTCAC CCAGTTAGCC ACCGATGAGC TGGCAAAGGG GATAAACGAA
GAGGTGGTGC GCGCAATTTC GGCGAAGCGT AATGAGCCGG AGTGGATGCT GGAGTTTCGT
CTCAATGCCT ATCGCGCATG GCTGGAGATG GAAGAGCCGC ACTGGCTGAA AGCGCACTAC
GACAAGCTGA ATTATCAGGA TTACAGCTAC TACTCAGCAC CATCGTGCGG TAATTGTGAC
GACACTTGCG CGTCTGAACC CGGCGCGGTG CAGCAAACTG GCGCGAACGC CTTTTTAAGT
AAAGAGGTGG AGGCGGCGTT TGAGCAGTTG GGCGTTCCCG TGCGGGAAGG CAAAGAGGTG
GCGGTGGATG CCATTTTCGA CTCTGTTTCG GTTGCCACCA CTTATCGTGA AAAACTGGCG
GAGCAGGGAA TTATTTTCTG TTCCTTTGGT GAGGCGATCC ACGATCACCC GGAACTGGTG
CGTAAATATC TCGGCACCGT GGTACCGGGG AATGACAACT TCTTTGCCGC GCTTAATGCG
GCGGTAGCCT CTGACGGTAC GTTTATTTAT GTGCCTAAAG GTGTGCGCTG CCCTATGGAA
CTTTCCACCT ATTTTCGCAT TAACGCGGAA AAAACCGGGC AGTTTGAGCG CACCATTCTG
GTGGCCGACG AAGACAGCTA CGTCAGCTAC ATTGAAGGCT GTTCCGCTCC GGTGCGTGAC
AGCTATCAGT TACACGCGGC AGTGGTGGAA GTCATCATCC ATAAAAACGC CGAGGTGAAA
TATTCCACGG TACAAAACTG GTTCCCTGGC GATAACAACA CCGGCGGTAT TCTCAACTTC
GTCACCAAGC GTGCTTTGTG CGAAGGCGAA AACAGCAAAA TGTCATGGAC GCAATCAGAA
ACCGGGTCAG CGATTACGTG GAAATATCCC AGCTGCATTT TGCGCGGCGA TAACTCCATT
GGTGAGTTTT ACTCAGTGGC GCTGACCAGC GGTCATCAGC AAGCGGATAC CGGCACCAAG
ATGATCCACA TCGGTAAAAA CACCAAATCG ACCATTATCT CGAAAGGGAT CTCTGCCGGA
CATAGTCAGA ACAGTTATCG CGGCTTAGTG AAAATCATGC CGACGGCAAC CAATGCGCGC
AATTTCACTC AGTGCGACTC AATGCTGATT GGCGCTAATT GTGGGGCGCA TACCTTCCCG
TATGTTGAGT GTCGTAACAA TAGTGCGCAA CTGGAACACG AGGCAACGAC ATCACGTATT
GGTGAAGATC AACTGTTTTA CTGCCTGCAA CGCGGGATCA GCGAAGAAGA CGCCATCTCG
ATGATTGTTA ACGGTTTCTG CAAAGACGTG TTCTCGGAGC TGCCGTTGGA ATTTGCCGTT
GAAGCACAAA AACTCCTCGC CATCAGTCTT GAACACAGCG TCGGATAA
 
Protein sequence
MSRNTEATDD VKTWTGGPLN YKEGFFTQLA TDELAKGINE EVVRAISAKR NEPEWMLEFR 
LNAYRAWLEM EEPHWLKAHY DKLNYQDYSY YSAPSCGNCD DTCASEPGAV QQTGANAFLS
KEVEAAFEQL GVPVREGKEV AVDAIFDSVS VATTYREKLA EQGIIFCSFG EAIHDHPELV
RKYLGTVVPG NDNFFAALNA AVASDGTFIY VPKGVRCPME LSTYFRINAE KTGQFERTIL
VADEDSYVSY IEGCSAPVRD SYQLHAAVVE VIIHKNAEVK YSTVQNWFPG DNNTGGILNF
VTKRALCEGE NSKMSWTQSE TGSAITWKYP SCILRGDNSI GEFYSVALTS GHQQADTGTK
MIHIGKNTKS TIISKGISAG HSQNSYRGLV KIMPTATNAR NFTQCDSMLI GANCGAHTFP
YVECRNNSAQ LEHEATTSRI GEDQLFYCLQ RGISEEDAIS MIVNGFCKDV FSELPLEFAV
EAQKLLAISL EHSVG