Gene ECH74115_2395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2395 
SymbolsufD 
ID6969229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2266526 
End bp2267797 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content53% 
IMG OID643386268 
Productcysteine desulfurase activator complex subunit SufD 
Protein accessionYP_002270750 
Protein GI209397107 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID[TIGR01981] FeS assembly protein SufD, group 1 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.087197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000115041 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGGCT TACCGAACAG CAGTAACGCG CTGCAACAGT GGCATCACTT GTTTGAAGCT 
GAAGGGACAA AACGCTCCCC GCAAGCACAG CAGCATTTAC AACAATTGCT GCGTACCGGA
CTGCCGACAC GTAAACATGA AAACTGGAAA TATACGCCGC TGGAAGGGCT GACCAATAGT
CAGTTTGTCA GCATTGCGGG AGAGATATCC CCACAGCAGC GAGATGCCTT AGCGTTAACG
TTAGACGCTG TGCGGCTGGT ATTTGTCGAT GGACGTTACG TGCCTGCGCT GAGCGATGCG
ACTGAAGGCA GCGGGTATGA AGTGAGCATT AACGACGACC GTCAGGGGTT ACCCGATGCT
ATTCAGGCGG AAGTGTTTCT GCATTTGACG GAAAGCCTGG CACAAAGCGT GACGCATATC
GCCGTGAAGC GCGGTCAACG GCCGGCAAAG CCATTGCTGT TAATGCATAT CACCCAGGGC
GTGGCAGGTG AAGAGGTGAA CACTGCCCAT TACCGACATC ATCTGGAGCT GGCGGAAGGT
GCCGAAGCAA CGGTGATCGA ACATTTTGTC AGCCTTAATG ATGCTCGTCA TTTTACCGGG
GCACGGTTCT CTATCAACGT CGCAGCGAAT GCCCACTTGC AGCATATCAA GCTGGCGTTT
GAAAACCCGC TCAGTCACCA CTTTGCCCAT AACGATTTGT TGCTGGCTGA CGATGCCACC
GCATTTAGCC ACAGTTTCCT GCTGGGTGGC GCAGTGTTAC GACACAACAC CAGTACGCAA
CTCAATGGCG AAAACTGCAC GCTGCGGATC AATAGCCTGG CGATGCCGGT GAAAAACGAG
GTGTGTGATA CCCGTACCTG GCTGGAACAC AATAAAGGTT TTTGTAACAG CCGACAGTTG
CACAAAACTA TCGTCAGCGA CAAAGGCCGC GCGGTATTTA ACGGTTTGAT CAACGTCGCG
CAGCACGCCA TCAAAACGGA TGGTCAGATG ACCAACAATA ATCTGCTGAT GGGCAAACTG
GCGGAAGTGG ATACGAAACC GCAGCTGGAA ATCTATGCAG ATGATGTGAA ATGCAGCCAC
GGCGCGACGG TGGGGCGTAT TGATGATGAA CAGATGTTCT ATCTGCGCTC GCGCGGGATC
AATCAGCAGG ATGCCCAGCA GATGATCATT TACGCCTTTG CTGCCGAACT GACGGAAGCA
CTGCGTGATG AGGGGCTTAA ACAGCAGGTG CTGACCCGAA TCGGTCAACG GCTGCCAGGA
GGTGCAAGAT GA
 
Protein sequence
MAGLPNSSNA LQQWHHLFEA EGTKRSPQAQ QHLQQLLRTG LPTRKHENWK YTPLEGLTNS 
QFVSIAGEIS PQQRDALALT LDAVRLVFVD GRYVPALSDA TEGSGYEVSI NDDRQGLPDA
IQAEVFLHLT ESLAQSVTHI AVKRGQRPAK PLLLMHITQG VAGEEVNTAH YRHHLELAEG
AEATVIEHFV SLNDARHFTG ARFSINVAAN AHLQHIKLAF ENPLSHHFAH NDLLLADDAT
AFSHSFLLGG AVLRHNTSTQ LNGENCTLRI NSLAMPVKNE VCDTRTWLEH NKGFCNSRQL
HKTIVSDKGR AVFNGLINVA QHAIKTDGQM TNNNLLMGKL AEVDTKPQLE IYADDVKCSH
GATVGRIDDE QMFYLRSRGI NQQDAQQMII YAFAAELTEA LRDEGLKQQV LTRIGQRLPG
GAR