Gene EcolC_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1950 
Symbol 
ID6068456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2154900 
End bp2156171 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content53% 
IMG OID641601362 
Productcysteine desulfurase activator complex subunit SufD 
Protein accessionYP_001724923 
Protein GI170019969 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID[TIGR01981] FeS assembly protein SufD, group 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0520002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000220398 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGGCT TACCGAACAG CAGTAACGCG CTGCAACAGT GGCATCACTT GTTTGAAGCT 
GACGGAGCGA AACGCTCTCC GCAAGCACAG CAGCATTTAC AACAATTGCT GCGTACCGGA
CTGCCGACAC GTAAACATGA AAACTGGAAA TATACGCCGC TGGAAGGGCT GACCAATAGC
CAGTTTGTCA GCATTGCGGG AGAGATATCC CCACAGCAGC GTGATGCCTT AGCGTTAACA
TTAGACGCTG TGCGGCTGGT TTTTGTCGAT GGACGTTACG TGTCAGCACT GAGCGATGCG
ACTGAAGGCA GCGGGTATGA AGTGAGCATT AACGACGACC GTCAGGGGGT ACCCGACGCT
ATTCAGGCGG AAGTGTTTCT GCATTTGACG GAAAGCCTGG CACAAAGCGT GACGCATATC
GCCGTGAAGC GCGGTCAACG GCCGGCAAAG CCATTGCTGT TAATGCATAT TACCCAGGGC
GTGGCAGGTG AAGAGGTCAA CACTGCCCAT TACCGACATC ATCTGGATCT GGCGGAAGGT
GCCGAAGCAA CGGTGATCGA ACATTTTGTC AGCCTTAATG ATGCTCGTCA CTTCACCGGC
GCACGGTTCA CTATCAATGT CGCAGCGAAC GCCCACTTGC AGCATATCAA GCTGGCGTTT
GAAAACCCGG TCAGTCACCA CTTTGCCCAT AACGATTTGT TGCTGGCTGA CGATGCCACC
GCATTTAGCC ATAGTTTCCT GCTGGGTGGC GCAGTGTTAC GACACAACAC CAGTACGCAA
CTCAATGGCG AAAACAGCAC GCTGCGGATC AATAGCCTGG CGATGCCGGT GAAAAACGAG
GTGTGTGATA CCCGCACCTG GCTGGAACAC AATAAAGGTT TTTGTAACAG CCGACAGTTG
CACAAAACTA TCGTCAGCGA CAAAGGCCGC GCGGTATTTA ACGGTTTGAT CAACGTCGCG
CAGCACGCCA TCAAAACGGA TGGTCAGATG ACCAACAATA ATCTGCTGAT GGGCAAACTG
GCGGAAGTGG ATACGAAACC GCAGCTGGAA ATCTATGCAG ATGATGTGAA ATGCAGCCAC
GGCGCGACGG TGGGGCGTAT TGATGATGAA CAGATGTTCT ATCTGCGCTC GCGCGGGATC
AATCAGCAGG ATGCCCAGCA GATGATCATT TACGCCTTTG CTGCTGAACT GACGGAAGCA
CTGCGTGATG AGGGGCTTAA ACAGCAGGTG CTGGCCCGAA TCGGTCAACG ACTGCCAGGA
GGTGTAAGAT GA
 
Protein sequence
MAGLPNSSNA LQQWHHLFEA DGAKRSPQAQ QHLQQLLRTG LPTRKHENWK YTPLEGLTNS 
QFVSIAGEIS PQQRDALALT LDAVRLVFVD GRYVSALSDA TEGSGYEVSI NDDRQGVPDA
IQAEVFLHLT ESLAQSVTHI AVKRGQRPAK PLLLMHITQG VAGEEVNTAH YRHHLDLAEG
AEATVIEHFV SLNDARHFTG ARFTINVAAN AHLQHIKLAF ENPVSHHFAH NDLLLADDAT
AFSHSFLLGG AVLRHNTSTQ LNGENSTLRI NSLAMPVKNE VCDTRTWLEH NKGFCNSRQL
HKTIVSDKGR AVFNGLINVA QHAIKTDGQM TNNNLLMGKL AEVDTKPQLE IYADDVKCSH
GATVGRIDDE QMFYLRSRGI NQQDAQQMII YAFAAELTEA LRDEGLKQQV LARIGQRLPG
GVR