Gene EcSMS35_0339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0339 
Symbol 
ID6144079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp348158 
End bp349585 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content51% 
IMG OID641615235 
Productiron-sulfur cluster binding protein 
Protein accessionYP_001742443 
Protein GI170681838 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA AAACCAGTAA TACAGATTTT AAGACACGCA TCCGTCAGCA AATTGAAGAT 
CCGATCATGC GCAAAGCGGT GGCAAACGCG CAGCAGCGTA TCGGGGCAAA TCGGCAAAAA
ATGGTCGATG AATTGGGGCA CTGGGAGGAG TGGCGCGATC GGGCCGCCCA GATACGTGAT
CATGTTCTGA GTAATCTCGA CGCTTATCTG TACCAGCTCT CAGAAAAAGT GACGCAAAAC
GGCGGTCACG TCTATTTTGC AAAAACCAAA GAAGACGCTA CCCGCTACAT TTTACAGGTT
GCCCAACGCA AAAATGCCCG GAAGGTGGTG AAATCTAAAT CGATGGTGAC CGAAGAGATT
GGTGTCAATC ATGTGTTGCA GGATGCTGGC ATTCAGGTGA TTGAAACCGA TCTGGGTGAA
TACATTCTCC AGCTGGATCA AGATCCGCCC TCTCATGTTG TGGTCCCGGC AATTCATAAA
GATCGCCATC AGATCCGTCG GGTGCTACAC GAACGTCTGG GCTATGAGGG GTCGGAAACG
CCTGAAGCAA TGACCTTATT CATCCGGCAA AAAATCCGCG AAGATTTCCT CAGTGCTGAA
ATAGGTATTA CCGGCTGTAA TTTCGCGGTG GCAGAGACCG GTTCGGTATG CCTGGTGACC
AATGAAGGTA ATGCGCGAAT GTGTACCACG CTGCCTAAAA CGCATATTGC AGTGATGGGA
ATGGAGCGTA TTGCCCCCAC GTTTGCCGAG GTAGATGTAT TGATCACCAT GCTGGCGCGC
AGTGCCGTTG GTGCACGTTT GACGGGATAC AACACCTGGC TGACAGGACC GCGCGAAGCG
GGGCACGTTG ATGGTCCTGA AGAGTTTCAT CTGGTTATTG TCGATAACGG GCGTTCTGAG
GTGCTGGCCT CTGAATTTCG GGATGTGCTG CGCTGTATTC GCTGCGGGGC TTGTATGAAT
ACTTGTCCGG CATATCGCCA TATTGGCGGT CATGGATATG GCTCTATTTA TCCAGGGCCA
ATTGGTGCGG TGATTTCTCC GCTACTTGGC GGCTATAAAG ATTTTAAAGA TTTACCCTAC
GCCTGCTCTT TATGCACCGC TTGTGACAGC GTGTGTCCGG TGCGTATTCC GCTGTCAAAA
CTGATTTTGC GTCATCGTCG GGTGATGGCT GAAAAAGGGA TCACCGCAAA AGCAGAGCAA
CGGGCGATAA AAATGTTCGC TTATGCCAAT AGTCATCCAG GATTGTGGAA AGTCGGGATG
ATGGCCGGCG CTCATGCGGC AAGCTGGTTT ATCAATGGCG GCAAAACACC ACTCAAATTT
GGCGCGATTA GCGACTGGAT GGAAGCACGC GATCTTCCTG AAGCTGACGG AGAGAGTTTC
CGTAGTTGGT TTAAGAAACA TCAGGCGCAG GAGAAAAAGA ATGGATAA
 
Protein sequence
MSIKTSNTDF KTRIRQQIED PIMRKAVANA QQRIGANRQK MVDELGHWEE WRDRAAQIRD 
HVLSNLDAYL YQLSEKVTQN GGHVYFAKTK EDATRYILQV AQRKNARKVV KSKSMVTEEI
GVNHVLQDAG IQVIETDLGE YILQLDQDPP SHVVVPAIHK DRHQIRRVLH ERLGYEGSET
PEAMTLFIRQ KIREDFLSAE IGITGCNFAV AETGSVCLVT NEGNARMCTT LPKTHIAVMG
MERIAPTFAE VDVLITMLAR SAVGARLTGY NTWLTGPREA GHVDGPEEFH LVIVDNGRSE
VLASEFRDVL RCIRCGACMN TCPAYRHIGG HGYGSIYPGP IGAVISPLLG GYKDFKDLPY
ACSLCTACDS VCPVRIPLSK LILRHRRVMA EKGITAKAEQ RAIKMFAYAN SHPGLWKVGM
MAGAHAASWF INGGKTPLKF GAISDWMEAR DLPEADGESF RSWFKKHQAQ EKKNG