Gene EcSMS35_3300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3300 
Symbol 
ID6145481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3374020 
End bp3376239 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content54% 
IMG OID641618130 
Producthypothetical protein 
Protein accessionYP_001745280 
Protein GI170681820 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0672097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTA TCTCCCTGAT CCAACCGGAT CGCGACCTGT TCTCCTGGCC GCAGTACTGG 
GCCGCCTGTT TTGGACCGGC ACCGTTTTTG CCGATGTCAC GTGAAGAGAT GGATCAACTT
GGCTGGGATA GCTGCGACAT CATTTTGGTT ACTGGCGACG CGTATGTCGA TCACCCAAGC
TTCGGGATGG CGATTTGCGG TCGTATGCTG GAAGCGCAGG GCTTTCGCGT CGGGATCATC
GCCCAGCCAG ACTGGAGCAG CAAAGACGAC TTTATGCGTC TGGGTAAACC GAATCTGTTT
TTCGGCGTTA CTGCTGGCAA CATGGACTCG ATGATCAACC GCTATACCGC CGATCGCCGT
TTACGTCATG ACGATGCCTA CACGCCAGAT AATGTCGCGG GTAAGCGTCC GGATCGCGCC
ACACTGGTTT ATACCCAACG TTGTAAAGAG GCGTGGAAAG ATGTGCCGGT GATCCTCGGC
GGTATTGAGG CCAGCCTGCG CCGTACCGCG CATTATGATT ACTGGTCCGA TACCGTGCGC
CGTTCCGTGT TGGTGGATTC GAAAGCCGAC ATGCTGATGT TTGGTAACGG TGAGCGTCCG
CTGGTGGAAG TGGCGCACCG TCTGGCGATG GGCGAGCCGA TTAGTGAAAT CCGCGATGTG
CGTAATACCG CGATTATCGT AAAAGAGGCG TTGCCAGGCT GGAGCGGCGT GGATTCCACC
CGTCTTGATA CCCCAGGGAA AATCGACCCA ATCCCGCATC CGTATGGCGA AGATTTGCCG
TGCGCGGATA ACAAACCGGT AGCTCCGAAA AAGCAGGAAG CCAAAGCTGT AACCGTGCAG
CCACCACGCC CGAAACCGTG GGAAAAAACC TACGTGTTGC TGCCTTCTTT CGAGAAAGTG
AAGGGCGATA AAGTGCTGTA CGCCCATGCT TCGCGCATTC TGCACCACGA AACTAACCCA
GGCTGCGCCC GCGCATTGAT GCAAAAACAT GGCGACCGCT ATGTATGGAT CAACCCGCCT
GCTATTCCGC TTTCTACCGA AGAGATGGAT AGCGTCTTTG CGCTGCCGTA CAAGCGCGTG
CCGCATCCGG CTTACGGTAA TGCCCGAATT CCGGCTTACG AAATGATTCG TTTCTCGGTC
AACATTATGC GTGGCTGCTT TGGCGGCTGC TCTTTCTGTT CTATTACCGA GCACGAAGGG
CGCATTATTC AGAGCCGTTC CGAAGATTCG ATTATTAATG AGATCGAAGC GATCCGCGAC
ACTGTTCCAG GTTTTACGGG CGTGATTTCC GATCTCGGTG GGCCTACTGC CAACATGTAT
ATGTTGCGCT GCAAATCGCC ACGCGCTGAG CAAACCTGCC GTCGTTTGTC GTGCGTTTAT
CCGGATATTT GTCCGCACAT GGACACTAAC CATGAACCGA CCATCAACCT CTATCGCCGC
GCTCGTGATC TGAAAGGCAT TAAAAAGATC CTCATCGCCT CTGGTGTGCG TTATGACATC
GCCGTAGAAG ATCCGCGCTA TATCAAAGAG CTGGCGACCC ATCACGTCGG CGGTTATCTG
AAGATTGCCC CGGAACATAC CGAAGAAGGG CCGTTATCGA AGATGATGAA GCCGGGCATG
GGCAGCTATG ACCGCTTTAA AGAGCTGTTC GATACTTACT CGAAACAGGC AGGTAAAGAA
CAGTATCTGA TCCCGTATTT CATCTCCGCG CACCCGGGTA CGCGTGATGA AGATATGGTG
AATCTGGCGC TGTGGCTGAA AAAGCATCGT TTCCGTCTCG ACCAGGTACA GAACTTCTAC
CCATCGCCGC TGGCTAACTC GACCACCATG TATTACACCG GGAAAAACCC GCTGGCGAAG
ATTGGTTATA AGAGTGAAGA CGTCTTCGTA CCGAAGGGCG ACAAACAGCG TCGTTTGCAT
AAAGCGTTGT TGCGTTACCA CGATCCGGCA AACTGGCCGT TAATCCGTCA GGCGCTGGAA
GCGATGGGCA AAAAGCATCT GATTGGCAGC CGTCGCGATT GCTTAGTGCC TGCGCCAACC
ATTGAAGAGA TGCGTGAAGC TCGCCGCCAG AACCGCAATA CCCGTCCGGC GTTGACTAAA
CATACGCCGA TGGCGACCCA GCGTCAGACG CCTGCTACGG CAAAAAAAGC GTCGTCTACG
CAATCTCGCC TGCAGAATGC TGGTGCGAAG AAACGCCCTA AAGCGGCGGT TGGACGTTAA
 
Protein sequence
MSSISLIQPD RDLFSWPQYW AACFGPAPFL PMSREEMDQL GWDSCDIILV TGDAYVDHPS 
FGMAICGRML EAQGFRVGII AQPDWSSKDD FMRLGKPNLF FGVTAGNMDS MINRYTADRR
LRHDDAYTPD NVAGKRPDRA TLVYTQRCKE AWKDVPVILG GIEASLRRTA HYDYWSDTVR
RSVLVDSKAD MLMFGNGERP LVEVAHRLAM GEPISEIRDV RNTAIIVKEA LPGWSGVDST
RLDTPGKIDP IPHPYGEDLP CADNKPVAPK KQEAKAVTVQ PPRPKPWEKT YVLLPSFEKV
KGDKVLYAHA SRILHHETNP GCARALMQKH GDRYVWINPP AIPLSTEEMD SVFALPYKRV
PHPAYGNARI PAYEMIRFSV NIMRGCFGGC SFCSITEHEG RIIQSRSEDS IINEIEAIRD
TVPGFTGVIS DLGGPTANMY MLRCKSPRAE QTCRRLSCVY PDICPHMDTN HEPTINLYRR
ARDLKGIKKI LIASGVRYDI AVEDPRYIKE LATHHVGGYL KIAPEHTEEG PLSKMMKPGM
GSYDRFKELF DTYSKQAGKE QYLIPYFISA HPGTRDEDMV NLALWLKKHR FRLDQVQNFY
PSPLANSTTM YYTGKNPLAK IGYKSEDVFV PKGDKQRRLH KALLRYHDPA NWPLIRQALE
AMGKKHLIGS RRDCLVPAPT IEEMREARRQ NRNTRPALTK HTPMATQRQT PATAKKASST
QSRLQNAGAK KRPKAAVGR