Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3300 |
Symbol | |
ID | 6145481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3374020 |
End bp | 3376239 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618130 |
Product | hypothetical protein |
Protein accession | YP_001745280 |
Protein GI | 170681820 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0672097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCTA TCTCCCTGAT CCAACCGGAT CGCGACCTGT TCTCCTGGCC GCAGTACTGG GCCGCCTGTT TTGGACCGGC ACCGTTTTTG CCGATGTCAC GTGAAGAGAT GGATCAACTT GGCTGGGATA GCTGCGACAT CATTTTGGTT ACTGGCGACG CGTATGTCGA TCACCCAAGC TTCGGGATGG CGATTTGCGG TCGTATGCTG GAAGCGCAGG GCTTTCGCGT CGGGATCATC GCCCAGCCAG ACTGGAGCAG CAAAGACGAC TTTATGCGTC TGGGTAAACC GAATCTGTTT TTCGGCGTTA CTGCTGGCAA CATGGACTCG ATGATCAACC GCTATACCGC CGATCGCCGT TTACGTCATG ACGATGCCTA CACGCCAGAT AATGTCGCGG GTAAGCGTCC GGATCGCGCC ACACTGGTTT ATACCCAACG TTGTAAAGAG GCGTGGAAAG ATGTGCCGGT GATCCTCGGC GGTATTGAGG CCAGCCTGCG CCGTACCGCG CATTATGATT ACTGGTCCGA TACCGTGCGC CGTTCCGTGT TGGTGGATTC GAAAGCCGAC ATGCTGATGT TTGGTAACGG TGAGCGTCCG CTGGTGGAAG TGGCGCACCG TCTGGCGATG GGCGAGCCGA TTAGTGAAAT CCGCGATGTG CGTAATACCG CGATTATCGT AAAAGAGGCG TTGCCAGGCT GGAGCGGCGT GGATTCCACC CGTCTTGATA CCCCAGGGAA AATCGACCCA ATCCCGCATC CGTATGGCGA AGATTTGCCG TGCGCGGATA ACAAACCGGT AGCTCCGAAA AAGCAGGAAG CCAAAGCTGT AACCGTGCAG CCACCACGCC CGAAACCGTG GGAAAAAACC TACGTGTTGC TGCCTTCTTT CGAGAAAGTG AAGGGCGATA AAGTGCTGTA CGCCCATGCT TCGCGCATTC TGCACCACGA AACTAACCCA GGCTGCGCCC GCGCATTGAT GCAAAAACAT GGCGACCGCT ATGTATGGAT CAACCCGCCT GCTATTCCGC TTTCTACCGA AGAGATGGAT AGCGTCTTTG CGCTGCCGTA CAAGCGCGTG CCGCATCCGG CTTACGGTAA TGCCCGAATT CCGGCTTACG AAATGATTCG TTTCTCGGTC AACATTATGC GTGGCTGCTT TGGCGGCTGC TCTTTCTGTT CTATTACCGA GCACGAAGGG CGCATTATTC AGAGCCGTTC CGAAGATTCG ATTATTAATG AGATCGAAGC GATCCGCGAC ACTGTTCCAG GTTTTACGGG CGTGATTTCC GATCTCGGTG GGCCTACTGC CAACATGTAT ATGTTGCGCT GCAAATCGCC ACGCGCTGAG CAAACCTGCC GTCGTTTGTC GTGCGTTTAT CCGGATATTT GTCCGCACAT GGACACTAAC CATGAACCGA CCATCAACCT CTATCGCCGC GCTCGTGATC TGAAAGGCAT TAAAAAGATC CTCATCGCCT CTGGTGTGCG TTATGACATC GCCGTAGAAG ATCCGCGCTA TATCAAAGAG CTGGCGACCC ATCACGTCGG CGGTTATCTG AAGATTGCCC CGGAACATAC CGAAGAAGGG CCGTTATCGA AGATGATGAA GCCGGGCATG GGCAGCTATG ACCGCTTTAA AGAGCTGTTC GATACTTACT CGAAACAGGC AGGTAAAGAA CAGTATCTGA TCCCGTATTT CATCTCCGCG CACCCGGGTA CGCGTGATGA AGATATGGTG AATCTGGCGC TGTGGCTGAA AAAGCATCGT TTCCGTCTCG ACCAGGTACA GAACTTCTAC CCATCGCCGC TGGCTAACTC GACCACCATG TATTACACCG GGAAAAACCC GCTGGCGAAG ATTGGTTATA AGAGTGAAGA CGTCTTCGTA CCGAAGGGCG ACAAACAGCG TCGTTTGCAT AAAGCGTTGT TGCGTTACCA CGATCCGGCA AACTGGCCGT TAATCCGTCA GGCGCTGGAA GCGATGGGCA AAAAGCATCT GATTGGCAGC CGTCGCGATT GCTTAGTGCC TGCGCCAACC ATTGAAGAGA TGCGTGAAGC TCGCCGCCAG AACCGCAATA CCCGTCCGGC GTTGACTAAA CATACGCCGA TGGCGACCCA GCGTCAGACG CCTGCTACGG CAAAAAAAGC GTCGTCTACG CAATCTCGCC TGCAGAATGC TGGTGCGAAG AAACGCCCTA AAGCGGCGGT TGGACGTTAA
|
Protein sequence | MSSISLIQPD RDLFSWPQYW AACFGPAPFL PMSREEMDQL GWDSCDIILV TGDAYVDHPS FGMAICGRML EAQGFRVGII AQPDWSSKDD FMRLGKPNLF FGVTAGNMDS MINRYTADRR LRHDDAYTPD NVAGKRPDRA TLVYTQRCKE AWKDVPVILG GIEASLRRTA HYDYWSDTVR RSVLVDSKAD MLMFGNGERP LVEVAHRLAM GEPISEIRDV RNTAIIVKEA LPGWSGVDST RLDTPGKIDP IPHPYGEDLP CADNKPVAPK KQEAKAVTVQ PPRPKPWEKT YVLLPSFEKV KGDKVLYAHA SRILHHETNP GCARALMQKH GDRYVWINPP AIPLSTEEMD SVFALPYKRV PHPAYGNARI PAYEMIRFSV NIMRGCFGGC SFCSITEHEG RIIQSRSEDS IINEIEAIRD TVPGFTGVIS DLGGPTANMY MLRCKSPRAE QTCRRLSCVY PDICPHMDTN HEPTINLYRR ARDLKGIKKI LIASGVRYDI AVEDPRYIKE LATHHVGGYL KIAPEHTEEG PLSKMMKPGM GSYDRFKELF DTYSKQAGKE QYLIPYFISA HPGTRDEDMV NLALWLKKHR FRLDQVQNFY PSPLANSTTM YYTGKNPLAK IGYKSEDVFV PKGDKQRRLH KALLRYHDPA NWPLIRQALE AMGKKHLIGS RRDCLVPAPT IEEMREARRQ NRNTRPALTK HTPMATQRQT PATAKKASST QSRLQNAGAK KRPKAAVGR
|
| |