Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0874 |
Symbol | |
ID | 6143591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 880766 |
End bp | 882451 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615762 |
Product | hypothetical protein |
Protein accession | YP_001742954 |
Protein GI | 170683131 |
COG category | [R] General function prediction only |
COG ID | [COG2985] Predicted permease |
TIGRFAM ID | [TIGR01625] AspT/YidE/YbjL antiporter duplication domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.938397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATAA ACGTCGCCGA ATTGTTAAAT GGGAATTACA TTCTGTTATT ATTTGTGGTC CTCGCGCTTG GGCTATGTCT CGGGAAATTA CGACTTGGTT CGATCCAACT GGGTAATTCC ATTGGCGTTT TAGTCGTATC GCTGTTATTA GGCCAACAAC ATTTCAGCAT TAACACCGAC GCGCTTAATC TTGGCTTTAT GCTGTTTATT TTCTGCGTCG GGGTCGAAGC CGGACCGAAC TTTTTTTCCA TTTTTTTTCG CGATGGGAAA AATTACCTAA TGTTAGCACT GGTGATGGTT GGCAGTGCGC TGGTCATCGC CTTAGGGTTA GGTAAGCTGT TTGGCTGGGA TATTGGCCTG ACGGCCGGTA TGTTAGCAGG CTCTATGACG TCAACACCGG TTTTGGTCGG TGCTGGCGAT ACACTGCGTC ATTCCGGCAT GGAAAGCAGG CAGCTCTCAC TGGCACTGGA TAATCTGAGC CTCGGGTATG CCTTAACCTA TTTAATCGGT CTGGTGAGTT TGATTGTTGG TGCGCGTTAC TTGCCGAAAT TGCAGCATCA GGACTTACAG ACCAGCGCCC AGCAAATCGC CCGCGAACGT GGCCTGGACA CAGATGCCAA CCGTAAGGTT TATTTGCCGG TGATCCGCGC CTACCGCGTC GGCCCGGAGC TGGTGGCCTG GACCGACGGC AAAAATCTGC GTGAACTGGG TATTTATCGA CAAACCGGCT GCTACATTGA ACGTATTCGA CGTAACGGGA TTCTGGCAAA TCCAGACGGT GATGCCGTGC TACAAATGGG CGATGAAATA GCGTTGGTAG GCTATCCCGA CGCCCACGCC CGACTCGATC CCAGTTTCCG TAATGGCAAA GAAGTTTTCG ATCGTGACCT TCTCGACATG CGTATCGTCA CTGAAGAAGT GGTCGTTAAA AACCATAACG CTGTAGGCAA ACGTCTCGCA CAACTGAAGT TGACCGATCA CGGTTGCTTC CTTAACCGCG TCATTCGTAG CCAGATTGAG ATGCCGATTG ATGACAACGT CGTGCTTAAC AAAGGTGACG TTTTACAAGT CAGCGGCGAT GCCCGTCGCG TAAAAACCAT CGCCGATCGC ATCGGCTTTA TCTCGATTCA CAGCCAGGTC ACTGACTTGC TGGCATTCTG CGCCTTCTTT GTGATTGGGC TGATGATCGG GATGATCACC TTCCAGTTCA GCACATTCAG TTTCGGCATG GGGAACGCTG CCGGGTTGTT ATTCGCCGGA ATTATGCTGG GCTTTATGCG TGCTAACCAC CCGACCTTCG GTTACATTCC GCAAGGTGCA TTAAGCATGG TGAAAGAGTT CGGCTTGATG GTGTTTATGG CAGGCGTTGG TCTGAGCGCC GGTAGCGGTA TTAATAACGG CCTGGGCGCG ATTGGCGGTC AAATGTTGAT TGCCGGATTG ATTGTCAGTC TGGTGCCAGT GGTTATCTGC TTCTTGTTCG GTGCTTATGT ATTGCGAATG AACCGCGCAC TGTTGTTCGG CGCAATGATG GGCGCACGCA CCTGCGCGCC GGCAATGGAG ATCATCAGTG ATACAGCTCG CAGTAACATC CCGGCGCTGG GCTATGCGGG CACCTATGCA ATCGCCAACG TCCTGCTGAC GCTAGCAGGG ACAATCATCG TCATGGTATG GCCAGGATTA GGATAA
|
Protein sequence | MNINVAELLN GNYILLLFVV LALGLCLGKL RLGSIQLGNS IGVLVVSLLL GQQHFSINTD ALNLGFMLFI FCVGVEAGPN FFSIFFRDGK NYLMLALVMV GSALVIALGL GKLFGWDIGL TAGMLAGSMT STPVLVGAGD TLRHSGMESR QLSLALDNLS LGYALTYLIG LVSLIVGARY LPKLQHQDLQ TSAQQIARER GLDTDANRKV YLPVIRAYRV GPELVAWTDG KNLRELGIYR QTGCYIERIR RNGILANPDG DAVLQMGDEI ALVGYPDAHA RLDPSFRNGK EVFDRDLLDM RIVTEEVVVK NHNAVGKRLA QLKLTDHGCF LNRVIRSQIE MPIDDNVVLN KGDVLQVSGD ARRVKTIADR IGFISIHSQV TDLLAFCAFF VIGLMIGMIT FQFSTFSFGM GNAAGLLFAG IMLGFMRANH PTFGYIPQGA LSMVKEFGLM VFMAGVGLSA GSGINNGLGA IGGQMLIAGL IVSLVPVVIC FLFGAYVLRM NRALLFGAMM GARTCAPAME IISDTARSNI PALGYAGTYA IANVLLTLAG TIIVMVWPGL G
|
| |