Gene Information |
Locus tag | EcSMS35_4743 |
Symbol | |
ID | 6145706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4842267 |
End bp | 4843349 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619558 |
Product | putative permease |
Protein accession | YP_001746666 |
Protein GI | 170681768 |
COG category | [R] General function prediction only |
COG ID | [COG0795] Predicted permeases |
TIGRFAM ID | |
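
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions agree with the values in this record). The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4842267, 4843349)
print(length)                  # 1083
print(protein_length(length))  # 360
```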
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000687106 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.744445 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTT TTGGCGTACT TGACCGCTAT ATCGGTAAAA CTATTTTCAC CACCATCATG ATGACGCTGT TCATGCTGGT GTCGCTGTCG GGCATTATCA AGTTTGTCGA TCAGCTGAAA AAAGCCGGGC AGGGGAGTTA CGACGCGTTA GGCGCAGGAA TGTATACCTT GCTGAGCGTG CCGAAAGATG TGCAAATCTT CTTCCCGATG GCGGCTCTGC TTGGGGCGTT GCTTGGTCTT GGGATGCTGG CGCAGCGCAG CGAACTGGTA GTCATGCAGG CTTCTGGTTT TACCCGTATG CAGGTGGCGC TGTCGGTGAT GAAAACCGCC ATTCCGCTGG TCTTGCTGAC GATGGCGATT GGTGAATGGG TCGCGCCGCA GGGCGAGCAG ATGGCGCGTA ACTACCGTGC GCAGGCGATG TACGGCGGCT CGTTGCTCTC CACTCAGCAA GGCTTATGGG CGAAAGATGG CAACAACTTC GTCTACATTG AGCGGGTTAA AGGTGACGAA GAGTTAGGTG GCATCAGTAT TTATGCCTTT AACGAGAATC GTCGTCTGCA ATCCGTACGT TATGCCGCTA CCGCGAAGTT TGACCCGGAA CATAAAGTCT GGCGTCTGTC GCAGGTTGAT GAATCTGATC TGACCAATCC GAAACAGATC ACCGGTTCGC AGACGGTGAG CGGCACCTGG AAAACCAACC TCACGCCAGA CAAACTGGGC GTGGTGGCGC TGGACCCGGA TGCACTCTCC ATCAGTGGTT TGCACAACTA TGTGAAGTAT CTGAAGTCGA GCGGTCAGGA TGCCGGACGT TATCAGCTCA ACATGTGGAG CAAAATCTTC CAGCCGCTAT CCGTGGCGGT GATGATGCTG ATGGCGCTGT CGTTTATCTT TGGCCCACTG CGTAGCGTAC CGATGGGCGT GCGTGTGGTC ACCGGTATCA GCTTCGGTTT TGTCTTCTAC GTACTGGACC AGATCTTCGG CCCGCTGACG TTGGTTTATG GCATCCCGCC GATCATCGGC GCACTGTTGC CAAGCGCCAG CTTCTTCTTA ATCAGCCTGT GGCTGTTAAT GAGAAAATCG TAA
|
Protein sequence | MQPFGVLDRY IGKTIFTTIM MTLFMLVSLS GIIKFVDQLK KAGQGSYDAL GAGMYTLLSV PKDVQIFFPM AALLGALLGL GMLAQRSELV VMQASGFTRM QVALSVMKTA IPLVLLTMAI GEWVAPQGEQ MARNYRAQAM YGGSLLSTQQ GLWAKDGNNF VYIERVKGDE ELGGISIYAF NENRRLQSVR YAATAKFDPE HKVWRLSQVD ESDLTNPKQI TGSQTVSGTW KTNLTPDKLG VVALDPDALS ISGLHNYVKY LKSSGQDAGR YQLNMWSKIF QPLSVAVMML MALSFIFGPL RSVPMGVRVV TGISFGFVFY VLDQIFGPLT LVYGIPPIIG ALLPSASFFL ISLWLLMRKS
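
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. It uses only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Alternative start codons are not handled here; that is acceptable for this record because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = clean("ATGCAACCTT TTGGCGTACT TGACCGCTAT")
print(translate(prefix))  # MQPFGVLDRY
```

Running `translate(clean(...))` on the full gene sequence and comparing the result with the cleaned protein sequence is one way to confirm that the two fields agree; `gc_content` can be compared with the GC content field in the same way.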