Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3265 |
Symbol | |
ID | 6145542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3343940 |
End bp | 3345010 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618095 |
Product | YjgP/YjgQ permease |
Protein accession | YP_001745245 |
Protein GI | 170683865 |
COG category | [R] General function prediction only |
COG ID | [COG0795] Predicted permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.342085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGG TTGAACACTA CATCATGCGT GGTACGCGCC GTCTGGTGCT GATTATCGTC GGTTTTCTGA TCTTTATTTT CGCCAGCTAC TCCGCACAGC GTTATCTGAC CGAAGCGGCA AATGGGACCT TAGCTCTTGA TGTTGTGCTG GATATCGTTT TCTACAAAGT GCTGATTGCA CTGGAGATGT TGTTACCTGT TGGTCTGTAT GTGTCAGTTG GCGTGACGCT AGGGCAGATG TATACCGACT CGGAAATTAC CGCTATCTCT GCGGCGGGGG GCAGTCCGGG ACGCTTGTAC AAAGCCGTTC TTTATCTGGC GATACCGCTA AGTATTTTTG TCACCCTTCT GTCGATGTAT GGTCGGCCGT GGGCTTATGC GCAGATTTAT CAACTGGAGC AACAGTCACA GTCGGAGCTG GATGTTCGCC AGTTGCGGGC AAAGAAATTT AACACTAACG ATAACGGACG AATGATCCTT TCGCAGACGG TTGATCAGGA TAATAATCGC CTGACTGACG CGCTGATTTA TACTTCTACT GCCAATCGAA CCCGCATTTT CCGCGCCCGT TCGGTTGATG TGGTTGACCC ATCACCTGAG AAACCGACCG TTATGTTGCA TAACGGGACC GCCTATCTTC TCGATCATCA GGGGCGTGAC GACAACGAAC AGATCTACCG TAATCTGCAA TTACATCTGA ATCCGCTGGA TCAAAGCCCT AACGTCAAAC GCAAAGCAAA ATCGGTCACG GAGCTGGCGC GCTCCGCCTT TCCTGCCGAT CATGCCGAAC TGCAATGGCG ACAAAGCCGT GGCCTGACAG CATTGTTGAT GGCGCTGCTG GCCATTTCAT TAAGTCGGGT AAAACCGCGG CAAGGGCGAT TTTCAACGTT ATTGCCACTG ACGTTGCTGT TTGTTGCCAT TTTTTATGGC GGCGACGTCT GCCGTACGCT GGTGGCTAAC GGTGCGATTC CCCTCATTCC TGGTTTGTGG TTAGTACCCG GACTCATGCT AATGGGCCTG CTGATGCTGG TCGCACGCGA CTTCTCTTTG CTGCAGAAAT TTTCCCGATG A
|
Protein sequence | MKLVEHYIMR GTRRLVLIIV GFLIFIFASY SAQRYLTEAA NGTLALDVVL DIVFYKVLIA LEMLLPVGLY VSVGVTLGQM YTDSEITAIS AAGGSPGRLY KAVLYLAIPL SIFVTLLSMY GRPWAYAQIY QLEQQSQSEL DVRQLRAKKF NTNDNGRMIL SQTVDQDNNR LTDALIYTST ANRTRIFRAR SVDVVDPSPE KPTVMLHNGT AYLLDHQGRD DNEQIYRNLQ LHLNPLDQSP NVKRKAKSVT ELARSAFPAD HAELQWRQSR GLTALLMALL AISLSRVKPR QGRFSTLLPL TLLFVAIFYG GDVCRTLVAN GAIPLIPGLW LVPGLMLMGL LMLVARDFSL LQKFSR
|
| |