Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4689 |
Symbol | |
ID | 6144451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4788034 |
End bp | 4788999 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619505 |
Product | hypothetical protein |
Protein accession | YP_001746613 |
Protein GI | 170680592 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.16874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAGCG GCGTGCTGTA CGCCCTATTA GCAGGGTTGA TGTGGGGGCT TATTTTTGTC GGGCCGTTGA TCGTGCCGGA ATACCCGGCG ATGTTGCAGT CGATGGGGCG TTATCTGGCG TTAGGGTTAA TTGCGCTACC CATTGCCTGG CTGGGACGCG TGCGTCTGCG TCAGTTGGCG CGTCGCGACT GGCTTACCGC CTTGATGCTC ACTATGATGG GCAACCTCAT CTATTACTTC TGCCTTGCCA GTGCCATTCA ACGTACTGGC GCGCCTGTTT CCACGATGAT TATCGGCACC CTGCCGGTGG TGATTCCCGT TTTTGCCAAT CTGCTTTATA GCCAGCGCGA CGGCAAACTC GCGTGGGGAA AACTCGCCCC GGCACTGATT TGTATTGGCA TCGGCCTGGC GTGTGTGAAT ATTGCTGAGT TAAACCACGG ACTCCCCGAT TTTGACTGGG CACGTTATAC CTCAGGCATC GTGCTAGCGT TAGTTTCCGT GGTCTGCTGG GCATGGTATG CCCTGCGCAA CGCCCGCTGG CTGCGGGAAA ATCCCGACAA ACATCCAATG ATGTGGGCGA CGGCGCAGGC GCTGGTCACG CTGCCGGTTT CTCTCATCGG CTATCTCGTC GCCTGTTACT GGCTGAATAT GCAAACGCCG GACTTCTCCT TACCCTTTGG CCCCCGTCCG CTGGTGTTTA TTAGTCTGAT GGTTGCGATA GCCGTGCTTT GCTCATGGGT TGGCGCACTC TGCTGGAACG TCGCCAGCCA GCGATTACCG ACAGTGATTC TCGGGCCGCT GATTGTTTTC GAAACGCTGG CAGGTTTGCT GTACACCTTT TTACTCCGCC AGCAAATGCC GCCGCTAATG ACGCTGAGCG GTATCGCGCT GTTAGTGATT GGCGTGGTCA TCGCGGTCAG AGCAAAACCA GAAAAACCTT TAACTGAATC TGTCTCAGAA AGTTGA
|
Protein sequence | MISGVLYALL AGLMWGLIFV GPLIVPEYPA MLQSMGRYLA LGLIALPIAW LGRVRLRQLA RRDWLTALML TMMGNLIYYF CLASAIQRTG APVSTMIIGT LPVVIPVFAN LLYSQRDGKL AWGKLAPALI CIGIGLACVN IAELNHGLPD FDWARYTSGI VLALVSVVCW AWYALRNARW LRENPDKHPM MWATAQALVT LPVSLIGYLV ACYWLNMQTP DFSLPFGPRP LVFISLMVAI AVLCSWVGAL CWNVASQRLP TVILGPLIVF ETLAGLLYTF LLRQQMPPLM TLSGIALLVI GVVIAVRAKP EKPLTESVSE S
|
| |