Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4298 |
Symbol | rhaT |
ID | 6144337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4401919 |
End bp | 4402953 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619120 |
Product | rhamnose-proton symporter |
Protein accession | YP_001746244 |
Protein GI | 170683644 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00776] RhaT L-rhamnose-proton symporter family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.871805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0209281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACG CGATTACGAT GGGGATATTT TGGCATTTGA TCGGCGCGGC CAGTGCAGCC TGTTTTTACG CTCCGTTCAA AAAAGTAAAA AAATGGTCAT GGGAAACCAT GTGGTCAGTT GGTGGGATTG TTTCGTGGAT TATTCTGCCG TGGGCCATCA GCGCCCTGTT ACTGCCGAAT TTCTGGGCGT ATTACAGCTC GTTTAGTCTC TCTACGCTGC TGCCTGTTTT TCTGTTCGGC GCTATGTGGG GGATCGGTAA TATCAACTAC GGCCTGACCA TGCGTTATCT CGGCATGTCG ATGGGAATTG GCATCGCCAT TGGCATTACG TTGATTGTCG GTACGCTGAT GACGCCAATT ATCAACGGCA ATTTCGATGT TCTGATTAAT ACCGAAGGCG GACGCATGAC GTTGCTCGGC GTTCTGGTGG CGCTGATTGG CGTAGGGATT GTGACTCGCG CCGGGCAGTT GAAAGAGCGC AAGATGGGCA TTAAAGCCGA AGAGTTCAAC CTGAAAAAAG GGCTGGTGCT GGCGGTGATG TGCGGCATTT TCTCTGCCGG GATGTCCTTT GCGATGAACG CCGCAAAACC GATGCATGAA GCCGCTGCTG CACTGGGCGT CGATCCACTG TATGTCGCTC TGCCAAGCTA TGTTGTCATC ATGGGAGGCG GCGCGATCAT CAACCTCGGT TTCTGCTTCA TTCGTCTGGC AAAAGTGAAG GATTTGTCGC TAAAAGCCGA CTTTTCGCTG GCAAAACCGC TAATCATTCA CAACGTGTTA CTCTCGGCAC TGGGCGGTTT GATGTGGTAT CTGCAATTCT TTTTCTATGC CTGGGGCCAC GCCCGCATTC CGGCGCAGTA TGACTACATC AGCTGGATGC TGCATATGAG CTTCTATGTA TTGTGCGGCG GTATCGTCGG GCTGGTGCTG AAAGAGTGGA ACAATGCAGG CCGCCGTCCG GTAACGGTGT TGAGCCTCGG TTGTGTGGTG ATTATTGTCG CCGCCAACAT CGTCGGCATG GGCATGGCGA ATTAA
|
Protein sequence | MSNAITMGIF WHLIGAASAA CFYAPFKKVK KWSWETMWSV GGIVSWIILP WAISALLLPN FWAYYSSFSL STLLPVFLFG AMWGIGNINY GLTMRYLGMS MGIGIAIGIT LIVGTLMTPI INGNFDVLIN TEGGRMTLLG VLVALIGVGI VTRAGQLKER KMGIKAEEFN LKKGLVLAVM CGIFSAGMSF AMNAAKPMHE AAAALGVDPL YVALPSYVVI MGGGAIINLG FCFIRLAKVK DLSLKADFSL AKPLIIHNVL LSALGGLMWY LQFFFYAWGH ARIPAQYDYI SWMLHMSFYV LCGGIVGLVL KEWNNAGRRP VTVLSLGCVV IIVAANIVGM GMAN
|
| |