Gene EcSMS35_4298 details


Gene Information

Locus tag: EcSMS35_4298
Symbol: rhaT
ID: 6144337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4401919
End bp: 4402953
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 52%
IMG OID: 641619120
Product: rhamnose-proton symporter
Protein accession: YP_001746244
Protein GI: 170683644
COG category:
COG ID:
TIGRFAM ID: [TIGR00776] RhaT L-rhamnose-proton symporter family protein
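
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record; treated as 1-based and inclusive (assumption).
start_bp = 4401919
end_bp = 4402953

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codon_count = gene_length // 3        # complete codons in the CDS
protein_length = codon_count - 1      # assumes the last codon is a stop codon

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the 1035 bp and 344 aa listed in the record.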


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.871805
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0209281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACG CGATTACGAT GGGGATATTT TGGCATTTGA TCGGCGCGGC CAGTGCAGCC 
TGTTTTTACG CTCCGTTCAA AAAAGTAAAA AAATGGTCAT GGGAAACCAT GTGGTCAGTT
GGTGGGATTG TTTCGTGGAT TATTCTGCCG TGGGCCATCA GCGCCCTGTT ACTGCCGAAT
TTCTGGGCGT ATTACAGCTC GTTTAGTCTC TCTACGCTGC TGCCTGTTTT TCTGTTCGGC
GCTATGTGGG GGATCGGTAA TATCAACTAC GGCCTGACCA TGCGTTATCT CGGCATGTCG
ATGGGAATTG GCATCGCCAT TGGCATTACG TTGATTGTCG GTACGCTGAT GACGCCAATT
ATCAACGGCA ATTTCGATGT TCTGATTAAT ACCGAAGGCG GACGCATGAC GTTGCTCGGC
GTTCTGGTGG CGCTGATTGG CGTAGGGATT GTGACTCGCG CCGGGCAGTT GAAAGAGCGC
AAGATGGGCA TTAAAGCCGA AGAGTTCAAC CTGAAAAAAG GGCTGGTGCT GGCGGTGATG
TGCGGCATTT TCTCTGCCGG GATGTCCTTT GCGATGAACG CCGCAAAACC GATGCATGAA
GCCGCTGCTG CACTGGGCGT CGATCCACTG TATGTCGCTC TGCCAAGCTA TGTTGTCATC
ATGGGAGGCG GCGCGATCAT CAACCTCGGT TTCTGCTTCA TTCGTCTGGC AAAAGTGAAG
GATTTGTCGC TAAAAGCCGA CTTTTCGCTG GCAAAACCGC TAATCATTCA CAACGTGTTA
CTCTCGGCAC TGGGCGGTTT GATGTGGTAT CTGCAATTCT TTTTCTATGC CTGGGGCCAC
GCCCGCATTC CGGCGCAGTA TGACTACATC AGCTGGATGC TGCATATGAG CTTCTATGTA
TTGTGCGGCG GTATCGTCGG GCTGGTGCTG AAAGAGTGGA ACAATGCAGG CCGCCGTCCG
GTAACGGTGT TGAGCCTCGG TTGTGTGGTG ATTATTGTCG CCGCCAACAT CGTCGGCATG
GGCATGGCGA ATTAA
 
Protein sequence
MSNAITMGIF WHLIGAASAA CFYAPFKKVK KWSWETMWSV GGIVSWIILP WAISALLLPN 
FWAYYSSFSL STLLPVFLFG AMWGIGNINY GLTMRYLGMS MGIGIAIGIT LIVGTLMTPI
INGNFDVLIN TEGGRMTLLG VLVALIGVGI VTRAGQLKER KMGIKAEEFN LKKGLVLAVM
CGIFSAGMSF AMNAAKPMHE AAAALGVDPL YVALPSYVVI MGGGAIINLG FCFIRLAKVK
DLSLKADFSL AKPLIIHNVL LSALGGLMWY LQFFFYAWGH ARIPAQYDYI SWMLHMSFYV
LCGGIVGLVL KEWNNAGRRP VTVLSLGCVV IIVAANIVGM GMAN
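
The protein sequence above corresponds to the gene sequence read in codons under translation table 11. The following Python sketch illustrates that relationship on the first and last blocks of the gene sequence. It is a minimal example, not the tool used to build this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built from the standard amino acid assignments that table 11 uses, and alternative start codons are not handled.

```python
# Standard codon-to-amino-acid assignments (as used by translation table 11),
# listed in the conventional T, C, A, G base order. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks and last two blocks of the gene sequence, as shown above.
first_blocks = "ATGAGTAACG CGATTACGAT GGGGATATTT"
last_blocks = "GGCATGGCGA ATTAA"

print(translate(first_blocks))  # MSNAITMGIF
print(translate(last_blocks))   # GMAN
```

The two fragments translate to the start (MSNAITMGIF) and end (GMAN) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content reported in the record.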