Gene EcHS_A4137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4137 
SymbolrhaT 
ID5591248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4127500 
End bp4128534 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content51% 
IMG OID640923239 
Productrhamnose-proton symporter 
Protein accessionYP_001460698 
Protein GI157163380 
COG category 
COG ID 
TIGRFAM ID[TIGR00776] RhaT L-rhamnose-proton symporter family protein 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAACG CGATTACGAT GGGGATATTT TGGCATTTGA TCGGCGCGGC CAGTGCAGCC 
TGTTTTTACG CTCCGTTCAA AAAAGTAAAA AAATGGTCAT GGGAAACCAT GTGGTCAGTC
GGTGGGATTG TTTCGTGGAT TATTCTGCCG TGGGCCATCA GCGCCCTGTT ACTACCGAAT
TTCTGGGCGT ATTACAGCTC GTTTAGTCTC TCTACGCTAC TGCCTGTTTT TCTGTTCGGC
GCTATGTGTG GGATCGGTAA TATCAACTAC GGCCTGACCA TGCGTTATCT CGGCATGTCG
ATGGGAATTG GCATCGCCAT TGGCATTACG TTGATTGTCG GTACGCTGAT GACGCCAATT
ATCAACGGCA ATTTCGATGT GTTGATTAGC ACCGAAGGCG GACGCATGAC GTTGCTCGGC
GTTCTGGTGG CGCTGATTGG CGTAGGGATT GTAACTCGCG CCGGGCAGTT GAAAGAGCGC
AAGATGGGCA TTAAAGCCGA AGAGTTCAAT CTGAAAAAAG GGCTGGTGCT GGCGGTGATG
TGCGGCATTT TCTCTGCCGG GATGTCCTTT GCGATGAACG CCGCAAAACC GATGCATGAA
GCCGCTGCCG CACTTGGCGT CGATCCACTG TATGTCGCTC TGCCAAGCTA TGTTGTCATC
ATGGGCGGCG GCGCGATCAT TAACCTCGGT TTCTGTTTTA TTCGTCTGGC AAAAGTGAAG
GATTTGTCGC TAAAAGCCGA CTTCTCGCTG GCAAAATCGC TGATCATTCA CAATGTGTTA
CTCTCGACAC TGGGCGGGTT GATGTGGTAT CTGCAATTCT TTTTCTATGC CTGGGGCCAC
GCCCGCATTC CGGCGCAGTA TGACTACATC AGTTGGATGC TGCATATGAG TTTCTATGTA
TTGTGCGGCG GTATCGTCGG GCTGGTGCTG AAAGAGTGGA ACAATGCAGG ACGCCGTCCG
GTAACGGTGT TGAGCCTCGG TTGTGTGGTG ATTATTGTCG CCGCTAACAT CGTCGGCATC
GGCATGGCGA ATTAA
 
Protein sequence
MSNAITMGIF WHLIGAASAA CFYAPFKKVK KWSWETMWSV GGIVSWIILP WAISALLLPN 
FWAYYSSFSL STLLPVFLFG AMCGIGNINY GLTMRYLGMS MGIGIAIGIT LIVGTLMTPI
INGNFDVLIS TEGGRMTLLG VLVALIGVGI VTRAGQLKER KMGIKAEEFN LKKGLVLAVM
CGIFSAGMSF AMNAAKPMHE AAAALGVDPL YVALPSYVVI MGGGAIINLG FCFIRLAKVK
DLSLKADFSL AKSLIIHNVL LSTLGGLMWY LQFFFYAWGH ARIPAQYDYI SWMLHMSFYV
LCGGIVGLVL KEWNNAGRRP VTVLSLGCVV IIVAANIVGI GMAN