Gene EcolC_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4110 
Symbol 
ID6065918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4534775 
End bp4535809 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content52% 
IMG OID641603532 
Productrhamnose-proton symporter 
Protein accessionYP_001727035 
Protein GI170022081 
COG category 
COG ID 
TIGRFAM ID[TIGR00776] RhaT L-rhamnose-proton symporter family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACG CGATTACGAT GGGGATATTT TGGCATTTGA TCGGCGCGGC CAGTGCAGCC 
TGTTTTTACG CTCCGTTCAA AAAAGTAAAA AAATGGTCAT GGGAAACCAT GTGGTCAGTC
GGTGGGATTG TTTCGTGGAT TATTCTGCCG TGGGCCATCA GCGCCCTGTT ACTACCGAAT
TTCTGGGCGT ATTACAGCTC GTTTAGTCTC TCTACGCTAC TGCCTGTTTT TCTGTTCGGC
GCTATGTGGG GGATCGGTAA TATCAACTAC GGCCTGACCA TGCGTTATCT CGGCATGTCG
ATGGGAATTG GCATCGCCAT TGGCATTACG TTGATTGTCG GTACGCTGAT GACGCCAATT
ATCAACGGCA ATTTCGATGT GTTGATTAGC ACCGAAGGCG GACGCATGAC GTTGCTCGGC
GTTCTGGTGG CGCTGATTGG CGTAGGGATT GTAACTCGCG CCGGGCAGTT GAAAGAGCGC
AAGATGGGCA TTAAAGCCGA AGAGTTCAAT CTGAAAAAAG GGCTGGTGCT GGCGGTGATG
TGCGGCATTT TCTCTGCCGG GATGTCCTTT GCGATGAACG CCGCAAAACC GATGCATGAA
GCCGCTGCCG CACTTGGCGT CGATCCACTG TATGTCGCTC TGCCAAGCTA TGTTGTCATC
ATGGGCGGCG GCGCGATCAT TAACCTCGGT TTCTGTTTTA TTCGTCTGGC AAAAGTGAAG
GATTTGTCGC TAAAAGCCGA CTTCTCGCTG GCAAAATCGC TGATCATTCA CAATGTGTTA
CTCTCGACAC TGGGCGGGTT GATGTGGTAT CTGCAATTCT TTTTCTATGC CTGGGGCCAC
GCCCCCATTC CGGCGCAGTA TGACTACATC AGTTGGATGC TGCATATGAG TTTCTATGTA
TTGTGCGGCG GTATCGTCGG GCTGGTGCTG AAAGAGTGGA ACAATGCAGG ACGCCGTCCG
GTAACGGTGT TGAGCCTCGG TTGTGTGGTG ATTATTGTCG CCGCTAACAT CGTCGGCATC
GGCATGGCGA ATTAA
 
Protein sequence
MSNAITMGIF WHLIGAASAA CFYAPFKKVK KWSWETMWSV GGIVSWIILP WAISALLLPN 
FWAYYSSFSL STLLPVFLFG AMWGIGNINY GLTMRYLGMS MGIGIAIGIT LIVGTLMTPI
INGNFDVLIS TEGGRMTLLG VLVALIGVGI VTRAGQLKER KMGIKAEEFN LKKGLVLAVM
CGIFSAGMSF AMNAAKPMHE AAAALGVDPL YVALPSYVVI MGGGAIINLG FCFIRLAKVK
DLSLKADFSL AKSLIIHNVL LSTLGGLMWY LQFFFYAWGH APIPAQYDYI SWMLHMSFYV
LCGGIVGLVL KEWNNAGRRP VTVLSLGCVV IIVAANIVGI GMAN