Gene EcSMS35_3114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3114 
SymbolrafA 
ID6146988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3199851 
End bp3201977 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content51% 
IMG OID641617981 
Productalpha-galactosidase 
Protein accessionYP_001745131 
Protein GI170682126 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.024409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCAA AGTACTGCAG ACTGAGCAGT CCTCGCTCTG ATTTAATTAT TAAAACCCGT 
CCACATGCAG AAATTATCTG GTGGGGCTCT GCACTGAAAC ATTTCTCACC GGATGACTGT
GCCAGCCTGG AAAGACCGGT TGCGAATGGT CGTCTGGATA TTGATACGCC ACTAACACTG
ATGGCTGAAA ATGCCCTGGG ACTTTTTAGC TCTCCGGGAC TGGAAGGACA CAGGAATGGG
CTGGATGCAT CTCCTGTTTT TTATACAGTT GACGTGGAAC ATACCGAAAA CACTCTGAGA
CTTACCAGTG AAGATTCGGT AGCCGGCCTG CGTCTGGTCA GCGAGCTGGT GATGACGCCA
TCAGGAATTC TGAAAGTTCG TCATGCACTG ACCAACCTCA GAGAGGGGGA CTGGCAGATA
AATCGTTTCG CCATCACTTT ACCTTTAGCG GAACGTGCGG AAGAAGTCAT GGCTTTTCAC
GGACGCTGGA CTCGTGAATT TCAGCCGCAC AGGGTACGTC TCACTCATGA TGCTTTTGTT
CTGGAAAATC GCAGAGGGCG GACATCTCAT GAGCATTTTC CGGCGCTGAT TGTCGGCACG
CCAGGCTTCT CGGAACAACA GGGAGAGGTG TGGGCTGTGC ATCTGGGGTG GAGTGGAAAT
CACCGCATGA GATGTGAGGC AAAAACTGAC GGCAGGCGTT ACGTTCAGGC TGAGGCTCTG
TGGATGCCGG GTGAGAAGGC TCTCAGGAAG AATGAAACCC TGTACACCCC GTGGCTATAT
GCCTGCCACT CTGCGGATGG CCTGAATGGA ATGAGTCAGC AATACCATCG TTTTTTGCGT
GATGAAATTA TCCGTTTCCC TGAGCAAAAA CCCCGCCCTG TACATCTCAA TACCTGGGAA
GGTATTTATT TCAATCACAA TCCTGATTAC ATCATGCAGA TGGCTGAGCG TGCAGCAGCA
CTGGGCGTTG AACGTTTCAT TATTGATGAT GGCTGGTTTA AAGGACGTAA CGATGACCGC
GCGGCTCTGG GCGACTGGTA TACCGATGAA CAGAAATACC CGAACGGGCT GATGCCGGTT
ATTAAACATG TGAAATCTCT CGGTATGGAA TTTGGCATAT GGGTTGAGCC GGAAATGATT
AACCCGGATT CTGACCTGTT TCGTCTTCAT CCTGACTGGG TATTGTCAAT GCCTGGATAT
TCCCAGCCAA CCGGAAGATA TCAGTATGTT CTTAACCTGA ATATTCCAGA GGCCTTTGCT
TACATTTATG AACGTTTCTT ATGGTTACTG GGAGAACATC CGGTTGATTA TGTGAAATGG
GACATGAATC GTGAGCTTGT ACAGGCAGGG CATGAAGGCC GTGCGGCAGC AGATGCACAG
ACCCGTCAGT TCTATCGATT GCTTGATCTC CTCCGTGAAC GTTTTCCACA TGTTGAGTTT
GAGTCCTGTG CTTCCGGTGG GGGGCGTATT GACTTCGAAG TCCTGAAACG CACACACCGG
TTCTGGGCAT CTGACAATAA TGATGCCCTG GAGCGCTGCA CCATACAACG TGGCATGAGT
TACTTTTTCC CTCCTGAGGT GATGGGGGCG CATATTGGCC ATCGCCGCTG CCATGCAACT
TTCCGGCAGC ACAGCATCGC TTTTCGTGGG CTGACGGCAT TGTTCGGCCA TATGGGGCTG
GAGCTGGATC CGGTGGCCGC AGATGCGAAG GAATCTGACG GTTATCGCCG GTATGCCTTG
CTCTATAAAG AATGGCGACA ACTGATTCAT ACAGGAGTTC TCTGGCGTGT GGATATGCCA
GATCCTTCGA TACAGGTTCA GGGAGTCGTC AGCCCTGATC AGTCTCAGGC ACTTTTTATG
ATCAGCCAGC TTGCAATGCC GGATTACACC TTACCAGGCA TACTTCGTTT TCCCGGTCTG
GCGGCGGAAG TGCGTTACCG GCTTCGGGTT ATTGATCACC CGGACCTCCA GGTGGTTGGT
GAAGGCGGTC ATACCATGCG CAAATTACCT GTCTGGATGA ATCAGAGCCT TGAGGCCAGT
GGTGAATGGC TGGCACAGGG AGGGATTCAG CTCCCCGTAC TGGATCCGGA GAGTGCGATT
TTGATAGCAC TTGAAAGAGC TGTGTGA
 
Protein sequence
MVSKYCRLSS PRSDLIIKTR PHAEIIWWGS ALKHFSPDDC ASLERPVANG RLDIDTPLTL 
MAENALGLFS SPGLEGHRNG LDASPVFYTV DVEHTENTLR LTSEDSVAGL RLVSELVMTP
SGILKVRHAL TNLREGDWQI NRFAITLPLA ERAEEVMAFH GRWTREFQPH RVRLTHDAFV
LENRRGRTSH EHFPALIVGT PGFSEQQGEV WAVHLGWSGN HRMRCEAKTD GRRYVQAEAL
WMPGEKALRK NETLYTPWLY ACHSADGLNG MSQQYHRFLR DEIIRFPEQK PRPVHLNTWE
GIYFNHNPDY IMQMAERAAA LGVERFIIDD GWFKGRNDDR AALGDWYTDE QKYPNGLMPV
IKHVKSLGME FGIWVEPEMI NPDSDLFRLH PDWVLSMPGY SQPTGRYQYV LNLNIPEAFA
YIYERFLWLL GEHPVDYVKW DMNRELVQAG HEGRAAADAQ TRQFYRLLDL LRERFPHVEF
ESCASGGGRI DFEVLKRTHR FWASDNNDAL ERCTIQRGMS YFFPPEVMGA HIGHRRCHAT
FRQHSIAFRG LTALFGHMGL ELDPVAADAK ESDGYRRYAL LYKEWRQLIH TGVLWRVDMP
DPSIQVQGVV SPDQSQALFM ISQLAMPDYT LPGILRFPGL AAEVRYRLRV IDHPDLQVVG
EGGHTMRKLP VWMNQSLEAS GEWLAQGGIQ LPVLDPESAI LIALERAV