Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3114 |
Symbol | rafA |
ID | 6146988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3199851 |
End bp | 3201977 |
Gene Length | 2127 bp |
Protein Length | 708 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617981 |
Product | alpha-galactosidase |
Protein accession | YP_001745131 |
Protein GI | 170682126 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.024409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTCAA AGTACTGCAG ACTGAGCAGT CCTCGCTCTG ATTTAATTAT TAAAACCCGT CCACATGCAG AAATTATCTG GTGGGGCTCT GCACTGAAAC ATTTCTCACC GGATGACTGT GCCAGCCTGG AAAGACCGGT TGCGAATGGT CGTCTGGATA TTGATACGCC ACTAACACTG ATGGCTGAAA ATGCCCTGGG ACTTTTTAGC TCTCCGGGAC TGGAAGGACA CAGGAATGGG CTGGATGCAT CTCCTGTTTT TTATACAGTT GACGTGGAAC ATACCGAAAA CACTCTGAGA CTTACCAGTG AAGATTCGGT AGCCGGCCTG CGTCTGGTCA GCGAGCTGGT GATGACGCCA TCAGGAATTC TGAAAGTTCG TCATGCACTG ACCAACCTCA GAGAGGGGGA CTGGCAGATA AATCGTTTCG CCATCACTTT ACCTTTAGCG GAACGTGCGG AAGAAGTCAT GGCTTTTCAC GGACGCTGGA CTCGTGAATT TCAGCCGCAC AGGGTACGTC TCACTCATGA TGCTTTTGTT CTGGAAAATC GCAGAGGGCG GACATCTCAT GAGCATTTTC CGGCGCTGAT TGTCGGCACG CCAGGCTTCT CGGAACAACA GGGAGAGGTG TGGGCTGTGC ATCTGGGGTG GAGTGGAAAT CACCGCATGA GATGTGAGGC AAAAACTGAC GGCAGGCGTT ACGTTCAGGC TGAGGCTCTG TGGATGCCGG GTGAGAAGGC TCTCAGGAAG AATGAAACCC TGTACACCCC GTGGCTATAT GCCTGCCACT CTGCGGATGG CCTGAATGGA ATGAGTCAGC AATACCATCG TTTTTTGCGT GATGAAATTA TCCGTTTCCC TGAGCAAAAA CCCCGCCCTG TACATCTCAA TACCTGGGAA GGTATTTATT TCAATCACAA TCCTGATTAC ATCATGCAGA TGGCTGAGCG TGCAGCAGCA CTGGGCGTTG AACGTTTCAT TATTGATGAT GGCTGGTTTA AAGGACGTAA CGATGACCGC GCGGCTCTGG GCGACTGGTA TACCGATGAA CAGAAATACC CGAACGGGCT GATGCCGGTT ATTAAACATG TGAAATCTCT CGGTATGGAA TTTGGCATAT GGGTTGAGCC GGAAATGATT AACCCGGATT CTGACCTGTT TCGTCTTCAT CCTGACTGGG TATTGTCAAT GCCTGGATAT TCCCAGCCAA CCGGAAGATA TCAGTATGTT CTTAACCTGA ATATTCCAGA GGCCTTTGCT TACATTTATG AACGTTTCTT ATGGTTACTG GGAGAACATC CGGTTGATTA TGTGAAATGG GACATGAATC GTGAGCTTGT ACAGGCAGGG CATGAAGGCC GTGCGGCAGC AGATGCACAG ACCCGTCAGT TCTATCGATT GCTTGATCTC CTCCGTGAAC GTTTTCCACA TGTTGAGTTT GAGTCCTGTG CTTCCGGTGG GGGGCGTATT GACTTCGAAG TCCTGAAACG CACACACCGG TTCTGGGCAT CTGACAATAA TGATGCCCTG GAGCGCTGCA CCATACAACG TGGCATGAGT TACTTTTTCC CTCCTGAGGT GATGGGGGCG CATATTGGCC ATCGCCGCTG CCATGCAACT TTCCGGCAGC ACAGCATCGC TTTTCGTGGG CTGACGGCAT TGTTCGGCCA TATGGGGCTG GAGCTGGATC CGGTGGCCGC AGATGCGAAG GAATCTGACG GTTATCGCCG GTATGCCTTG CTCTATAAAG AATGGCGACA ACTGATTCAT ACAGGAGTTC TCTGGCGTGT GGATATGCCA GATCCTTCGA TACAGGTTCA GGGAGTCGTC AGCCCTGATC AGTCTCAGGC ACTTTTTATG ATCAGCCAGC TTGCAATGCC GGATTACACC TTACCAGGCA TACTTCGTTT TCCCGGTCTG GCGGCGGAAG TGCGTTACCG GCTTCGGGTT ATTGATCACC CGGACCTCCA GGTGGTTGGT GAAGGCGGTC ATACCATGCG CAAATTACCT GTCTGGATGA ATCAGAGCCT TGAGGCCAGT GGTGAATGGC TGGCACAGGG AGGGATTCAG CTCCCCGTAC TGGATCCGGA GAGTGCGATT TTGATAGCAC TTGAAAGAGC TGTGTGA
|
Protein sequence | MVSKYCRLSS PRSDLIIKTR PHAEIIWWGS ALKHFSPDDC ASLERPVANG RLDIDTPLTL MAENALGLFS SPGLEGHRNG LDASPVFYTV DVEHTENTLR LTSEDSVAGL RLVSELVMTP SGILKVRHAL TNLREGDWQI NRFAITLPLA ERAEEVMAFH GRWTREFQPH RVRLTHDAFV LENRRGRTSH EHFPALIVGT PGFSEQQGEV WAVHLGWSGN HRMRCEAKTD GRRYVQAEAL WMPGEKALRK NETLYTPWLY ACHSADGLNG MSQQYHRFLR DEIIRFPEQK PRPVHLNTWE GIYFNHNPDY IMQMAERAAA LGVERFIIDD GWFKGRNDDR AALGDWYTDE QKYPNGLMPV IKHVKSLGME FGIWVEPEMI NPDSDLFRLH PDWVLSMPGY SQPTGRYQYV LNLNIPEAFA YIYERFLWLL GEHPVDYVKW DMNRELVQAG HEGRAAADAQ TRQFYRLLDL LRERFPHVEF ESCASGGGRI DFEVLKRTHR FWASDNNDAL ERCTIQRGMS YFFPPEVMGA HIGHRRCHAT FRQHSIAFRG LTALFGHMGL ELDPVAADAK ESDGYRRYAL LYKEWRQLIH TGVLWRVDMP DPSIQVQGVV SPDQSQALFM ISQLAMPDYT LPGILRFPGL AAEVRYRLRV IDHPDLQVVG EGGHTMRKLP VWMNQSLEAS GEWLAQGGIQ LPVLDPESAI LIALERAV
|
| |