Gene EcSMS35_1582 details

Gene Information

Locus tag: EcSMS35_1582
Symbol: uidA
ID: 6144065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1566599
End bp: 1568410
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 52%
IMG OID: 641616459
Product: beta-D-glucuronidase
Protein accession: YP_001743637
Protein GI: 170683754
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
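
The coordinate and length fields above are related by simple arithmetic, so they can be cross-checked. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1566599
END_BP = 1568410
GENE_LENGTH_BP = 1812
PROTEIN_LENGTH_AA = 603


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates should reproduce the listed gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1812 bp is 604 codons; dropping the stop codon leaves 603 residues.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 1568410 - 1566599 + 1 = 1812, and 1812 / 3 - 1 = 603.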


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACGTC CTGTAGAAAC CCCAACCCGT GAAATCAAAA AACTCGACGG CCTGTGGGCA 
TTCAGTCTGG ATCGCGAAAA CTGTGGAATT GATCAGCGTT GGTGGGAAAG CGCGTTACAA
GAAAGCCGGG CAATTGCTGT GCCGGGCAGT TTTAACGATC AGTTCGCCGA TGCAGATATT
CGTAATTATG TGGGCAACGT CTGGTATCAG CGCGAAGTCT TTATACCGAA AGGTTGGGCA
GGCCAGCGTA TCGTGCTGCG TTTCGATGCG GTCACTCATT ACGGCAAAGT GTGGGTAAAT
AATCAGGAAG TGATGGAGCA TCAGGGCGGC TATACGCCAT TTGAAGCCGA TGTCACGCCG
TATGTTATTG CCGGGAAAAG TGTTCGTATC ACCGTTTGTG TGAACAATGA ACTGAACTGG
CAGACTATCC CGCCGGGAAT GGTGATTACC GATGAAAACG GCAAAAAAAA GCAGTCTTAC
TTCCATGATT TCTTTAACTA TGCCGGGATC CATCGCAGCG TAATGCTCTA CATCACGCCG
AACACCTGGG TGGACGATAT CACCGTGGTG ACGCATGTCG CGCAAGACTG TAACCACGCG
TCTGTTGACT GGCAGGTGGT AGCAAATGGT GATGTCAGCG TTGAACTGCG TGATGCGGAT
CAACAGGTGG TTGCAACTGG ACAAGGCACC AGCGGGACTT TGCAAGTGGT GAATCCGCAC
CTCTGGCAAC CAGGTGAAGG TTATCTCTAT GAACTGTGCG TCACAGCTAA AAGCCAGACA
GAGTGTGATA TCTACCCGCT GCGCGTCGGT ATCCGGTCAG TGGCAGTGAA GGGCGAACAG
TTCCTGATCA ACCACAAACC GTTCTACTTT ACTGGCTTTG GTCGTCATGA AGATGCGGAT
TTGCGCGGCA AAGGATTCGA TAACGTGCTG ATGGTGCACG ATCACGCATT AATGGACTGG
ATTGGGGCCA ACTCCTACCG TACCTCGCAT TACCCTTACG CTGAAGAGAT GCTCGACTGG
GCAGATGAAC ATGGCATCGT GGTGATTGAT GAAACTGCAG CTGTCGGCTT TAACCTCTCT
TTAGGCATTG GTTTCGAAGC GGGCAACAAG CCGAAAGAAC TGTACAGCGA AGACGCAGTC
AACGGGGAAA CCCAGCAGGC GCACTTACAG GCGATTGAAG AGCTGATTGC GCGTGACAAA
AACCACCCAA GCGTGGTGAT GTGGAGTATT GCCAACGAAC CGGATACCCG TCCGCAAGGT
GCACGGGAAT ATTTCGCGCC ACTGGCGGAA GCAACGCGTA AACTCGACCC GACGCGTCCG
ATCACCTGTG TCAATGTAAT GTTCTGCGAC GCTCACACCG ATACCATCAG CGATCTCTTT
GATGTGCTGT GCCTGAACCG TTATTACGGT TGGTATGTCC AAAGCGGCGA TTTGGAAACG
GCAGAGAAGG TTCTGGAAAA AGAACTGCTG GCCTGGCAGG AGAAACTGCA TCAGCCGATT
ATCATCACCG AATACGGCGT GGATACGTTA GCCGGGCTGC ACTCAATGTA CACCGACATG
TGGAGTGAAG AGTATCAGTG TGCATGGCTG GATATGTATC ACCGCGTCTT TGATCGCGTC
AGCGCCGTCG TCGGTGAACA GGTATGGAAT TTTGCCGATT TTGCGACCTC GCAAGGCATA
TTGCGCGTTG GCGGTAACAA GAAAGGGATC TTCACCCGCG ACCGCAAACC GAAGTCGGCG
GCTTTTCTGC TGCAAAAACG CTGGACTGGC ATGAACTTCG GTGAAAAACC GCAGCAGGGA
GGCAAACAAT GA
 
Protein sequence
MLRPVETPTR EIKKLDGLWA FSLDRENCGI DQRWWESALQ ESRAIAVPGS FNDQFADADI 
RNYVGNVWYQ REVFIPKGWA GQRIVLRFDA VTHYGKVWVN NQEVMEHQGG YTPFEADVTP
YVIAGKSVRI TVCVNNELNW QTIPPGMVIT DENGKKKQSY FHDFFNYAGI HRSVMLYITP
NTWVDDITVV THVAQDCNHA SVDWQVVANG DVSVELRDAD QQVVATGQGT SGTLQVVNPH
LWQPGEGYLY ELCVTAKSQT ECDIYPLRVG IRSVAVKGEQ FLINHKPFYF TGFGRHEDAD
LRGKGFDNVL MVHDHALMDW IGANSYRTSH YPYAEEMLDW ADEHGIVVID ETAAVGFNLS
LGIGFEAGNK PKELYSEDAV NGETQQAHLQ AIEELIARDK NHPSVVMWSI ANEPDTRPQG
AREYFAPLAE ATRKLDPTRP ITCVNVMFCD AHTDTISDLF DVLCLNRYYG WYVQSGDLET
AEKVLEKELL AWQEKLHQPI IITEYGVDTL AGLHSMYTDM WSEEYQCAWL DMYHRVFDRV
SAVVGEQVWN FADFATSQGI LRVGGNKKGI FTRDRKPKSA AFLLQKRWTG MNFGEKPQQG
GKQ
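
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, and it also shows how a GC fraction could be computed. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted). To keep the example short, only the first and last lines of the gene sequence are embedded; the helper names are illustrative.

```python
from itertools import product

# Codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
FIRST_LINE = "ATGTTACGTC CTGTAGAAAC CCCAACCCGT GAAATCAAAA AACTCGACGG CCTGTGGGCA"
LAST_LINE = "GGCAAACAAT GA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MLRPVETPTREIKKLDGLWA
    print(translate(LAST_LINE))   # GKQ
```

The two outputs match the first 20 residues and the last 3 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.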