Gene EcSMS35_3635 details

Gene Information

Locus tag: EcSMS35_3635
Symbol:
ID: 6143985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3695332
End bp: 3696354
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 54%
IMG OID: 641618462
Product: putative hydrolase
Protein accession: YP_001745602
Protein GI: 170683037
COG category: [R] General function prediction only
COG ID: [COG0429] Predicted hydrolase of the alpha/beta-hydrolase fold
TIGRFAM ID:
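
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3695332, 3696354)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.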


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.617226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000831591
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGCAGA TAACGACGAC CGATGCCAAT GAATTCAGCA GCAGTGCTGA ATTCACCCCT 
ATGCGCGGCT TTAGCAATTG TCATCTGCAA ACCATGCTGC CGCGTCTGTT TCGTCGCAAG
GTGAAATTCA CCCCGTACTG GCAGCGGCTG GAGTTGCCCG ACGGCGATTT TGTCGATCTC
GCATGGAGTG AAGACCCTGC CCAGGCGAAC CATAAACCGC GTTTAGTGGT GTTTCACGGG
CTGGAGGGCA GCCTCAATAG CCCTTACGCC CACGGTCTGG TCGAGGCGGC GCAAAAACGC
GGCTGGCTGG GCGTGGTGAT GCATTTTCGC GGATGCAGCG GTGAACCAAA CCGTATGCAC
CGCATTTACC ATTCGGGCGA AACCGAAGAC GCCAGCTGGT TTTTACGCTG GCTGCAGCGC
GAATTTGGAC ATGCGCCAAC GGCTGCCGTC GGCTATTCGC TCGGCGGTAA TATGCTCGCC
TGTTTGCTGG CAAAAGAAGG TAATAACCTC CCGATTGATG CAGCGGTAAT TGTCTCCGCG
CCATTTATGC TGGAAGCCTG TAGTTATCAT ATGGAAAAGG GCTTTTCCCG CGTTTATCAG
CGTTACTTGC TGAACCTGTT AAAAGCCAAT GCTGCGCGCA AACTGGCAGC CTATCCCGGT
ACGCTGCCGA TTAATCTCGC ACAGTTAAAA TCGGTACGTC GCATCCGTGA ATTTGACGAT
CTGATCACCG CCAGAATTCA CGGCTACGCT GACGCTATCG ACTATTATCG TCAGTGTAGC
GCCATGCCGA TGCTGAACCA GATCGCCAAA CCGACGCTGA TTATTCACGC CAAAGACGAT
CCGTTTATGG ATCATCAGGT GATCCCGAAA CCGGAAAGTC TCCCCCCGCA GGTGGAGTAT
CAACTGACTG AACATGGCGG TCATGTTGGC TTTATTGGCG GTACGTTACT TCATCCGCAA
ATGTGGCTGG AGTCACGCAT TCCTGACTGG TTAACAACGT ATCTGGAGGC GAAATCATGT
TGA
 
Protein sequence
MAQITTTDAN EFSSSAEFTP MRGFSNCHLQ TMLPRLFRRK VKFTPYWQRL ELPDGDFVDL 
AWSEDPAQAN HKPRLVVFHG LEGSLNSPYA HGLVEAAQKR GWLGVVMHFR GCSGEPNRMH
RIYHSGETED ASWFLRWLQR EFGHAPTAAV GYSLGGNMLA CLLAKEGNNL PIDAAVIVSA
PFMLEACSYH MEKGFSRVYQ RYLLNLLKAN AARKLAAYPG TLPINLAQLK SVRRIREFDD
LITARIHGYA DAIDYYRQCS AMPMLNQIAK PTLIIHAKDD PFMDHQVIPK PESLPPQVEY
QLTEHGGHVG FIGGTLLHPQ MWLESRIPDW LTTYLEAKSC
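
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It assumes the sequence is pasted in with the space-separated 10-base blocks shown in this record, and it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon-to-amino-acid assignments in the conventional TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCGCAGA TAACGACGAC CGATGCCAAT"))  # MAQITTTDAN
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field of this record.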