Gene EcHS_A3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3550 
Symbol 
ID5594021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3527545 
End bp3528567 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content54% 
IMG OID640922667 
Productputative hydrolase 
Protein accessionYP_001460148 
Protein GI157162830 
COG category[R] General function prediction only 
COG ID[COG0429] Predicted hydrolase of the alpha/beta-hydrolase fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.9263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGA TAACGACGAC CGATGCCAAT GAATTCAGCA GCAGTGCTGA ATTCACCCCT 
ATGCGCGGCT TTAGCAATTG TCATCTGCAA ACCATGCTGC CGCGTCTGTT TCGTCGCAAG
GTGAAATTCA CCCCGTACTG GCAGCGGCTG GAGTTGCCCG ACGGCGATTT TGTCGATCTC
GCGTGGAGTG AAGCCCCAGC ACAGGCGCGG CATAAACCGC GTTTAGTGGT ATTTCACGGG
CTGGAAGGCA GTCTCAATAG CCCTTACGCC CACGGTCTGG TTGAGGCGGC GCAAAAGCGC
GGCTGGCTGG GCGTGGTGAT GCATTTTCGC GGATGCAGCG GTGAACCAAA CCGTATGCAC
CGCATTTACC ATTCGGGCGA AACCGAAGAC GCCAGTTGGT TTTTACGCTG GCTGCAACGC
GAATTTGGTC ATGCGCCAAC GGCTGCCGTC GGCTATTCGC TCGGCGGTAA TATGCTGGCC
TGTTTGCTGG CAAAGGAAGG CAATGATCTC CCGGTTGATG CGGCGGTGAT TGTCTCTGCG
CCGTTTATGC TGGAAGCCTG TAGCTATCAT ATGGAAAAGG GCTTTTCCCG CGTTTATCAG
CGTTACTTGC TGAACCTGTT AAAAGCCAAT GCCGCGCGCA AGCTGGCAGC CTACCCCGGA
ACGCTGCCGA TTAATCTCGC GCAGTTAAAA TCGGTACGTC GCATCCGTGA ATTTGACGAT
CTAATCACCG CCAGAATTCA CGGCTACGCC GACGCTATCG ACTATTATCG TCAGTGTAGC
GCCATGCCGA TGCTGAACCG GATCGCCAAA CCGACGCTGA TTATTCACGC CAAAGACGAT
CCGTTTATGG ATCATCAGGT GATCCCGAAA CCGGAAAGTC TCCCCCCGCA GGTGGAGTAT
CAACTGACTG AACATGGCGG TCATGTTGGC TTTATTGGCG GTACATTACT TCATCCGCAA
ATGTGGCTGG AGTCACGCAT TCCTGACTGG TTAACAACGT ATCTGGAGGC GAAATCATGT
TGA
 
Protein sequence
MAQITTTDAN EFSSSAEFTP MRGFSNCHLQ TMLPRLFRRK VKFTPYWQRL ELPDGDFVDL 
AWSEAPAQAR HKPRLVVFHG LEGSLNSPYA HGLVEAAQKR GWLGVVMHFR GCSGEPNRMH
RIYHSGETED ASWFLRWLQR EFGHAPTAAV GYSLGGNMLA CLLAKEGNDL PVDAAVIVSA
PFMLEACSYH MEKGFSRVYQ RYLLNLLKAN AARKLAAYPG TLPINLAQLK SVRRIREFDD
LITARIHGYA DAIDYYRQCS AMPMLNRIAK PTLIIHAKDD PFMDHQVIPK PESLPPQVEY
QLTEHGGHVG FIGGTLLHPQ MWLESRIPDW LTTYLEAKSC