Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0789 |
Symbol | ybhA |
ID | 6146220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 789624 |
End bp | 790442 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615677 |
Product | phosphotransferase |
Protein accession | YP_001742869 |
Protein GI | 170682522 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC GCGTGATTGC TCTCGACTTA GACGGCACCT TATTGACCCC GAAAAAGACC CTGCTTCCTT CATCGATAGA AGCCCTGGCC CGTGCTCGCG AAGCAGGCTA TCGATTAATC ATCGTCACAG GTCGCCATCA CGTCGCTATT CATCCTTTTT ATCAGGCGCT GGCGCTGGAT ACACCTGCTA TTTGCTGTAA TGGCACCTAT TTGTATGATT ATCATGCAAA AACCGTGCTG GAAGCGGACC CAATGCCCGT TAATAAAGCC CTGCAACTCA TTGAGATGCT GAATGAACAC CACATTCACG GTCTGATGTA TGTCGATGAT GCAATGGTCT ATGAGCACCC GACCGGGCAT GTCATTCGCA CGTCTAACTG GGCGCAAACC CTGCCGCCGG AACAGCGTCC GACTTTCACA CAAGTCGCTT CTCTGGCTGA AACGGCGCAA CAAGTTAACG CCGTATGGAA GTTCGCCCTG ACGCACGACG ACCTGCCGCA ATTGCAGCAT TTTGGTAAGC ATGTCGAACA TGAACTGGGA CTGGAGTGTG AATGGTCCTG GCACGATCAG GTTGATATTG CACGCGGCGG CAACAGCAAA GGTAAACGTT TGACGAAATG GGTTGAGGCG CAAGGCTGGT CGATGGAAAA CGTCGTGGCG TTCGGCGATA ACTTCAATGA TATTAGTATG CTGGAAGCCG CTGGTACAGG CGTGGCGATG GGCAACGCCG ATGACGCAGT AAAAGCGCGC GCCAACATTG TGATTGGTGA GAACACCACC GACAGCATTG CCCAGTTCAT TTATAGCCAC CTGATTTAA
|
Protein sequence | MTTRVIALDL DGTLLTPKKT LLPSSIEALA RAREAGYRLI IVTGRHHVAI HPFYQALALD TPAICCNGTY LYDYHAKTVL EADPMPVNKA LQLIEMLNEH HIHGLMYVDD AMVYEHPTGH VIRTSNWAQT LPPEQRPTFT QVASLAETAQ QVNAVWKFAL THDDLPQLQH FGKHVEHELG LECEWSWHDQ VDIARGGNSK GKRLTKWVEA QGWSMENVVA FGDNFNDISM LEAAGTGVAM GNADDAVKAR ANIVIGENTT DSIAQFIYSH LI
|
| |