Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4182 |
Symbol | |
ID | 6067364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4620541 |
End bp | 4621341 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603610 |
Product | putative sugar phosphatase |
Protein accession | YP_001727106 |
Protein GI | 170022152 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.152006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCAGG TTGTTGCGTC TGATTTAGAT GGCACGTTAC TTTCTCCCGA CCATACGTTA TCCCCTTACG CCAAAGAAAC CCTGAAGCTG CTCACCGCGC GCGGCATCAA CTTTGTGTTT GCGACCGGTC GTCACCACGT TGATGTGGGG CAAATTCGCG ATAATCTGGA GATTAAGTCT TACATGATTA CCTCCAATGG CGCGCGCGTT CACGATCTGG ATGGCAATCT GATTTTTGCT CATAACCTGG ATCGCGACAT TGCCAGCGAT CTGTTTGGCG TAGTCAACGA CAATCCGGAC ATCATTACTA ACGTTTATCG CGACGACGAA TGGTTTATGA ATCGCCATCG CCCGGAAGAG ATGCGCTTTT TTAAAGAAGC GGTGTTCAAA TATGCGCTGT ATGAGCCTGG ATTACTGGAG CCGGAAGGCG TCAGCAAAGT GTTCTTCACC TGCGATTCCC ATGAACAACT GCTGCCGCTG GAGCAGGCGA TTAACGCTCG TTGGGGCGAT CGCGTCAACG TCAGTTTCTC TACCTTAACC TGTCTGGAAG TGATGGCGGG CGGCGTTTCA AAAGGCCATG CGCTGGAAGC GGTGGCGAAG AAACTGGGCT ACAGCCTGAA GGATTGTATT GCGTTTGGTG ACGGGATGAA CGATGCCGAA ATGCTGTCGA TGGCGGGGAA AGGCTGCATT ATGGGCAGCG CGCATCAACG GCTGAAAGAT CTGCATCCGG AGCTGGAAGT GATTGGTACT AACGCCGAAG ACGCGGTGCC GCATTATCTG CGTAAACTCT ATTTATCGTA A
|
Protein sequence | MYQVVASDLD GTLLSPDHTL SPYAKETLKL LTARGINFVF ATGRHHVDVG QIRDNLEIKS YMITSNGARV HDLDGNLIFA HNLDRDIASD LFGVVNDNPD IITNVYRDDE WFMNRHRPEE MRFFKEAVFK YALYEPGLLE PEGVSKVFFT CDSHEQLLPL EQAINARWGD RVNVSFSTLT CLEVMAGGVS KGHALEAVAK KLGYSLKDCI AFGDGMNDAE MLSMAGKGCI MGSAHQRLKD LHPELEVIGT NAEDAVPHYL RKLYLS
|
| |