Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4186 |
Symbol | |
ID | 5590158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4173672 |
End bp | 4175165 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640927803 |
Product | sulfatase |
Protein accession | YP_001465162 |
Protein GI | 157157618 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCC CCAATTTTCT GTTCATCATG ACCGATACCC AGGCCACCAA TATGGTCGGT TGCTATAGCG GTAAACCGCT GAATACGCAA AATATTGATA GTCTGGCGGC GGAAGGTATT CGCTTTAATT CCGCCTACAC CTGTTCACCG GTTTGTACGC CTGCACGCGC CGGACTATTT ACCGGTATCT ACGCTAACCA GTCCGGCCCG TGGACCAACA ACGTCGCGCC AGGCAAAAAC ATCTCCACTA TGGGGCGCTA CTTTAAGGAT GCCGGCTATC ACACCTGTTA CATCGGCAAA TGGCATCTCG ACGGTCATGA CTATTTCGGC ACTGGCGAGT GTCCGCCGGA GTGGGACGCT GATTACTGGT TCGATGGGGC GAACTACCTT AGCGAACTGA CGGAAAAAGA GATCAGCCTG TGGCGCAATG GCCTAAACAG CGTTGAGGAT TTACAGGCGA ACCATATCGA CGAAACCTTC ACCTGGGCGC ACCGCATCAG CAATCGGGCG GTAGATTTTC TGCAACAGCC CGCGCGCGCC GAGGAACCCT TCCTGATGGT GGTTTCGTAT GATGAGCCGC ATCACCCGTT CACCTGTCCG GTGGAGTATT TAGAGAAATA CGCTGATTTT TACTACGATC TGGGCGAGAA AGCTCAGGAT GACCTGGCGA ACAAACCGGA ACATCACCGC TTATGGGCGC AGGCGATGCC ATCGCCAGTC GGTGATGACG GGCTTTATCA CCATCCGCTC TATTTTGCCT GTAATGACTT TGTTGATGAC CAAATCGGAC GGGTCATCAA TGCCTTAACG CCAGAGCAAC GTGAAAATAC GTGGGTCATT TATACTTCCG ATCACGGCGA AATGATGGGC GCACATAAGC TGATCAGTAA AGGAGCGGCG ATGTATGACG ACATCACCCG TATTCCGCTG ATCATCCGTT CGCCGCAAGG GGAGCGGCGA CAGGTCGATA CGCCAGTCAG TCATATCGAT TTACTGCCGA CAATGATGGC GCTGGCAGAT ATTGAAAAAC CAGAGATTCT GCCGGGGGAA AATATCCTTG CCGTGAAAGA GCCACGCGGT GTAATGGTGG AATTTAACCG CTACGAGATT GAGCATGACA GCTTTGGCGG TTTTATTCCG GTGCGTTGCT GGGTGACGGA TGACTTTAAA CTGGTACTCA ACCTCTTCAC CAGTGATGAA CTTTACGATC GCCGTAATGA CCCAAATGAA ATGCATAACC TGATCGATGA TATCCGTTTT GCAGACGTTC GCAGCAAAAT GCATGACGCC TTATTGGATT ACATGGACAA AATTCGCGAT CCGTTCCGCA GTTACCAATG GAGCCTGCGT CCGTGGCGTA AAGATGCACT GCCGCGCTGG ATGGGGGCAT TTCGTCCACG CCCACAAGAT GGCTATTCGC CGGTGGTACG TGACTATGAC ACCGGCCTAC CGACGCAAGG AGTGAAAGTG GAAGAGAAAA AACAGAAGTT CTGA
|
Protein sequence | MKRPNFLFIM TDTQATNMVG CYSGKPLNTQ NIDSLAAEGI RFNSAYTCSP VCTPARAGLF TGIYANQSGP WTNNVAPGKN ISTMGRYFKD AGYHTCYIGK WHLDGHDYFG TGECPPEWDA DYWFDGANYL SELTEKEISL WRNGLNSVED LQANHIDETF TWAHRISNRA VDFLQQPARA EEPFLMVVSY DEPHHPFTCP VEYLEKYADF YYDLGEKAQD DLANKPEHHR LWAQAMPSPV GDDGLYHHPL YFACNDFVDD QIGRVINALT PEQRENTWVI YTSDHGEMMG AHKLISKGAA MYDDITRIPL IIRSPQGERR QVDTPVSHID LLPTMMALAD IEKPEILPGE NILAVKEPRG VMVEFNRYEI EHDSFGGFIP VRCWVTDDFK LVLNLFTSDE LYDRRNDPNE MHNLIDDIRF ADVRSKMHDA LLDYMDKIRD PFRSYQWSLR PWRKDALPRW MGAFRPRPQD GYSPVVRDYD TGLPTQGVKV EEKKQKF
|
| |