Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1506 |
Symbol | |
ID | 5114474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1660471 |
End bp | 1661976 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640491694 |
Product | sulfatase |
Protein accession | YP_001176237 |
Protein GI | 146311163 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.265184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGA CAACGTTAGC AGTGTTTGTC AGCGCGGTCA TTGGCATGAC AGGGGGAATG GGAAATGTTC TGGCGGCAGA GCAATCAGCG AATCAATTAA ATAAACCTAA CGTCGTGATT ATTTTAGCCG ACGACCTGGG CTATGGCGAT TTAGGCATAT ATGGGCATCC CATCGTTAAA ACCCCGAATA TCGATAAACT CGCGCAGGAA GGGGTGAGGT TTTCGCAATA TTACGCACCC GCGCCACTGT GTTCGCCTTC ACGTGCAGGT TTACTGACAG GACGCACCCC CTTCAGAACG GGGATTCGAT CCTGGATCCC GACCAATAAA AATATCGCAC TGGGGCGTAA CGAAAAGACC ATAGCCAGTT ACCTGAAAGA CCAGGGTTAC GACACGGCAA TGATGGGGAA ATGGCATCTT AATGCCGGTG TTGACCGCCA CGATCAGCCC CAGGCTGAAG ATGCTGGTTT CGACTATACG TTGGTCAATG CTGCTGGTTT TGTCACCAGC GATCTGGATA AGGCGAAAGA GCGTCCGCGT AATGGCGTGG TGTACCCGAA TGGGTTTTAT CGAAACGGTA AAGCGCTGGG GACCGTTAAC CAAATCAGCG GTGAATTTGT CAGTCAGGAA GCCATTAACT GGCTAAACGA TAAAAAAGAT AACAAACCTT TCTTTATGTA TGTGGCTTTC ACAGAGGTCC ATACGCCGCT GGCGTCACCC AAAAAATACC TCGAAATTTA TAAAAATTAT ATGAGCGAGT ATGAAAAGCA GCATCCCGAT ATGTTTTATG CCGACTGGGT GGATAAGCCT TATCGTGGTC CGGGAGAATA CTACGCCAAT ATCAGTTACA TGGATGAACA GGTTGGTAAA GTCCTCGCAA AAATCAAATC AATGGGGCAG GAGGACAACA CGATAATTAT CTTTACCAGC GATAACGGTC CTGTCACGCG CGAAGCGCGT AAGTGGTACG AACTTAATAT GGCAGGTGAA ACGGATGGCT TACGGGGTCG CAAAGATAAT TTGTGGGAGG GGGGAATACG CGTGCCAGCG ATCATTAAAT ATGGTCAGCA TTTACACGCC GGCACGGTAA CCGACACGCC TGTAAGCGGT CTGGATATAT TACCCACTCT TGCAGAACTG ACGCATTTTA ACTTGCCGAC CGACCGGATT ATTGATGGGG AATCTATTGT GCCCGTACTT GAGGGACAAA CGATGAACCG CCAGCAACCC TTGTTATTCG CGATTGATAT GCCGTTCCAG GATGATCCGA CGGATATGTG GGCACTACGC GACGGCGACT GGAAGATGAT ATTTGACCGC AATAGCAAAC CTAAATATCT CTATAACCTC AAGCTGGATC GTGGCGAGAC AATGAATCAA CTGGGTAAAC AACCCGTGCT GGAGCAAAAA ATGATAGCCG CGTTAGCACG TTATCAGTCC AGTATTGAAA ATGATTCACT TATGAAGGCT AGGGGCGATA AACCGACACC AGTAGACTGG AACTAA
|
Protein sequence | MRKTTLAVFV SAVIGMTGGM GNVLAAEQSA NQLNKPNVVI ILADDLGYGD LGIYGHPIVK TPNIDKLAQE GVRFSQYYAP APLCSPSRAG LLTGRTPFRT GIRSWIPTNK NIALGRNEKT IASYLKDQGY DTAMMGKWHL NAGVDRHDQP QAEDAGFDYT LVNAAGFVTS DLDKAKERPR NGVVYPNGFY RNGKALGTVN QISGEFVSQE AINWLNDKKD NKPFFMYVAF TEVHTPLASP KKYLEIYKNY MSEYEKQHPD MFYADWVDKP YRGPGEYYAN ISYMDEQVGK VLAKIKSMGQ EDNTIIIFTS DNGPVTREAR KWYELNMAGE TDGLRGRKDN LWEGGIRVPA IIKYGQHLHA GTVTDTPVSG LDILPTLAEL THFNLPTDRI IDGESIVPVL EGQTMNRQQP LLFAIDMPFQ DDPTDMWALR DGDWKMIFDR NSKPKYLYNL KLDRGETMNQ LGKQPVLEQK MIAALARYQS SIENDSLMKA RGDKPTPVDW N
|
| |