Gene Information |
Locus tag | Cwoe_5750 |
Symbol | |
ID | 8736226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 6155467 |
End bp | 6156984 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646506377 |
Product | sulfatase |
Protein accession | YP_003397526 |
Protein GI | 284047186 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
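
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon not counted in Protein Length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Cwoe_5750.
START_BP, END_BP = 6155467, 6156984

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1518
print(protein_length(length_bp))  # 505
```

Both results agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields.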
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.457372 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACG ACGGACGGAT CTCACGCCGC GCCCTGCTGC GCGGCGGCGG CGCGTCGGCC GGCCTCGCCG CGCTGACCGG CGCCGGCGTG CTCGGCGCCG CCGCGACCGC GCAGGCGCAG GACGGCGACA GAGCGAGAGA GCGCCCCAAC GTCGTGCTGA TCGTCGCTGA CCGCCTGCGG GCCGACTACG TCGGCGCCTA CGACGACGTC TTCGACGACC GCCACGCCAA GACGCCCAAC ATCGACGAGC TGGCCGACCA GGCGCTGCGC TTCAAGTACG CCGTCCCGGA CGGCATGCCG GCGATCCCGA TGCGCCGCGG GATGCTGACC GGGATGCGCA GCTACCCGTT CCGCGACTGG AGAGCGACGG CGGGGATGCC AGCGATCCCC GGCTTCAACA AGATCTACGA CTTCCAGCCG ATGGTCACCG AGCTGGCCGC CGCCGCCGGG ATCACGACCG TCTACGTGAC CGACAACCCG ACGTTCACCG GCCCGCGCTT CGGCCGGATC GTGCGGACCG GCCCGCTCGC GACCTCGGCC GGCTTCGAGT CGACCGAGCG CGACTACCTG CTGCCGCTCG GCGGCACGGT CCAGCGCAGA CGCCAGGAGC CGACGAGCCG CGTCCTGCGC GAGGGGATCG AGCAGCTCGA CCAGCTGAGA GGCAGACAGC CGTTCTTCCT GACGCTCGAC GCGTTCGACC CCAACGACGC GTTCCGGCTG CCGCGCCAGT TCGTCGAGGG CAGCGGCCCG CTGCACGACG ACGTCACGCT GCCGAAGGAC CGCGTCTACC AGCAGACGTT CAGAGCCGAC GGCGACGTCA AGGGCGAGGT GCGCGACCGC TACGCCGCCG AGGTCGAGTC GGTCGACGGC TGGGTCGGCA GATTCCTCGA CAAGCTCGAC GACGCCGGCC TCGCCGACAA CACCGTCGTC GTCTTCGTCG GCGACTCCGG CATCGCGCTC GGCGAGCAGG GCGTCTACGG CCACCCCGCC GGCGTCTGGC ACCGGCGCGC CTACCACGTG CCGTTCCTGA TCCGCGACCC CGACGGCCGC TGGGCCGGCG ACACCAGCAA GTGGTTCGCC TCCACGCACG ACATCCCGTC GACGCTGCTC TCCTACTGGG GCATCACGAG CCCGGGCAGA ATGCAGGGCG AGGACCTGAC GACGCTGTTC GACGACTTCG ACCTGCCGGC GCGGCCGTAC TACACGACCG CGATCGACAC GCACATCGTC ACGGGCAGCC GCACGTGGCT GCTGATCGGC CGCTCCGACC AGGACCGCTG GCGCCTGTAC GAGGCCGAGG ACGAGGACGA GCCGGACGAG ATCCGCACCG AGACCGTCAA GTCGCCGACC GTGCTCGAGG AGATGAGACG GTATGCGCTG GCCGCCGCCG GCGGCACGCT GCCCGACTTC GGCGACACGG CCGCGATACG GCCGGCACAG CCCGACTCGC TCGACAAGAA GGTCGCCGAC GACGGCACGC TCGACGAGGA CGAGGCGGAG GCGAACGAGC TTCGATGA
Protein sequence | MADDGRISRR ALLRGGGASA GLAALTGAGV LGAAATAQAQ DGDRARERPN VVLIVADRLR ADYVGAYDDV FDDRHAKTPN IDELADQALR FKYAVPDGMP AIPMRRGMLT GMRSYPFRDW RATAGMPAIP GFNKIYDFQP MVTELAAAAG ITTVYVTDNP TFTGPRFGRI VRTGPLATSA GFESTERDYL LPLGGTVQRR RQEPTSRVLR EGIEQLDQLR GRQPFFLTLD AFDPNDAFRL PRQFVEGSGP LHDDVTLPKD RVYQQTFRAD GDVKGEVRDR YAAEVESVDG WVGRFLDKLD DAGLADNTVV VFVGDSGIAL GEQGVYGHPA GVWHRRAYHV PFLIRDPDGR WAGDTSKWFA STHDIPSTLL SYWGITSPGR MQGEDLTTLF DDFDLPARPY YTTAIDTHIV TGSRTWLLIG RSDQDRWRLY EAEDEDEPDE IRTETVKSPT VLEEMRRYAL AAAGGTLPDF GDTAAIRPAQ PDSLDKKVAD DGTLDEDEAE ANELR
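
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any downstream use. The sketch below is a minimal illustration in Python of stripping the block spacing and computing a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence, used as a short example.
example = clean_sequence("ATGGCGGACG ACGGACGGAT")
print(len(example))          # 20
print(gc_fraction(example))  # 0.65
```

Applied to the full gene sequence, the same function gives a value that can be compared against the GC content field (71%).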