Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5128 |
Symbol | |
ID | 8735594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 5487702 |
End bp | 5489324 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646505753 |
Product | sulfatase |
Protein accession | YP_003396912 |
Protein GI | 284046572 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGAT CCGACGACAC CCAGCGGGAC GGGGTCACGC GCAGACAGCT GCTGAAGGGT GCCGCGGCCG CGGCACCCGG GATCCTGCTC GGCGGGCAGG CCGCGGCGGC CGCGGCGGCC GCCGCGCGCG AACGGCCGAG ACGGCGGCGG ACGCCGCCGG CGCGTCGCGT CGCGGGCATG AACGTGCTGC TCTTCCTGAC CGACCAGCAA CGCGCGATCC AGCACTTCCC GCCCGGCTGG TCGCAGCGCA ACATGCCCGG GCTGACGCGC CTGCAGCGGC ACGGCCTGAC GTTCGCGAAC GCGTTCACGA ACGCGTGCAT GTGCTCGCCC GCGCGCTCGA CGCTGATGAC GGGCTACTTC CCCGCCCAGC ACGGCGTCAA GTACACGCTC GAGACGGACA TGCCGTCGCC GCAGTACCCG CAGGTCGAGC TGGCGACGAC GTTCAAGAAC CCCGCCACCG TCGTCGCGGC GGCCGGCTAC ACGCCGGTCT ACAAGGGCAA GTTCCACTGC GTCAAGCCGG CGAACGGCTC GACCTGGGTC CCCTCCGACG TCAACCAGTA CGGCTTCACG CGCTGGGACC CGCCGGACGC CGGGGCCAAC CAGGACATCC CCGAGGAGGG CGGCGGGACC TACGACAACG ACGGCCGCTT CATGAACTCA CAGGGCACGC CCGAGGCCGG CACCGAGGGT GCGCTCCAGT ACCTCTCCTC CGTCGCCGCG CAGAGCCAGC CGTTCTTCAT GGTCGTCTCG CTCGTCAACC CGCACGACGT CCTCTTCTAC CCGAAGACCT ACGAGTCCGG CGGCTACGAC GACTCGTGGC TGAGAGGGGA GATCGAGCCG CCCGCGACGG CGAACGAGGA CCTCTCGACC AAGCCGGCGG TGCAGAGACA GTTCCAGCGC CTCTTCAGCG CCACCGGGCC GCTCCCAACG CCGCAGATGA AGCGCAACTA CCTGAACTTC TACGGCAACC TGATGAAGGC CTCCGACGCC TACCTGGTGA AGCTGCTCGA CACGCTGAAG AGCACCGGCC TGCTCGACGA CACGCTCGTG ATCGCGACCG CCGACCACGG CGAGATGGGC ACCGCACACG GCGGCCTGCG ACAGAAGAAC TTCAACTTCT ACGAGGAGTC GACGCGCGTC CCGCTCGTCT ACTCCAACCC GCGCCTCTTC AGAAGACCCG AGCGCAGCGA CGCGCTCGTC TCGCACGTCG ACTTCCTGCC GACGCTCGCG AGCCTCGTCG GCGCGCCGGC CTCCGCGCGC GCGAACTGGG AGGGCGTCGA CTACTCCTCC CAGATCCTCG ACCGCTCGCC GAAGCCGACG CAGGACTACA CCGTCTTCAC GTACGACGAC TGGCAGTCCG GGCAGGCGAG AGGCCCCTAC CCCCAGCCGC CGAACCACAT CGTCAGCATC CGCGAACGGC GCTGGAAGCT CGCCCGCTAC TACGACGCCG ACGGCAGAGC GCCGGACCAA TGGGAGATGT ACGACCTGAA GAGCGACCCG CTGGAGCGCA GAAACCTCGC GTGGCGGGGC TACAGACGGA CGCCCGCGCA GGAGCGCGAG TACCGCCGCC TGCGGCGCAA GCTCGCGCGC GTCGAGCGGA CGCGGCTGCA GCCGCTGAGC TGA
|
Protein sequence | MAGSDDTQRD GVTRRQLLKG AAAAAPGILL GGQAAAAAAA AARERPRRRR TPPARRVAGM NVLLFLTDQQ RAIQHFPPGW SQRNMPGLTR LQRHGLTFAN AFTNACMCSP ARSTLMTGYF PAQHGVKYTL ETDMPSPQYP QVELATTFKN PATVVAAAGY TPVYKGKFHC VKPANGSTWV PSDVNQYGFT RWDPPDAGAN QDIPEEGGGT YDNDGRFMNS QGTPEAGTEG ALQYLSSVAA QSQPFFMVVS LVNPHDVLFY PKTYESGGYD DSWLRGEIEP PATANEDLST KPAVQRQFQR LFSATGPLPT PQMKRNYLNF YGNLMKASDA YLVKLLDTLK STGLLDDTLV IATADHGEMG TAHGGLRQKN FNFYEESTRV PLVYSNPRLF RRPERSDALV SHVDFLPTLA SLVGAPASAR ANWEGVDYSS QILDRSPKPT QDYTVFTYDD WQSGQARGPY PQPPNHIVSI RERRWKLARY YDADGRAPDQ WEMYDLKSDP LERRNLAWRG YRRTPAQERE YRRLRRKLAR VERTRLQPLS
|
| |