Gene Cwoe_5128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5128 
Symbol 
ID8735594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5487702 
End bp5489324 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content70% 
IMG OID646505753 
Productsulfatase 
Protein accessionYP_003396912 
Protein GI284046572 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAT CCGACGACAC CCAGCGGGAC GGGGTCACGC GCAGACAGCT GCTGAAGGGT 
GCCGCGGCCG CGGCACCCGG GATCCTGCTC GGCGGGCAGG CCGCGGCGGC CGCGGCGGCC
GCCGCGCGCG AACGGCCGAG ACGGCGGCGG ACGCCGCCGG CGCGTCGCGT CGCGGGCATG
AACGTGCTGC TCTTCCTGAC CGACCAGCAA CGCGCGATCC AGCACTTCCC GCCCGGCTGG
TCGCAGCGCA ACATGCCCGG GCTGACGCGC CTGCAGCGGC ACGGCCTGAC GTTCGCGAAC
GCGTTCACGA ACGCGTGCAT GTGCTCGCCC GCGCGCTCGA CGCTGATGAC GGGCTACTTC
CCCGCCCAGC ACGGCGTCAA GTACACGCTC GAGACGGACA TGCCGTCGCC GCAGTACCCG
CAGGTCGAGC TGGCGACGAC GTTCAAGAAC CCCGCCACCG TCGTCGCGGC GGCCGGCTAC
ACGCCGGTCT ACAAGGGCAA GTTCCACTGC GTCAAGCCGG CGAACGGCTC GACCTGGGTC
CCCTCCGACG TCAACCAGTA CGGCTTCACG CGCTGGGACC CGCCGGACGC CGGGGCCAAC
CAGGACATCC CCGAGGAGGG CGGCGGGACC TACGACAACG ACGGCCGCTT CATGAACTCA
CAGGGCACGC CCGAGGCCGG CACCGAGGGT GCGCTCCAGT ACCTCTCCTC CGTCGCCGCG
CAGAGCCAGC CGTTCTTCAT GGTCGTCTCG CTCGTCAACC CGCACGACGT CCTCTTCTAC
CCGAAGACCT ACGAGTCCGG CGGCTACGAC GACTCGTGGC TGAGAGGGGA GATCGAGCCG
CCCGCGACGG CGAACGAGGA CCTCTCGACC AAGCCGGCGG TGCAGAGACA GTTCCAGCGC
CTCTTCAGCG CCACCGGGCC GCTCCCAACG CCGCAGATGA AGCGCAACTA CCTGAACTTC
TACGGCAACC TGATGAAGGC CTCCGACGCC TACCTGGTGA AGCTGCTCGA CACGCTGAAG
AGCACCGGCC TGCTCGACGA CACGCTCGTG ATCGCGACCG CCGACCACGG CGAGATGGGC
ACCGCACACG GCGGCCTGCG ACAGAAGAAC TTCAACTTCT ACGAGGAGTC GACGCGCGTC
CCGCTCGTCT ACTCCAACCC GCGCCTCTTC AGAAGACCCG AGCGCAGCGA CGCGCTCGTC
TCGCACGTCG ACTTCCTGCC GACGCTCGCG AGCCTCGTCG GCGCGCCGGC CTCCGCGCGC
GCGAACTGGG AGGGCGTCGA CTACTCCTCC CAGATCCTCG ACCGCTCGCC GAAGCCGACG
CAGGACTACA CCGTCTTCAC GTACGACGAC TGGCAGTCCG GGCAGGCGAG AGGCCCCTAC
CCCCAGCCGC CGAACCACAT CGTCAGCATC CGCGAACGGC GCTGGAAGCT CGCCCGCTAC
TACGACGCCG ACGGCAGAGC GCCGGACCAA TGGGAGATGT ACGACCTGAA GAGCGACCCG
CTGGAGCGCA GAAACCTCGC GTGGCGGGGC TACAGACGGA CGCCCGCGCA GGAGCGCGAG
TACCGCCGCC TGCGGCGCAA GCTCGCGCGC GTCGAGCGGA CGCGGCTGCA GCCGCTGAGC
TGA
 
Protein sequence
MAGSDDTQRD GVTRRQLLKG AAAAAPGILL GGQAAAAAAA AARERPRRRR TPPARRVAGM 
NVLLFLTDQQ RAIQHFPPGW SQRNMPGLTR LQRHGLTFAN AFTNACMCSP ARSTLMTGYF
PAQHGVKYTL ETDMPSPQYP QVELATTFKN PATVVAAAGY TPVYKGKFHC VKPANGSTWV
PSDVNQYGFT RWDPPDAGAN QDIPEEGGGT YDNDGRFMNS QGTPEAGTEG ALQYLSSVAA
QSQPFFMVVS LVNPHDVLFY PKTYESGGYD DSWLRGEIEP PATANEDLST KPAVQRQFQR
LFSATGPLPT PQMKRNYLNF YGNLMKASDA YLVKLLDTLK STGLLDDTLV IATADHGEMG
TAHGGLRQKN FNFYEESTRV PLVYSNPRLF RRPERSDALV SHVDFLPTLA SLVGAPASAR
ANWEGVDYSS QILDRSPKPT QDYTVFTYDD WQSGQARGPY PQPPNHIVSI RERRWKLARY
YDADGRAPDQ WEMYDLKSDP LERRNLAWRG YRRTPAQERE YRRLRRKLAR VERTRLQPLS