Gene Cwoe_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3922 
Symbol 
ID8734379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4163601 
End bp4164872 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content70% 
IMG OID646504546 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003395714 
Protein GI284045374 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.155869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAT CCAAGGCCAT CGTCACCCTC GCCGCGGCGG CCGTGCTCGG CGCCGCGGGC 
GGCGCTGCGG TCGTGGGCGT CGCCGGCGAC GGCGGCGGCT CGACCAAGAC GGTGCTCGAG
CCGGCTGCGG CACAGCCGCA GCCCGTCAGC GTCGCGCAGA GAAACGGCGA CGCACTCACC
CCCAAGCAGG TCTACTCGCT CGCGAGAGAC TCGGTCGTGT TCATCACCTC CGACGTCACC
GAGCAGGGTC AGTCCGGTCA GGCGACCGGC TCCGGCTTCG TCATCTCCAA GGACGGCTAC
ATCGTCACCA ACGCGCACGT CGTCAACGGC GCGTCGAAGG TCACCGTGAA GATCGGTGAC
GGGCAGACGC AGGACGCCGA GATCGTCGGC AAGGACGAAT CGACCGACAT CGCACTGCTG
AAGGTGAGCG GCAGCGACGA CCTCAAGCCG CTGCAGTTCG CCGACTCCGA CAAGATCTCC
GTCGGCGACC CGATGTACGC GATCGGCAAC CCGTTCGGCC TCGACCGCAC GCTCACGACC
GGCGTCGTCT CCGCGCTCCA GCGGCAGATC ACGGCGCCCA ACGGCTTCTC GATCGACGGC
GTGATCCAGA CCGACGCGCC GATCAACCCT GGCAACTCGG GCGGCCCGCT GCTCGACGCC
CACGGCGAGG TGGTCGGCGT CAACTCGCAG ATCCTCAACG GCGGCGGCAG CTCCAGCGAG
GGCAACGTCG GCATCGGCTT CGCGGCACCG TCGAACACGG TCAAGAACGT CGTCGAGCAG
CTGCGGCAGA ACGGCTCGGT CGAGCACGCC TACCTCGGCG TCCAGATGGG CGACGCGGCG
AGCGGCGGCG GCGCGCAGGT CGGCGCCGTG ACCCCGGACG GCCCGGCCGC CGCGGGCGGT
GTCCAGCAGG GCGACGTGAT CACCAGCTTC GACGGCAAGA CCGTCACCGA CGCCGCCTCG
CTGTCGAGCA TGGTCAACGC CAAGCAGGTC GGCGACAAAG TCGAGCTGGA GGTTCGCCGC
GGCGACGGCG AGCAGACGCT CAGCGTGACG CTCGCCGCGC AGCCCGCCTC GGCGAGCAGC
GCGCAGCAGC AGAGCCAGGT CGATCCGCAG CAGCAGGTCG ATCCCAACCA GCAGGTGGAC
CCGCAGCAGC AGGTGGACCC GCAGCAGCAG GTGGATCCGA ACCAGCAGGT CGATCCGCAG
CAGCAGGTCG ATCCCAACGG CGGCCAGCAG CAGATCGACC CGCGCGACCT GCTCGAGCAG
CTCATGCCCT GA
 
Protein sequence
MNSSKAIVTL AAAAVLGAAG GAAVVGVAGD GGGSTKTVLE PAAAQPQPVS VAQRNGDALT 
PKQVYSLARD SVVFITSDVT EQGQSGQATG SGFVISKDGY IVTNAHVVNG ASKVTVKIGD
GQTQDAEIVG KDESTDIALL KVSGSDDLKP LQFADSDKIS VGDPMYAIGN PFGLDRTLTT
GVVSALQRQI TAPNGFSIDG VIQTDAPINP GNSGGPLLDA HGEVVGVNSQ ILNGGGSSSE
GNVGIGFAAP SNTVKNVVEQ LRQNGSVEHA YLGVQMGDAA SGGGAQVGAV TPDGPAAAGG
VQQGDVITSF DGKTVTDAAS LSSMVNAKQV GDKVELEVRR GDGEQTLSVT LAAQPASASS
AQQQSQVDPQ QQVDPNQQVD PQQQVDPQQQ VDPNQQVDPQ QQVDPNGGQQ QIDPRDLLEQ
LMP