Gene Cwoe_2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2687 
Symbol 
ID8733130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2867671 
End bp2869296 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content73% 
IMG OID646503299 
Productconserved repeat domain protein 
Protein accessionYP_003394481 
Protein GI284044141 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5640] Secreted trypsin-like serine protease 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000153291 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.102085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATTC GAAAGTTCAG GATCCCAGGG CGGGCTGCTG CCGCCGCCCT GGTCGCGCTC 
GGTGCTCTCC TCACCAGCGC CACGGCCGCG TTCGCCAGTG ACAGACCGCT GACGCTGCGC
CAGGACGAGC CGGTCAAGAA GCCCGCGATC GTCGGCGGCA CGCCGATCAC GATCGCGCAG
GTGCCGTGGC AGGTCTTCCT CGAGTCCACG GTCCCCGGCG GTGTCGCCCG CTGTGGCGGC
ACGATCATCA ACACCACGAC GGTCGTCACG GCCGCACACT GCGTCTACGA CAGCAGCACC
GGCCAGCCGA TCCTGGCCGC TGCGATGACG GTCAAGGCCG GCATCTCCGA CTTCAGAAAC
CCCGAGGCGA CGAGACAGGA CAGTCCCGTC GCGCAGGTGC GGGTCCACCC GGGATACGTC
TACGACCTCG GCAGCAGAGG CGTCGCGCCC GACGACGTCG CGGTGCTGAC GCTCGCTGTC
CCGCTCAACT TCAGCGGCCC CGCCGTCAGA CAGATCGCGC CCACGAGCCC GTTCGCGTAC
CTCGCGCCCG GCACCGCCGC GTCGATCTCC GGCTTCGGCC GCCAGACCGC GGGCGGCACG
CCCGACGGCA AGCTCTACGG CCTCGACACG ACCGTCGGCG ATCCGCTCGC GTGCGGCGGC
GAGGCGAACG CCGTCGTCCT GTGCGTCACC AGCCAGGTCG GCTCGGCCTG CCAGGGCGAC
TCCGGCGGCC CGCTGACCGC CGGCGGCGTC TTCGCCGGCG TCGCCAGCTT CGTCACCGTC
AGCGGCCCGA CAGGCGAGTG CGGCCCCGGC TCGCTCAACG GCTACACCAA CCTCGCCGCA
CCCGAGATCC TGGAGTTCAT CCAGGGCAGC GCCAACCCGC CGATCGCGCC CCGCGGCGGC
AGAGACATCA GCGCCCGCGG CGTCTTCCAG GCGAACAACT CGATGACGTG CACCGCCGGC
ACCTGGAGCG GCGCGCCGGC GTTCACCTAC GCGTTCGTCG ACACGCGCAA CGGCCAGGTG
CTGCAGAACG GCCCAAGCGC GACGTACGCG TTCACCGGCG GCGACGTCGG CCGCACGGTC
GCATGCGCCG TCAGCGCCAG CAACGCCGGC GGCACGGGGA TCTCGCGCAC GCAGGCGTCG
CCCGCGATCG CCGCCGCCCC GGCGCCCGCG CCGGTGCCGA CGCCCAGACC GCAGCCGAGA
CCGCAGCCGA GAAGACCGAC CCCGGCGAAG CCGTCGCTGC GCGTCAGCGT CAAGGCGTCG
AGCACGCGCG TCGTCGCCGG CCGCACGGTC ACGTACGCGA TCACCGTCGC CAACCGCGGT
CGCGCGGCGG CCCGCAAGGT CGTCGTCTGC GACGCTCCCG GCAAGGGCCT GACGTTCGGC
AGCCTGCCGA AGGGCGCGAG AAAGTCGCGC GGCCGCGCCT GCTGGTCGCT CGGCACGGTG
AAGGCGAGAT CGACGCGCAC GCTGCGGGTG TCGCTGCGCG TCGCCGGCAC GACGAAGCCC
GGCCTCGTCT CCAACCGCGT CGCGGTCAGC TCCGCGAACG CGGGCAGACG CTCCGCGACC
GCGCGCGTGC GCGTCGTCCC GGCGCAGCAG AGAGGCCGCA CCCGTCCCCC CGGGGTGACC
GGCTAG
 
Protein sequence
MHIRKFRIPG RAAAAALVAL GALLTSATAA FASDRPLTLR QDEPVKKPAI VGGTPITIAQ 
VPWQVFLEST VPGGVARCGG TIINTTTVVT AAHCVYDSST GQPILAAAMT VKAGISDFRN
PEATRQDSPV AQVRVHPGYV YDLGSRGVAP DDVAVLTLAV PLNFSGPAVR QIAPTSPFAY
LAPGTAASIS GFGRQTAGGT PDGKLYGLDT TVGDPLACGG EANAVVLCVT SQVGSACQGD
SGGPLTAGGV FAGVASFVTV SGPTGECGPG SLNGYTNLAA PEILEFIQGS ANPPIAPRGG
RDISARGVFQ ANNSMTCTAG TWSGAPAFTY AFVDTRNGQV LQNGPSATYA FTGGDVGRTV
ACAVSASNAG GTGISRTQAS PAIAAAPAPA PVPTPRPQPR PQPRRPTPAK PSLRVSVKAS
STRVVAGRTV TYAITVANRG RAAARKVVVC DAPGKGLTFG SLPKGARKSR GRACWSLGTV
KARSTRTLRV SLRVAGTTKP GLVSNRVAVS SANAGRRSAT ARVRVVPAQQ RGRTRPPGVT
G