Gene Cwoe_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4901 
Symbol 
ID8735367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5228327 
End bp5229550 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content72% 
IMG OID646505529 
ProductHtrA2 peptidase 
Protein accessionYP_003396688 
Protein GI284046348 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.366524 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCT TCGACCTACC GATGGGATCC CCAAGACACC TGTGGACCGG AGAGTGGCGC 
GCCGAGTCCG ACAAAGCCCG TGACGACGGC GCCAGCTCGC TGCGCCGCAC CCCGCCGCCG
CGCCACGTCG AGGACCCCGG AGCACCCTCG CCGGAACCGC AGCAACCGCG CCGGCGCGGG
ACGATCGCGC TCGCGGTGCT CGTCGCCCTC GTCGCCGTCG CCGGCGGCGC GCTCGCCGCG
ACGATGCTGT TCGGCGGCGA CGACAGCAAC AGCCCGGACC CGCTTCCGGC AGTCGCCAGC
AGACCGATCA GACCGCGCGA GGGCCAGACG CGCGCGGGCG CCATCTACGC CGCCGCCAGC
CCCGCGGTCG TGTCGGTCCG CACGACCACC GGCCAGGGCA CCGGCTTCCT CGTCGAGGAC
GACGGCATGA TCGTCACCAA CGCCCACGTC GTCGGTGAGA GCTCCCACGT CGTCGTCAAG
TTCGGCACCG ACGGCGCCTC GATCGACGGC GACGTGCTCG GCAGTGACCC GTCGACCGAC
CTCGCCGTCG TCAGCATCGA GCGCAGCCGC ATCCCCAACG GCGTCAAGGC GCTCAGATTC
GCCGACTCCT CGAACGTCGC CGTCGGCGAC ATGGCGGTCG CGATCGGCAA CCCGTTCGGG
CTCGACCGCA CCGCGACCGA AGGGATCGTC TCCGGGCTCG GCCGCTCGAT CACCTCCCCC
AACGGCTTCG AGATCGACGA GGTGATCCAG ACCGACGCGC CGATCAACCC CGGCAACTCC
GGCGGCCCGC TGCTCGACAG CGGCGGCCGC GTGATCGGCG TCAACTCGCA GATCGCGACG
AGCGGGATGG GCGCGCAGGG CAACATCGGC ATCGGCTTCG CGGTCCCCTC CAACACCGCG
CGCCAGATCG TCCCGCGGCT GGAGAAGGGC GAGGCGATCC CGCGCCCCTA CCTCGGCGTC
ACCACCTCCC CCGCGTCGCT GACGAACCCC GACGGCGCGG TCGTGCAGGA CGTCGTCCCC
GGCGGCCCGG CCGACAGAGC CGGTCTGCGC AGAGGCGACG TCGTCAAGCG GATCGACGGC
AGATCCGTGC AGGAGCCGGG CGACGTCGCG GCCGGCATCT CCGACCGCGC GCCCGGCGAC
GAGGTCGCGA TCGACATCGA GCGCGGCGGC AGAGAGATGA CGGTGAGAGC GACGTTGGGG
ACTCGACCCG CCAGAACGCC GTGA
 
Protein sequence
MGIFDLPMGS PRHLWTGEWR AESDKARDDG ASSLRRTPPP RHVEDPGAPS PEPQQPRRRG 
TIALAVLVAL VAVAGGALAA TMLFGGDDSN SPDPLPAVAS RPIRPREGQT RAGAIYAAAS
PAVVSVRTTT GQGTGFLVED DGMIVTNAHV VGESSHVVVK FGTDGASIDG DVLGSDPSTD
LAVVSIERSR IPNGVKALRF ADSSNVAVGD MAVAIGNPFG LDRTATEGIV SGLGRSITSP
NGFEIDEVIQ TDAPINPGNS GGPLLDSGGR VIGVNSQIAT SGMGAQGNIG IGFAVPSNTA
RQIVPRLEKG EAIPRPYLGV TTSPASLTNP DGAVVQDVVP GGPADRAGLR RGDVVKRIDG
RSVQEPGDVA AGISDRAPGD EVAIDIERGG REMTVRATLG TRPARTP