Gene Cwoe_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3233 
Symbol 
ID8733682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3438040 
End bp3439320 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content70% 
IMG OID646503850 
Productpeptidase M16 domain protein 
Protein accessionYP_003395026 
Protein GI284044686 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.786099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ACCGGATCAC CGAGCTGGAC TCAGGCGTGC GGATCGTGAC GGAGGGCATG 
CCCTCCGTCC GGTCCGTCTC GCTCGGGTAC TGGATCGGCA CCGGCTCGCG GGGCGAGACC
GACGCGCAGG CGGGGCTCTC GCACCTGATC GAGCACCTGC TGTTCAAGGG CAGCAGCAGA
TATCAGTCGC TCGAGATCGA CCAGATCTTC GACGGCATGG GCGCGGAGCT GAACGCCGGC
ACGGGCAAGG AGACGACCTC CGTCTACTCG CGCGTGATCG ACGAGCACCT CGACCTCGCG
TTCGACGTGA TGAGCGACAT GGTCTTCAGA CCCGCGTTCG AGGACGTCGA CAGCGAGCGC
GAGGTGATCC TCGAAGAGAT CGCGATGTAC GAGGACGACC CGCAGGACAA GGTCTTCGAC
GTGCTCGGCC AGGCCGTCTT CGGCGACCAC CCGCTCGGCC GCTCGATCAT CGGCAGCGCC
GACGTCGTCG CCGGCACGCC GGTCGACGCG ATCAAGGCGT TCCACGACTC CCGCTACGTC
GCGTCGAACG TCGTGCTCGC CGCGGCGGGC GCCGTCGACC ACGACCAGCT CGTCGAGCTG
GCCGCGACGC GCGTCCCCAA CGGCGGCCGC AGCGCGGACG CGCCGCAGCC GCTGCCGGCG
CCGGCGCAGC ACGCGCCGCG CGTGCGCTTC GAGCGCAAGG ACACCGAGCA GTACCACGTC
TGCCTCGGCG GGACGGGGAT CGCGCGCGAC GATGAGCGGC GCTTCGCGCT GCGCGTGCTC
GACACGATCT TCGGCGGGAC GTCGTCCTCG CGGCTGTTCC AGGAGGTGCG CGAGAAGCGC
GGGCTCGCCT ACGCCGTCTA CTCGTTCACC GGGCAGTTCG CCGACACGGG CCAGATCGGC
CTCTACGTCG GGACGCGCAG CGACAACCTG GCTCCGGCGC TCGAGGTCGT CGCGCAGGAG
CTGGAGCGGC TGCGCCGTGA GCCGGCGACC GCCGACGAGC TGGCGCGCGC GAAGGAGAAC
CTGAAGGGCC GTGTCGTGCT GTCGCTCGAA TCGACCGGCT CGCGCATGAA CCGGCTCGGC
TCGGCGCTGC TGAGCGACGT GCCGCTGCTG TCGGTCGACG AGGTCGTCGA GCAGATCGAC
GCCGTCTCGC TCGACGCGGT CGCGCAGCTG GCGGAGGAGC TGTTCGCCCC TGAGCAGCTG
TCGACGGCCG GCATCGGTCC CGACGAGGAC GTCTTCAGAG CGGCGCTGGG GCCGCTCTCT
TCTACCGCCG TGGAGGCATA G
 
Protein sequence
MTDHRITELD SGVRIVTEGM PSVRSVSLGY WIGTGSRGET DAQAGLSHLI EHLLFKGSSR 
YQSLEIDQIF DGMGAELNAG TGKETTSVYS RVIDEHLDLA FDVMSDMVFR PAFEDVDSER
EVILEEIAMY EDDPQDKVFD VLGQAVFGDH PLGRSIIGSA DVVAGTPVDA IKAFHDSRYV
ASNVVLAAAG AVDHDQLVEL AATRVPNGGR SADAPQPLPA PAQHAPRVRF ERKDTEQYHV
CLGGTGIARD DERRFALRVL DTIFGGTSSS RLFQEVREKR GLAYAVYSFT GQFADTGQIG
LYVGTRSDNL APALEVVAQE LERLRREPAT ADELARAKEN LKGRVVLSLE STGSRMNRLG
SALLSDVPLL SVDEVVEQID AVSLDAVAQL AEELFAPEQL STAGIGPDED VFRAALGPLS
STAVEA