Gene Cwoe_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1788 
Symbol 
ID8732229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1880543 
End bp1881751 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID646502405 
ProductPeptidase M23 
Protein accessionYP_003393589 
Protein GI284043249 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCC GCATCCTCCT CGCCACACTG CTGCTCCCGC TGGTCGTGTG GGTCTCGCTC 
CCGCTCGTCT CCAGCGCCGA CCCGCAGGAC GACCTCGATC GCATCGAGGG CAGAATCGAC
GCCAAGCGCG GGCAGCTCGA CCGCGTCAGA GGCAGAGCAC AGATACTCAC GAGCGACATC
ACCGCGTTCA CGCGCAGAAT CGACGCGCTC CAGGGGACCG TCGACACGCT CCAGCGCAGA
CAGGACACGA TCCAGGCGAA TCTCGACGAG AAGCGCCGCG AGCTGGCGAG AACGCAGGAA
GAGCTGCGCG TCACGCGCGC GCGCCTCGCG AGACTGAAGG CGCGGCTGGA GAGATCGCGC
AAGATCCTCG CCGCCCGCCT CGTCGAGGTC TACAAGTCCG ACGAGCCCGA CATGGTCAGC
GTCGTGCTCG ACGCCGACGG CTTCGCGGAG CTGCTGGAGA ACGGCGCCTA CCTCGAGCGG
ATCGGCGAGC AGGACCGCCG CATCATCAGC TCGGTCAAGG ACGCGAAGGC CGAGGCCGCG
ATAACGACGA GACGGCTCAG CGTGCTCGAG GCCAGACAGC AGGCGATCGC CGACCAGATC
TACGAGCAGC GCAACGAGGT CGCGCGCGCG CGAATCGAGG TCGAGGGCAA GCGCGACGCG
GTCGACCGCG TGCGCGCCGG CAAGCGCAGG CTGCTCGGCC GGATCCGCTC CCACCAGCAC
GAGCTGAACG AGGACATCGA CGCGCTGCAG GCGCAGGAGT CGAAGATCCA GCGGCGCATC
CAGGCCGCGC AGAACCCGAC CTCCGGCATC GCTCCGGGGC CGATCAGAGG CGGCGGCCGC
TTCATCTGGC CCGTCAACGG CCCGATCACC AGCTCGTTCG GCTGGCGCAC GTCGCCCGTG
ACGAGATTCC ACCAGGGCCT CGACATCGGC GTTCCCGAGG GCACCCCGAT CCGCGCCGCC
GGCAGCGGCA GCGTGATCCT CGCCGGCGTC AACGGCGGCT ACGGCAACTT CACCTGCATC
GACCACGGCG GCGGCGTCTC CAGCTGCTAC GCGCACCAGT CCTCGATCGG CGTCGGCGTC
GGTCAGAGCG TCTCGCAGGG CCAGGTCATC GGCGCGGTCG GCAACACCGG CTTCTCGTTC
GGCGCCCACC TCCACTTCGA GGTGCGCATC AACGGCTCCG CCGTGCAGCC GCTGAACTAC
CTCGGCTGA
 
Protein sequence
MRIRILLATL LLPLVVWVSL PLVSSADPQD DLDRIEGRID AKRGQLDRVR GRAQILTSDI 
TAFTRRIDAL QGTVDTLQRR QDTIQANLDE KRRELARTQE ELRVTRARLA RLKARLERSR
KILAARLVEV YKSDEPDMVS VVLDADGFAE LLENGAYLER IGEQDRRIIS SVKDAKAEAA
ITTRRLSVLE ARQQAIADQI YEQRNEVARA RIEVEGKRDA VDRVRAGKRR LLGRIRSHQH
ELNEDIDALQ AQESKIQRRI QAAQNPTSGI APGPIRGGGR FIWPVNGPIT SSFGWRTSPV
TRFHQGLDIG VPEGTPIRAA GSGSVILAGV NGGYGNFTCI DHGGGVSSCY AHQSSIGVGV
GQSVSQGQVI GAVGNTGFSF GAHLHFEVRI NGSAVQPLNY LG