Gene Cwoe_2900 details


Gene Information

Locus tag: Cwoe_2900
Symbol:
ID: 8733344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3096239
End bp: 3097231
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 74%
IMG OID: 646503513
Product: metalloendopeptidase, glycoprotease family
Protein accession: YP_003394694
Protein GI: 284044354
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
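
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3096239, 3097231)
print(length, protein_length_aa(length))
```

Under these assumptions the values agree with the record: 993 bp and 330 aa.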


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.554224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.491948
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCC TCGCGATCGA GACGAGCTGC GACGACACCT GCGCAGCGGT CGTCAGCCGC 
GACGGCGAGG TGCTGGCGAA CGTCATCTCC TCCCAGGGCG TCCACGACCG CTACGGCGGC
GTCGTGCCCG AGATCGCCTC GCGCCACCAC CTGGAGCTGA CCGGCGCCGT CGTCGACGAC
GCGCTGCAGC AGGCTGGCAC GACGCTCGCG CAGCTCGACG CGGTCGCCGT CACGCAGGGG
CCGGGCCTCG TCGGCGCGCT GCTCGTCGGC GTGGCGACCG CCAAGTCGCT CGCCGCCGCT
CACGGCCTCC CGCTGATCCC GGTCGACCAC CTCCACGGCC ACGTCGCCGC GTCGTTCCTG
CAGCCAGGGC CGATCGAGCC GCCGTTCCTG ATCCTGATCG CGTCCGGCGG CCACACGCTG
CTCGCGCACG TGAAGGACCA TGGGCCGAAC TGGGAGGTGC TCGGCTCCAC CCGCGACGAC
GCCGCCGGCG AGGCGTTCGA CAAGGGGGCG CGGCTGCTCG GACTCGGCTA CCCGGGCGGG
CCGGCGTTGC AGAAGCTCGC GGAGGAGGGC GACGCGACCG CGTTCAGATT CCCCGTCGCG
GCGCAGGTGC CGGGGCTCGA CTTCTCCTTC TCGGGCCTCA AGACCGCGCT CCTGTACAAG
GTCCGCGAGC TGGGGGAGGA GGAGGCCGCA CGCCGTCGCG CCGACCTCGC CGCGTCCTTC
CAGCGCGCAA TCGTCGAGAC GCTGGCGCGG CGGGTCGAGC GCGCCCGCGA GCAGACCGGG
ATCGAGCGGC TCGCTATCGG CGGCGGCGTC GCGGCGAACG GGCCGCTGCG CGAGCGCATG
CGCGCGCTCG CGCCCGACCT GCACGTCCCG CCGCGCGAGC TGTGCACCGA CAACGCCGCG
ATGATCGCCT CCGCCGCCCG CTTCCTCGAC CCGCTGCCGT ACCCGAGCTA CCTGCGGCTC
GACGCGTACG CGACCGGCGA GCGCGGCCTG TGA
 
Protein sequence
MSILAIETSC DDTCAAVVSR DGEVLANVIS SQGVHDRYGG VVPEIASRHH LELTGAVVDD 
ALQQAGTTLA QLDAVAVTQG PGLVGALLVG VATAKSLAAA HGLPLIPVDH LHGHVAASFL
QPGPIEPPFL ILIASGGHTL LAHVKDHGPN WEVLGSTRDD AAGEAFDKGA RLLGLGYPGG
PALQKLAEEG DATAFRFPVA AQVPGLDFSF SGLKTALLYK VRELGEEEAA RRRADLAASF
QRAIVETLAR RVERAREQTG IERLAIGGGV AANGPLRERM RALAPDLHVP PRELCTDNAA
MIASAARFLD PLPYPSYLRL DAYATGERGL
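
The gene and protein sequences above can be checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-character block spacing, computes the GC fraction, and translates with the standard codon assignments used by translation table 11, reading an alternative start codon such as GTG as methionine in the first position. The function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS; the first codon becomes M if it is a valid start."""
    s = clean(seq)
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        aa = "M" if index == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGAGCATCC TCGCGATCGA GACGAGCTGC"))
```

Passing the full gene sequence to `translate` and comparing the result with the whitespace-stripped protein sequence is one way to confirm that the two listings agree.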