Gene Csal_0973 details


Gene Information

Locus tag: Csal_0973
Symbol:
ID: 4026196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1089587
End bp: 1090618
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 71%
IMG OID: 637966150
Product: O-sialoglycoprotein endopeptidase
Protein accession: YP_573029
Protein GI: 92113101
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
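
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1089587
end_bp = 1090618

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1032 bp
print(protein_length, "aa")  # 343 aa
```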


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTAT TGGGCATCGA GACCTCCTGC GACGAAACCG GCGTCGCCAT TTATGACACC 
GAGCGCGGCC TGATCGCCGA TGCGCTGCAC AGCCAAATGG CCATGCACGC CGAATTCGGC
GGTGTGGTCC CGGAACTCGC CTCGCGGGAT CACACTCGCA AGCTGCTGCC GCTGATTCGC
CAGGTGCTCG ACGACGCCGA GCTGCGCGGC GACCAGCTAG ACGCCATCGC CTACACGGCG
GGCCCCGGCC TGGTCGGCGC GCTGATGGTC GGCGCCTCCA CCGCGCACGG CCTGGCGCGC
GCCTGGGACA TCCCGGCACT CGGCGTGCAT CACATGGAAG GCCATCTGCT GGCGCCGATG
CTCGAGGCCG CGCCGCCCGA CTTTCCCTTC GTGGCCCTGC TGGTGTCGGG TGGGCACACG
CAGCTCGTCG AGGTCCACGG CCTGGGCCGT TACCGGCTGC TGGGCGAATC GGTCGACGAT
GCCGCCGGCG AGGCCTTCGA CAAGGCCGCC AAGATGCTCG AACTGCCCTA CCCTGGCGGC
CCCCACGTCG CCCAGCTCGC CGAGCGCGGC GACCCGACCC GGTTTCGCTT TCCGCGCCCG
ATGACCGACC GGCCGGGACT CGACTTCAGC TTTTCGGGTC TCAAGACCCA CACCCTGACC
ACCGCCAACC AGCTCAAGGC GGCGGGCCCC CTCAGCGACC AGGACCGCGC CGACATCGCG
CGCGCCTTCG AGGAAGCCGT CGTCGACACG CTGGTCATCA AGTGCCGGCG CGCCCTCGAC
ACCACGGGCC TCAAGCGGCT GGTGGTGGCC GGCGGCGTCA GCGCCAATCA TCGCCTGCGC
GAGCGCCTGG ACCGGGAAAC CGCCAAGCGC CAGGCCCAGG CGTTCTACCC GCGCGGACGC
TTCTGCACCG ACAACGGCGC AATGATCGCT TATGTCGGCG CACAACGCCT GCTGGCCGGG
GAGCGCGACG ACGCGACGAT GCAGGCCACG CCGCGCTGGC CGCTGGCGTC GCTCACTCCT
CCGGCGGCTT GA
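
The GC content field reported for this gene could be recomputed from the sequence above. The sketch below shows one way to do that, assuming the blocks of ten bases are separated only by whitespace. The function name `gc_fraction` is hypothetical, and only the first row of the sequence is used as sample input, so the printed value is for that row and not for the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the gaps between the ten-base blocks) is ignored.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGCGGGTAT TGGGCATCGA GACCTCCTGC GACGAAACCG GCGTCGCCAT TTATGACACC"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of `first_row` would give the value to compare against the GC content field.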
 
Protein sequence
MRVLGIETSC DETGVAIYDT ERGLIADALH SQMAMHAEFG GVVPELASRD HTRKLLPLIR 
QVLDDAELRG DQLDAIAYTA GPGLVGALMV GASTAHGLAR AWDIPALGVH HMEGHLLAPM
LEAAPPDFPF VALLVSGGHT QLVEVHGLGR YRLLGESVDD AAGEAFDKAA KMLELPYPGG
PHVAQLAERG DPTRFRFPRP MTDRPGLDFS FSGLKTHTLT TANQLKAAGP LSDQDRADIA
RAFEEAVVDT LVIKCRRALD TTGLKRLVVA GGVSANHRLR ERLDRETAKR QAQAFYPRGR
FCTDNGAMIA YVGAQRLLAG ERDDATMQAT PRWPLASLTP PAA
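
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first row of the gene, assuming the standard codon assignments. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are allowed, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in reading frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        residue = CODON_TABLE[cleaned[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        residues.append(residue)
    return "".join(residues)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGCGGGTAT TGGGCATCGA GACCTCCTGC GACGAAACCG GCGTCGCCAT TTATGACACC"
print(translate(first_row))  # MRVLGIETSCDETGVAIYDT
```

The output matches the first twenty residues of the protein sequence. The last row of the gene, `CCGGCGGCTT GA`, translates to `PAA` followed by a TGA stop codon, which matches the end of the protein.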