Gene Clim_1924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1924 
Symbol 
ID6354978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2127046 
End bp2128368 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content59% 
IMG OID642669521 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001943935 
Protein GI189347406 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAACC AAAAAGCAAG AGGGAAGGAG CGCGGAAACC CGAGTGTTCA TCGTCGCACA 
AAAGGATTTC CCGCACTCTC CGACGCAGCA GTCAGGTCGT GCAGCATGTT TTATGAGGTG
CCCCTAAAAA CAGGAGATGC CATCATGAAC CGTTCCTACA TTGTTCTGCG AAACATCGCT
CCGGCACCCT CAAAAACCAG AGGCGGGGTA CGGGCTTCCA TGACGCGGGT TGCAGCGGCC
CCTGAAGTCG AAATTGCCGA ACTCTCCGGC AAATCCGTCA AAAAAGCCAT GCAGGAAAAA
TCCACCATGG CGGTGGCGCC GGTTGTGCCC ATGAAACTCG TAACCCCTCT GCAATCGAAA
AAAACCGTTG CAGAATCGGC ATCGTCCGGC GTGGCCTGGG GTGTCATGGC GGTCGGGGCC
GTCGGGTCAT CCTGTAACGG CGAAGGCATC GTTGCAGCCG TGCTCGATAC GGGCATCGAT
CCCGGTCACC AAGCATTCGC AGGTGTGGAA CTTGTCCGCA AAAACTTCAC CGACGAAAGC
GACGACGACG AGCACGGCCA CGGCACCCAC TGCGCAGGAA CCATCTTCGG GCGCGATGTC
GACGGCATGC GCATCGGCGT TGCCCCCGGC ATCAAAAAAG CCCTTATCGG CAAGGTGCTC
GGCAACTCGG GAGGCGGAAG CGACAAAATA ATCGAAGCCA TCCAGTGGGC GGTCAGCAAC
GGAGCGAACG TCATTTCCAT GTCGCTCGGC ATGGATTTTC CCGGATATGT AAAAGCGCTC
GAAGACGAGG GCTTGCCAAC CGAGCATGCA ACATCGATGG CGCTCGAAGG ATACCGCACC
AACATTCTGC TTTTCCAGAG CATGGCAGCG CTCGTTGCAT CGCAGGAAAT GTTCGGTAAA
ACCTCGCTGC TCATCGCTGC GGCAGGCAAC GAAAGTCAGC GGCCTGCATT CGAAATAGCC
GTCAGCCCGC CGGCCGTATC CAACGGATTC GTCTCCGTTG CGGCGCTCGG CAGAACCCCG
GCAGGTAAAA CCTGGAGTGT AGCCGATTTC TCCAACACAG GCGCCCGGCT TTCCGGTCCC
GGGGTAGATA TCGTCTCCGC CAAACCCGGA GGCGGCCTGA CCCTCATGAG CGGCACCAGC
ATGGCAACTC CCCACGTCGC CGGTGTCGCG GCCCTCTGGG GCCAGAAGCT GCTTGGCGAA
GGCTGCCTGC GCGCAACATT GCTGATCGAC CGGCTCGTCG GCAACGCCTC GACCAAAGGC
ATGAAAAAAG GGTTCGACCC CTCCGACATC GGGGCAGGCA TGGTCATGGC GCCACAGGAT
TGA
 
Protein sequence
MRNQKARGKE RGNPSVHRRT KGFPALSDAA VRSCSMFYEV PLKTGDAIMN RSYIVLRNIA 
PAPSKTRGGV RASMTRVAAA PEVEIAELSG KSVKKAMQEK STMAVAPVVP MKLVTPLQSK
KTVAESASSG VAWGVMAVGA VGSSCNGEGI VAAVLDTGID PGHQAFAGVE LVRKNFTDES
DDDEHGHGTH CAGTIFGRDV DGMRIGVAPG IKKALIGKVL GNSGGGSDKI IEAIQWAVSN
GANVISMSLG MDFPGYVKAL EDEGLPTEHA TSMALEGYRT NILLFQSMAA LVASQEMFGK
TSLLIAAAGN ESQRPAFEIA VSPPAVSNGF VSVAALGRTP AGKTWSVADF SNTGARLSGP
GVDIVSAKPG GGLTLMSGTS MATPHVAGVA ALWGQKLLGE GCLRATLLID RLVGNASTKG
MKKGFDPSDI GAGMVMAPQD