Gene Clim_1403 details

Gene Information

Locus tag: Clim_1403
Symbol:
ID: 6356174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1507907
End bp: 1509574
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 51%
IMG OID: 642669014
Product: carboxyl-terminal protease
Protein accession: YP_001943442
Protein GI: 189346913
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
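
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. This is a minimal sketch in Python, assuming that Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1507907, 1509574)
print(length)                  # 1668
print(protein_length(length))  # 555
```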


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGCA TTTTAACCGT TATAGTAATG GTGGTTGTTC TTGCCTTCGG TGTCTTTCTT 
GGTACCAGAC TGAACCGAGG CGATCATGAC AGAAAAGCTT CCGAAAGCAA AATGGTTGAT
GCGTACAGCC TGATCAGAGA CCTGTATGTT GATGAAGTGC AGGCAGACAG TCTTGTTGGA
GCCGGAATCA AGGGAATGGT GGAGTCTCTC GATCCCCATT CGGTTTATCT CGAACCCGAG
GAGGTTTCGT TTTCGCAGGC CGAATTCGAC GGAAATTTTG ATGGCATAGG CATAGAGTTC
GACGTTATCA ACGACACGCT GCTTGTCGTA ACGCCTCTTT CGGGTGGGCC GAGTGCTACT
GTCGGTATTG CTGCCGGTGA TCGTATTGTG GCTATCGATT CGGTTTCGGC AATCGGAATA
ACGCATCAGC AGGTACTGCG CAAACTCAGA GGGAAACGCG GAACAACAGT GCATCTGAAA
GTATTTCGTC CACTTGTCGG CAAGCTCATG GATTTTCAGG TTACAAGAGG ACGGATTTCA
ACCTCGAGTA TCGATGCTTT TTTTGTTCTT CAGAACGGTA CGGGCTATAT CCGGCTGAGC
CGTTTTGTCG CAACAACCGG CGATGAGTTC CGAAAGGCTC TCGCAAGCCT GAAAAAGAAA
GGCATGAAGC GCCTTGTCAT CGATTTGCGG GGTAATCCGG GAGGTTTTCT CGAGCAGGCA
GTCGAAGTTG CCGACGAATT CCTTCGCAAA GACCAGTTGG TCGTTTATAC CAAGAGTGCC
AAGAATGCCG TTGAAGATGC CAGATATGTA GCCAAGTCCG GCGATGGATT CGAGAGCGGA
GAGGTTGCGG TACTTGTCGA CAAAGGCAGT GCTTCCGCAT CTGAAATTCT TGCCGGAGCA
CTGCAGGATA ACAAGCGGGC AGTGATTATC GGAGAGCTTA CCTTTGGAAA GGGGCTTGTT
CAGCGACAGT TCGAGTTCAG GGATGGTTCC GCCCTGCGAC TTACCGTATC CCGCTATTAC
ACCCCTTCAG GTCGTCAGAT TCAGAGAACC TATCGCAAGG GAGGCGATGG GCGAGAGCTG
TATTACAAGG ACGCCCTTGT CAATGTACAA CCCGGGAAAC TGTTTACGGA TCCCGCTCGT
TTTCTTTACC TTGAAAACAA TGACGTATCC GTTTATCGTA CCGGGACCCT TCCTGCTCTG
CTTTCGCGTC CTGTTGCCGG TAAGGAATTT CAGGATAATC AGTTTACCCT GCTCAAGGAT
GCCGGCGGCA TTATACCCGA TTACTGGGTA AGCGGGAGGC CTTATTCCGA TTTCTATCAG
GAGCTTTACC GAACCGGTTC CTTTGAGCGG CTTGCCCAGA GAATTCTTGA CGATCCCGGC
AGTTCCGTTC AGGCGCATCG GAAGTCGCTT GGAGCTTTTA TGAAGGATTA TGCCGGAGAA
AACAGGCTTG AAGCGCTGGT CATGAAGATC TGTGCTGAAA AAAAGATTAC ATTCAACAGA
CAGGCCTTCA GCAAGGAGCA GAAATATATC TCCCTGGCCG TAAAGGCAAG GCTTGCGCAC
AGGTTGTTCG GCACGGAAGG GCAGATCATG GTTTATATCA TGCAGTCCGA TCCGCTGATC
GGCGTGGCTT CGAAAGTTTT TGCTTCAGGA ACTCAATCAG TGCGTTGA
 
Protein sequence
MSRILTVIVM VVVLAFGVFL GTRLNRGDHD RKASESKMVD AYSLIRDLYV DEVQADSLVG 
AGIKGMVESL DPHSVYLEPE EVSFSQAEFD GNFDGIGIEF DVINDTLLVV TPLSGGPSAT
VGIAAGDRIV AIDSVSAIGI THQQVLRKLR GKRGTTVHLK VFRPLVGKLM DFQVTRGRIS
TSSIDAFFVL QNGTGYIRLS RFVATTGDEF RKALASLKKK GMKRLVIDLR GNPGGFLEQA
VEVADEFLRK DQLVVYTKSA KNAVEDARYV AKSGDGFESG EVAVLVDKGS ASASEILAGA
LQDNKRAVII GELTFGKGLV QRQFEFRDGS ALRLTVSRYY TPSGRQIQRT YRKGGDGREL
YYKDALVNVQ PGKLFTDPAR FLYLENNDVS VYRTGTLPAL LSRPVAGKEF QDNQFTLLKD
AGGIIPDYWV SGRPYSDFYQ ELYRTGSFER LAQRILDDPG SSVQAHRKSL GAFMKDYAGE
NRLEALVMKI CAEKKITFNR QAFSKEQKYI SLAVKARLAH RLFGTEGQIM VYIMQSDPLI
GVASKVFASG TQSVR
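
The sequences above are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. This is a minimal sketch in Python, assuming that the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch ignores). `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helper names, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (third base varies fastest); "*" marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons up to, but not including, the first stop.

    Codons containing ambiguous bases (e.g. N) raise KeyError in this sketch.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed above.
start = clean_sequence("ATGTCCCGCA TTTTAACCGT TATAGTAATG")
print(translate(start))  # MSRILTVIVM
```

Running `gc_fraction` and `translate` on the full cleaned gene sequence gives values to compare against the GC content and protein sequence listed in the record.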