Gene Clim_0020 details

Gene Information

Locus tag: Clim_0020
Symbol:
ID: 6354934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 23941
End bp: 25638
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 52%
IMG OID: 642667644
Product: carboxyl-terminal protease
Protein accession: YP_001942107
Protein GI: 189345578
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
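
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(23941, 25638)
print(length, protein_length(length))  # 1698 565
```

Both results agree with the values listed in the record.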


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCAG TATCATTCCG CAGGCATGTT GTCGCAGGAC TGACAGCGGC ACTCCTTGCT 
CTCTCTTTGC CGGGATCTCA TCTGCAGGCC GTTCCCAAGG AGAACCCCGG GCAGGCATAC
TTTGAAATAG TAAAAGGCAT AGATCTGCTT GGCGAGGTAT ACCGCAGACT TTCCGAAAAC
TATGTAGAAC CGATTGACGC AGTTAAACTC ATGTATGCCG CCATTGACGG CATGCTTGCG
GTTCTTGATC CCTATACGGT ATTTCTCGAT GAGAGTCAGT CCGAAGAGCT TGGTGAAATG
ACCAGTGGAC AGTACACCGG CATAGGACTC AATATGAGCA GATTCGTTGA AAAAGTCTAT
ATAACATCGG TACTTGAAGG CTATCCCGCA TGGAAAGCCG GAATCAGGAC TGGCGACAGG
ATTGTCCGGA TCAACGGCAA TTTCGTTACA GGAAAGAATC TCGATGAGAT CAGGGCGATG
ATGAAAGGGG GAACCGGGAC GCCGCTTATG ATGAAAATCG AGCGGGAAGG AGGTCGGGAT
CCAGGGATCA TTACCCTTTC GAGAGAGGAA GTCAGGGCCG GAACGGTGCC CTATTCCGGT
ATCATCGGAC AAACCGGCTA TCTGGAGATC AGCAGCTTTT CAAGTCATTC AACTGAAGAT
ATCCGTCTGG CAGTTGAAAA ACTGCTTCGT CAATCCGCAG AGAGCCGACA GCCGATGAAC
GGTCTGATTA TCGATCTTCG CGGTAATCCC GGCGGTCTGC TTTCGGCTGC GGTTGAAATA
TCTTCTCTGT TTATGGAGAA AGGCAGTACG GTCGTCACCA TAAGGGGGCG ATCTCCGGAA
TCAGAGAAAA TCTATAAAAC GGAACAGCTC CCCATCGCCG AAGCGTTTCC GATTGCCGTA
CTGATCAACC GGGAGAGCGC ATCCGCATCC GAAATAATAT CGGGGGCCGT TCAGGATCTG
GATCGCGGAG TCGTTATCGG AGAACGCTCA TACGGGAAGG GTCTGGTACA GTCCGTTATA
CGACTGCCCT ACGACAATAC CCTGAAAGTC ACGACGGCAA AATACTATAC CCCTTCCGGT
CGTTTGATCC AGAAGCCGCA TGCCGATAGC GGTACGGCAA GAAATGTCCT GATGAAAAAC
GATGACCGCA AGGCTCTGCC GGTCTATTAC ACGGCAGGAA AACGTAAGGT ATACGGCGGC
GGTGGTATTG CACCCGATAT GACTGTGGCG GATATTTCGC GATCAGAATA CGAACAGGAG
CTTCGACGCA GGGGTATGAT TTTTTTGTTT GCTGCCCGGT ACCGGGCTTT GCACCCCGAT
GCTGTTCGGC AACCGCTCGA CCGCGCAGTG CTTATTGACG AATTCGCGTT CTTTCTCCGT
CAGCAGGGCT TTTCATTCAC TTCAGCTCCG GAACGGCATC TCAAGGAACT TGAAGAGAGT
ATAGCTGAAG AGCAGGGGGA TAAAAAAGCA GCAGGGCCCG AAAGTATTCC GGGATTGAAA
CAGGAACTCG CAAGAATGAA ACAGAAGCGC GTCGACGGAG AGTCGGAGCG GATTGCCCGG
CTGCTTGAAC TTGAAATCAT GCGCCATGGC GACGAAAATG CATCGCGCAG GGCAGCGCTC
GGCGACGATC CTGTTGTGCA GAAAGCCCTG GCTCTGCTTG CCGACCCGAA AGCCTATTCA
AGGCAGCTCA AGCCCTGA
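
The gene sequence is listed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a rough sketch of how the listing could be normalized and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first line of the listing, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used only as an example input.
first_line = "ATGAAACCAG TATCATTCCG CAGGCATGTT GTCGCAGGAC TGACAGCGGC ACTCCTTGCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over the full listing would give a figure to compare against the GC content field of the record.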
 
Protein sequence
MKPVSFRRHV VAGLTAALLA LSLPGSHLQA VPKENPGQAY FEIVKGIDLL GEVYRRLSEN 
YVEPIDAVKL MYAAIDGMLA VLDPYTVFLD ESQSEELGEM TSGQYTGIGL NMSRFVEKVY
ITSVLEGYPA WKAGIRTGDR IVRINGNFVT GKNLDEIRAM MKGGTGTPLM MKIEREGGRD
PGIITLSREE VRAGTVPYSG IIGQTGYLEI SSFSSHSTED IRLAVEKLLR QSAESRQPMN
GLIIDLRGNP GGLLSAAVEI SSLFMEKGST VVTIRGRSPE SEKIYKTEQL PIAEAFPIAV
LINRESASAS EIISGAVQDL DRGVVIGERS YGKGLVQSVI RLPYDNTLKV TTAKYYTPSG
RLIQKPHADS GTARNVLMKN DDRKALPVYY TAGKRKVYGG GGIAPDMTVA DISRSEYEQE
LRRRGMIFLF AARYRALHPD AVRQPLDRAV LIDEFAFFLR QQGFSFTSAP ERHLKELEES
IAEEQGDKKA AGPESIPGLK QELARMKQKR VDGESERIAR LLELEIMRHG DENASRRAAL
GDDPVVQKAL ALLADPKAYS RQLKP
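
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation in plain Python. It assumes the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two differ in which codons may act as start codons, which this sketch does not model) and that the input contains only A, C, G and T. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing.
first_line = "ATGAAACCAG TATCATTCCG CAGGCATGTT GTCGCAGGAC TGACAGCGGC ACTCCTTGCT"
print(translate(first_line))  # MKPVSFRRHVVAGLTAALLA
```

The output matches the first twenty residues of the protein sequence listed above.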