Gene Plut_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_0014 
Symbol 
ID3745720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp18792 
End bp20486 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content61% 
IMG OID637768042 
Productpeptidase S41A, C-terminal protease 
Protein accessionYP_373949 
Protein GI78185906 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.366239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.216554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACGG CGCCCTTTGG TTCCCTGAAG GCAGTTGCGT TCATTGCAGC AGGCCTTTTC 
GCTCTGGGGG CCGCTGCCCC CGGTGCGGCA TCCACGCGTG ACGACCGGTA CTTTGAAACT
GCCCGCAGCA TTGATCTCAT GGGCGATGTC ACCCGTCAGG TTGCAGAGAG TTATGTCGAC
ACGGTCGACA TCAGCCGGAT GCTGTATTCC GGCATTGACG GCATGCTCGA AACCCTTGAC
CCCTACTCAG TCTTTCTCGA CAGCGAAGAG TCCCGGGAAC TCGGGGACCT CACCAGCAAC
CAGTATGCCG GCATCGGCGT CACCATCGCC GCTCTGGACG GCAGCATCTA TGTCACTTCG
GTCGAGAAGG GCTGGCCGGC CGAAACGGCC GGCCTAAGGA CGGGAGACCG TCTCACCGCC
ATCAACGGCG TCCTCCTAGC CGGCAAATCC CTTGATGCCG TGCGCGAGCT GATCCGGGGT
AATGTCGGCT CACCCGTCAC CCTGCGGGTT CAGCGGCATG GAACTGAACC CTTCACATGC
CGGCTCGTTC GCGAAGAGGT TCGGCTGAGC ACTGTCGGGC ATGCAGCTTT CCTCGACGGG
AATGGAGGGA TTGCCTACAT TTCACTGACC AGTTTTGCCG ACCGGTCAGG TGTGGATCTC
AAAAGCGCGC TCGGACAGCT GCAGCGGGAG GCGGTGAGCC GCGCCATCCC GATGCAGGGC
ATCATCCTCG ATCTCAGGGA AAACCCCGGA GGCCTGCTTG ACGCCGCAGT CGACGTTACC
GGATCGTTTG TTGCAAAAGG AAGCCCGGTG GTATCCATCC GCGGACGTTC CGCCGCGGCA
TCCCGTTCCT ATGCAACCTC CGGCCCGCCT TTCGATGCAT CCCTTCCCCT GGCAGTTCTC
ATCAATGCCC GCAGTGCCTC GGCAGCTGAA ATCGTCGCAG GAGCTGTGCA GGATCTCGAC
CGCGGCGTCA TAATCGGTGA GCGTTCCTTC GGCAAAGGGC TCGTGCAGTC GGTCATCCGC
CTGCCCTATG AAAATGCCCT CAAGCTGACC ACCGCCAAGT ACTATACCCC GTCGGGCCGC
CTCATCCAGA AAGAGGCTGG AGATGCGCAC GGCTCACGTG ATGTCCTGCC CAGAAAAAAA
GCCGGGGAAG GGTCGGCTGA GGTGTTCAGA ACCGCAAGCA ACCGGAAGGT GTTCGGTGGT
GGCGGAATCC TTCCCGACAT CATTGCCACA GATCCCGAAC CGTCCCCATA CCTTGAATCG
CTTGAGAGGA AAGGACTGCT GTTCCTGCAT GCAGCTCTGT GGCGTGCATC GCATCCCCAG
CCGCCGGCCC CTGCCGACAG TGCGGCGCTT GTTCAGAGCT TCAGCGGGTT CCTTGAGACG
CAGGAGTTCA CGTACCGTTC TCTCGCCGAC AGGAAGCTGG AGGAACTGAA GAAGCTTCTT
GCCGGGGAAA AAACGGATGC AGCCAAACAG GCGATGGCAG TGTACCGTAC AATGGAGGTT
GAGATTGCTG CCGGGCGGGA GCGCGAGATC GGCCGTGCGT CAGGAGAGGT TTACACTGCC
CTTAGGGAGG AGGTTCTTCG CCATTACGAC GAACGGCTTG CCGCCATGGA GGGGCTCCGT
CACGATCCCG TGGTGCGGAA GGCCAGCGAG ATCATCGCCA GGTCGCGCAC CTACCGCAAA
CTGCTCAGCC GCTGA
 
Protein sequence
MGTAPFGSLK AVAFIAAGLF ALGAAAPGAA STRDDRYFET ARSIDLMGDV TRQVAESYVD 
TVDISRMLYS GIDGMLETLD PYSVFLDSEE SRELGDLTSN QYAGIGVTIA ALDGSIYVTS
VEKGWPAETA GLRTGDRLTA INGVLLAGKS LDAVRELIRG NVGSPVTLRV QRHGTEPFTC
RLVREEVRLS TVGHAAFLDG NGGIAYISLT SFADRSGVDL KSALGQLQRE AVSRAIPMQG
IILDLRENPG GLLDAAVDVT GSFVAKGSPV VSIRGRSAAA SRSYATSGPP FDASLPLAVL
INARSASAAE IVAGAVQDLD RGVIIGERSF GKGLVQSVIR LPYENALKLT TAKYYTPSGR
LIQKEAGDAH GSRDVLPRKK AGEGSAEVFR TASNRKVFGG GGILPDIIAT DPEPSPYLES
LERKGLLFLH AALWRASHPQ PPAPADSAAL VQSFSGFLET QEFTYRSLAD RKLEELKKLL
AGEKTDAAKQ AMAVYRTMEV EIAAGREREI GRASGEVYTA LREEVLRHYD ERLAAMEGLR
HDPVVRKASE IIARSRTYRK LLSR