Gene Information |
Locus tag | Plut_0014 |
Symbol | |
ID | 3745720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 18792 |
End bp | 20486 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637768042 |
Product | peptidase S41A, C-terminal protease |
Protein accession | YP_373949 |
Protein GI | 78185906 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
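The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return cds_length_bp // 3 - 1


length = gene_length_bp(18792, 20486)
print(length)                     # 1695
print(protein_length_aa(length))  # 564
```

Both values match the Gene Length and Protein Length rows reported for Plut_0014.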
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.366239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.216554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAACGG CGCCCTTTGG TTCCCTGAAG GCAGTTGCGT TCATTGCAGC AGGCCTTTTC GCTCTGGGGG CCGCTGCCCC CGGTGCGGCA TCCACGCGTG ACGACCGGTA CTTTGAAACT GCCCGCAGCA TTGATCTCAT GGGCGATGTC ACCCGTCAGG TTGCAGAGAG TTATGTCGAC ACGGTCGACA TCAGCCGGAT GCTGTATTCC GGCATTGACG GCATGCTCGA AACCCTTGAC CCCTACTCAG TCTTTCTCGA CAGCGAAGAG TCCCGGGAAC TCGGGGACCT CACCAGCAAC CAGTATGCCG GCATCGGCGT CACCATCGCC GCTCTGGACG GCAGCATCTA TGTCACTTCG GTCGAGAAGG GCTGGCCGGC CGAAACGGCC GGCCTAAGGA CGGGAGACCG TCTCACCGCC ATCAACGGCG TCCTCCTAGC CGGCAAATCC CTTGATGCCG TGCGCGAGCT GATCCGGGGT AATGTCGGCT CACCCGTCAC CCTGCGGGTT CAGCGGCATG GAACTGAACC CTTCACATGC CGGCTCGTTC GCGAAGAGGT TCGGCTGAGC ACTGTCGGGC ATGCAGCTTT CCTCGACGGG AATGGAGGGA TTGCCTACAT TTCACTGACC AGTTTTGCCG ACCGGTCAGG TGTGGATCTC AAAAGCGCGC TCGGACAGCT GCAGCGGGAG GCGGTGAGCC GCGCCATCCC GATGCAGGGC ATCATCCTCG ATCTCAGGGA AAACCCCGGA GGCCTGCTTG ACGCCGCAGT CGACGTTACC GGATCGTTTG TTGCAAAAGG AAGCCCGGTG GTATCCATCC GCGGACGTTC CGCCGCGGCA TCCCGTTCCT ATGCAACCTC CGGCCCGCCT TTCGATGCAT CCCTTCCCCT GGCAGTTCTC ATCAATGCCC GCAGTGCCTC GGCAGCTGAA ATCGTCGCAG GAGCTGTGCA GGATCTCGAC CGCGGCGTCA TAATCGGTGA GCGTTCCTTC GGCAAAGGGC TCGTGCAGTC GGTCATCCGC CTGCCCTATG AAAATGCCCT CAAGCTGACC ACCGCCAAGT ACTATACCCC GTCGGGCCGC CTCATCCAGA AAGAGGCTGG AGATGCGCAC GGCTCACGTG ATGTCCTGCC CAGAAAAAAA GCCGGGGAAG GGTCGGCTGA GGTGTTCAGA ACCGCAAGCA ACCGGAAGGT GTTCGGTGGT GGCGGAATCC TTCCCGACAT CATTGCCACA GATCCCGAAC CGTCCCCATA CCTTGAATCG CTTGAGAGGA AAGGACTGCT GTTCCTGCAT GCAGCTCTGT GGCGTGCATC GCATCCCCAG CCGCCGGCCC CTGCCGACAG TGCGGCGCTT GTTCAGAGCT TCAGCGGGTT CCTTGAGACG CAGGAGTTCA CGTACCGTTC TCTCGCCGAC AGGAAGCTGG AGGAACTGAA GAAGCTTCTT GCCGGGGAAA AAACGGATGC AGCCAAACAG GCGATGGCAG TGTACCGTAC AATGGAGGTT GAGATTGCTG CCGGGCGGGA GCGCGAGATC GGCCGTGCGT CAGGAGAGGT TTACACTGCC CTTAGGGAGG AGGTTCTTCG CCATTACGAC GAACGGCTTG CCGCCATGGA GGGGCTCCGT CACGATCCCG TGGTGCGGAA GGCCAGCGAG ATCATCGCCA GGTCGCGCAC CTACCGCAAA CTGCTCAGCC GCTGA
|
Protein sequence | MGTAPFGSLK AVAFIAAGLF ALGAAAPGAA STRDDRYFET ARSIDLMGDV TRQVAESYVD TVDISRMLYS GIDGMLETLD PYSVFLDSEE SRELGDLTSN QYAGIGVTIA ALDGSIYVTS VEKGWPAETA GLRTGDRLTA INGVLLAGKS LDAVRELIRG NVGSPVTLRV QRHGTEPFTC RLVREEVRLS TVGHAAFLDG NGGIAYISLT SFADRSGVDL KSALGQLQRE AVSRAIPMQG IILDLRENPG GLLDAAVDVT GSFVAKGSPV VSIRGRSAAA SRSYATSGPP FDASLPLAVL INARSASAAE IVAGAVQDLD RGVIIGERSF GKGLVQSVIR LPYENALKLT TAKYYTPSGR LIQKEAGDAH GSRDVLPRKK AGEGSAEVFR TASNRKVFGG GGILPDIIAT DPEPSPYLES LERKGLLFLH AALWRASHPQ PPAPADSAAL VQSFSGFLET QEFTYRSLAD RKLEELKKLL AGEKTDAAKQ AMAVYRTMEV EIAAGREREI GRASGEVYTA LREEVLRHYD ERLAAMEGLR HDPVVRKASE IIARSRTYRK LLSR
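The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It is not the database's own method. It assumes the usual codon assignments, which translation table 11 shares with the standard code for internal codons, and it does not handle the alternative start codons that table 11 permits (this record starts with ATG). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and last two blocks of the gene sequence in this record.
print(translate("ATGGGAACGG CGCCCTTTGG"))  # MGTAPF
print(translate("CTGCTCAGCC GCTGA"))       # LLSR
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.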