Gene Information |
Locus tag | Plut_1631 |
Symbol | |
ID | 3746137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | - |
Start bp | 1827687 |
End bp | 1829438 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637769664 |
Product | protease IV |
Protein accession | YP_375528 |
Protein GI | 78187485 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type; [TIGR00706] signal peptide peptidase SppA, 36K type |
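The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1827687, 1829438)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the results agree with the Gene Length (1752 bp) and Protein Length (583 aa) fields. Strand does not affect the lengths, only the direction in which the sequence is read.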
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.42131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCTCC TGCCGCTCGC CGGCCTTGCC CTCATTCTGT GGCTGAACCA CGGGGGCCGT TCCCTGCCTG ATCGCTTCGT GCTTTCAGTG CCCCTCAGCG GATCCCTTGA TGAACGGGCG CCATCGGCTT CCGGGCTGCC TTTTTCCTCA GCTGAAGGTC CTCTTTCCCT GCAGGACCTG CTTTTCACCC TCCACCATGC ATCCTCTGAC CCGAGGGTGG ATGCCGTGCT TCTCGACATC GATGGAGTGC GTACGACGCC GTCTAAGATT TCGGAGCTCC GCCGTGCAAT CGAGCGGACG CGCGCAAGCG GAAAACGTGT TATTGCATTC CTCCATAGTC CGGAGGACAG CGACTGCATG CTCGGCGCAG CATGCGATTC CGTCATTGTC GAGGAGGGGG GATTCATGCT GCTCGACGGT CTGCGTGCCG AGACCCTTTA CTTTGCGACC CCGCTCAGGA AAATCGGTGT GTCGTTCCAG GCCGCGCAGT GGAAGCGGTA CAAGAGCGGC ATCGAGCCGT TCGTACGCAC CGGACCGAGC CCCGAGGCAG AGGAAGAGGT GTCGGTGCTG CTCGACGAGG TCTATCGGGA TTATATCGGC TATGTATCCC GGCGGCGCCA TCTCAGCCCC GATTCCCTCC GCTCGATCAT TGACAATGTG ACGCTCATGA CCTCTCCGGA GGCCGTGCGG CTGGGTCTTG CCGACGGTGT TGCTTCTTCC TGGCGTTTCC ATCGCGAGCT GGAGCGGCGC TTGACCGGTA AAGAGCCTGA TCCTGAAAGC GGATTCTTTG TCGGCGCGGA CCGGTACCGG GACTCGATGG AGTGGCCCAT GAAGGCAGAC ACGAAAGACC GGATCGCCCT CATAACCCTG TCCGGCCCCA TTGTACGCAC GACGGGTGAG GAGGCGCTAG GTCTTGGTTC CGGAGTGGAC GTTGCGGCGG TGCGGCGCTC GATTGAAGGA GCGCTCAAGG ACCGGCGGGT GAAGGCCATG GTGCTGCGCA TCGACAGCCC CGGCGGGGAC GCTCTGGCTT CGGCCGAGAT GCTTGAAATG CTCGATTCGG CCGCCGTTTG CAAACCCCTC GTGGTCTCCA TGTCAGGGGT TGCCGCTTCC GGAGGTTACA TGGCCGCTCT CTCGGGGCGT TCCATTTATG CTGAACCGCT CTCCATCACC GGCTCAATAG GCGTCTACGC CCTGAAGCCC GAAATCAGCG GTCTCGTACA GAAGATCGGT CTCGGTCGCA GCATCGTCAC CAGAGGCCGC AATGCCGATG CGAACTCAAT CTATAAACCG CTTGACGGGG AGGCGTACCG GAAGTTCGTG GAGGCTTCGG GCGAGGTGTA CCGGGATTTC GTGGGGAAGG TTGCCCGGGC CCGGAAGATG AGTCCCGGCC GTGTCGATTC GCTTGCCGGC GGCCGTGTGT GGACTGGACG CCGGGCGCTT GAAGTCGGCC TCGTCGACCG CTCCGGAGGA CTGTTCGATG CACTGGGTGA GGCTCAGCGG CTCGGGGGCA TCGACAGTAC CCGCCAGCCG GAAATCGTCT GTTATCCCAG AGAGAAGAGC TGGCTGGAGC TGCTTGTCCG GGGCGACTTC TCCGCAATTG GCTCAAGGGT GGAAAGCGCC CTTGCAGGGC GTGTCATCGG CAGGCTCATG CCAGGCATAG GACCGGTGCC GGCTGCCTTG TATCCACTGT TGTCGGGAGA TCATGGCGGT GAGGTGCTGA CGGTGATGCC CTGTGACATC ATTATCCGCT GA
|
Protein sequence | MVLLPLAGLA LILWLNHGGR SLPDRFVLSV PLSGSLDERA PSASGLPFSS AEGPLSLQDL LFTLHHASSD PRVDAVLLDI DGVRTTPSKI SELRRAIERT RASGKRVIAF LHSPEDSDCM LGAACDSVIV EEGGFMLLDG LRAETLYFAT PLRKIGVSFQ AAQWKRYKSG IEPFVRTGPS PEAEEEVSVL LDEVYRDYIG YVSRRRHLSP DSLRSIIDNV TLMTSPEAVR LGLADGVASS WRFHRELERR LTGKEPDPES GFFVGADRYR DSMEWPMKAD TKDRIALITL SGPIVRTTGE EALGLGSGVD VAAVRRSIEG ALKDRRVKAM VLRIDSPGGD ALASAEMLEM LDSAAVCKPL VVSMSGVAAS GGYMAALSGR SIYAEPLSIT GSIGVYALKP EISGLVQKIG LGRSIVTRGR NADANSIYKP LDGEAYRKFV EASGEVYRDF VGKVARARKM SPGRVDSLAG GRVWTGRRAL EVGLVDRSGG LFDALGEAQR LGGIDSTRQP EIVCYPREKS WLELLVRGDF SAIGSRVESA LAGRVIGRLM PGIGPVPAAL YPLLSGDHGG EVLTVMPCDI IIR
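The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are illustrative, and the short input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small demo.
demo = clean_sequence("ATGGTGCTCC TGCCGCTCGC")
print(len(demo), gc_fraction(demo))
```

Applied to the whole Gene sequence field, the same two steps should give a length and GC fraction that can be compared with the Gene Length and GC content fields.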