Gene Information |
Locus tag | Plut_1467 |
Symbol | |
ID | 3745455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | - |
Start bp | 1651116 |
End bp | 1652615 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637769505 |
Product | peptidase S1C, Do |
Protein accession | YP_375369 |
Protein GI | 78187326 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
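The coordinate and length rows above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1651116, 1652615)
print(length_bp)                  # 1500
print(protein_length(length_bp))  # 499
```

Under these assumptions the computed values agree with the Gene Length (1500 bp) and Protein Length (499 aa) rows.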
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.441743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACCCC TGCTGCTGGT TCTTGCGGGG ATCGCCATCG GCGCAATGGT CTTCAGCAAC CTTGAATTCT CCTTTTCGGG CTTCGGTTCC AGTTCCGGAA AAGAGCTGTC GTTTTCGGGG GGGCCGCGTT ATGCTGAAGC GAAAAACACC ATTGAGGACT ACCCCGTGCA GTCCCTCAAG AGCTTCAACG AGGCCTTTGT GCAGATTGCA GAGTCTGCTA CGCCGTCGGT GGTGACTGTT TATACTGAAA AGACCGTCAG CCGACGCATG ATCTCACCGT TCGACTTTTT CGGCCATCAG TTCGATGACT TTTTCAGCTT CCCCGGAATG GATCGCAGTG AGGGTCGGAA GCAGATCCAG CATGGAATCG GTTCGGGCGT CATCGTGACG GATGATGGCT ACATCCTCAC CAACAACCAC GTTATCGACG GTGCCGATAC GGTCTTTATC CGAACCTCCG ACAACCGCCG GCTGGACGCC AGGGTTATCG GCACTGATCC CAAGACCGAC CTGGCCGTCA TAAAGGTGAC AGCCGGGAAC CTGAAGCCCA TTGCCCTCGG CAACAGCGAT CAGCTGCGGG TCGGTGAATG GGTCATCGCC ATCGGCAGTC CTCTCGGCGA AAACCTTGCC AGAACCGTCA CCCAGGGAAT TGTGAGCGCA AAGGGGCGGG CCAATGTCGG GTTGGCCGAC TATGAGGACT TCATCCAGAC CGATGCGGCC ATCAACCCCG GTAACTCCGG CGGACCTCTC GTCAACATCA ACGGGGAACT CGTCGGCATC AATACCGCCA TCGCCAGCCG GACGGGCGGG TTCGAGGGCA TCGGCTTTGC CGTCCCTTCG AACATGGCCA GGAGGGTCAT GACCGCCCTC ATCACCAACG GTAAGGTGAC CCGCGGCTAC CTCGGGGTCA GCATACAGGA TATCGATGAC AACCTCGCCA AAGCGATGCA GCTTGAGCGG GCGGATGGTG CGCTCGTCGG AACAGTTGTC GCTTCGAGCC CTGCGGCCGC GGCGGGAATC GCGACCGGAG ATGTGATCAC CACCTTCAAC GGTGCTCCGG TCAAAGGAAG CGTGGAGCTT CGCAACACCA TCGCCGGCCT GGCGCCGGGA ACCGAGGTAT CCCTCACGTT CCTGCGCGAG GGCCGGAAGC GCACCGTCCG CGTGCGCCTG GGTCAGCAGC CCGCACCCGA ACCGGTCGTT ACGGGTGCCC CCGGGCAGTC CAACCGTGCT CTCGGGTTCA CCGCTGCTCC GCTCACCCCG GAACAGGCCC GCAGGCTCGG CCCCGGGTCG GGCAAGGTCG TCATAACTTC TGTCGAACAG GCCGGCAACG CTTACCGGGC AGGTCTGCGC AAGGGTGACG TGATCCTTGC CGTCAACCGT AAGCCTGTGG AATCGTTCGC TGCATTCGGC ACCGCAGTCA GGAGCATCAA AGAGGGAGAA CTGCTGTTTC TTCTCGTCGA CCGCCAGGGG AACAAGATAT ATTTTGCCTT CAACCTGTAA
Protein sequence | MKPLLLVLAG IAIGAMVFSN LEFSFSGFGS SSGKELSFSG GPRYAEAKNT IEDYPVQSLK SFNEAFVQIA ESATPSVVTV YTEKTVSRRM ISPFDFFGHQ FDDFFSFPGM DRSEGRKQIQ HGIGSGVIVT DDGYILTNNH VIDGADTVFI RTSDNRRLDA RVIGTDPKTD LAVIKVTAGN LKPIALGNSD QLRVGEWVIA IGSPLGENLA RTVTQGIVSA KGRANVGLAD YEDFIQTDAA INPGNSGGPL VNINGELVGI NTAIASRTGG FEGIGFAVPS NMARRVMTAL ITNGKVTRGY LGVSIQDIDD NLAKAMQLER ADGALVGTVV ASSPAAAAGI ATGDVITTFN GAPVKGSVEL RNTIAGLAPG TEVSLTFLRE GRKRTVRVRL GQQPAPEPVV TGAPGQSNRA LGFTAAPLTP EQARRLGPGS GKVVITSVEQ AGNAYRAGLR KGDVILAVNR KPVESFAAFG TAVRSIKEGE LLFLLVDRQG NKIYFAFNL
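Both sequences above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a field might be normalised before analysis, and how a GC fraction of the kind reported in the GC content row could be computed. The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
def strip_blocks(seq_field: str) -> str:
    """Remove the display whitespace from a block-formatted sequence field."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
excerpt = strip_blocks("ATGAAACCCC TGCTGCTGGT TCTTGCGGGG")
print(len(excerpt))          # 30
print(gc_fraction(excerpt))  # 0.6 (18 of these 30 bases are G or C)
```

Passing the full Gene sequence field through `strip_blocks` and `gc_fraction` would give a value to compare against the GC content row. The figure printed here applies only to the 30-base excerpt.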