Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_0091 |
Symbol | |
ID | 3744862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 96193 |
End bp | 97104 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637768119 |
Product | peptidase C1A, papain |
Protein accession | YP_374024 |
Protein GI | 78185981 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCCGA AGCAGCAGAT GCTACCTGCA GCAGGTCACC CGAAACCGGT CGGTACCGGG TGGCTCCCCC CGATGCCCGA CCTTCGCGAC TACACCCCGT CCCATCCTGA CATTGCCGGT ATGGTCCGGA AGCTCGGCGT TGCAGAAGGC TCGGGCAATC TTCCTTCTTC CATGGACCTC AGGCAATGGT GCCCCCCGGT CGAAAACCAG GGCGGCATAG GCGCCTGTAC GGCGCAGGCC GCGGCAGGTA TGGTCGAGTA CTACGAGCGC CGTGCGTTCG GCCGCCATAT CGACTGCTCC CGTCTCTTCA TCTACAAGAC GACCCGTAAT CTCATGGGTG TCGTGGGTGA CGGCGGAGCC TGGCTGCGAA ACACCATGGG TGCCCTTGCG CTGTGTGGTG CACCACCTGA AAAGTACTGG CCATACAGCG ACAATGATAC CGACTATGAT CTCGAGCCAA CGGCGTTCGT TTACGCTCTG GCCGACAACT TCGAGGCGCT TCGTTACTGC TGCCATGATC CCATGGGGGC GGGTATGGAG CCGGCGACGG TGCTTGGCGG AGTGAAGCGC TTTCTCAGCG CCGGGGTGCC CTCGGCTTTC GGGTTTTTCG GTTTCCCCTC GTTCGACGAG GGCGCCGCTC CGGGCGATAT CCCGATGCCG TGCGCCGATG AACAGGCGGA GTGGGGGCAT GCCGTGCTTG CCGTAGGGTA TGACGACAGC CGTGAAGTTG GCAACAAGCG GTGCGGGACC GCTTCAAAGG GAGCTCTGCT CGTCAGAAAC TCCTGGGGCA GGGAATGGGG GGAGGACGGA TACGGATGGA TCCCCTACGG GTATGTCACG CAGGGATTGG CGATGGATTT CTGGTCGCTC TTCAGCATGG GCTGGATCGA TACCGGCCAG TTCGGCTCAT GA
|
Protein sequence | MYPKQQMLPA AGHPKPVGTG WLPPMPDLRD YTPSHPDIAG MVRKLGVAEG SGNLPSSMDL RQWCPPVENQ GGIGACTAQA AAGMVEYYER RAFGRHIDCS RLFIYKTTRN LMGVVGDGGA WLRNTMGALA LCGAPPEKYW PYSDNDTDYD LEPTAFVYAL ADNFEALRYC CHDPMGAGME PATVLGGVKR FLSAGVPSAF GFFGFPSFDE GAAPGDIPMP CADEQAEWGH AVLAVGYDDS REVGNKRCGT ASKGALLVRN SWGREWGEDG YGWIPYGYVT QGLAMDFWSL FSMGWIDTGQ FGS
|
| |