Gene Information |
Locus tag | Plut_0843 |
Symbol | |
ID | 3745660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 962084 |
End bp | 963532 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637768880 |
Product | Sel1 repeat-containing protein |
Protein accession | YP_374752 |
Protein GI | 78186709 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [R] General function prediction only |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain; [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
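The coordinate and length fields above are mutually consistent, and the check is simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Plut_0843 record.
length = gene_length_bp(962084, 963532)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record the two results are 1449 bp and 482 aa, matching the Gene Length and Protein Length fields.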
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.502416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGAA GCACCCCCAT GAAGAAGAGA ACGGCAGCAT TGCTCATTGT CTCACAAGCC GTGCATTCGG GCGGCGGAGT ATTCTGGATC CTTTTCGCCA TTCTCTTTTC CTGGCCGCTC TCCTCCCTTG CCGCAGAGAG CAAAGCGGCG GAGGTCTTTC AGCGGGTATC GAAAAGCGTG GTCGTCGTCC ATGCCCGTAA CGACAGCGGG GATGTTCTGG TGATGGGCAG CGGCGTGGTG TTGCCCGGAG GCACAGTCGC GACCTGCCAC CATGTCGTGA GTGACGCCAG AAAGATCACG GTGGATTACG CGGGCGGGGA GTATCCCGCC GTTGTCCGGT ATTCCGATAT CGAGCGGGAT ATCTGTTCCC TCACGGTTCA GGGGCTGAAA GCTCCGGCGG TGTCCATGGG GAGCATGGCA GCGCTTGCTG TCGGAGAGCG GGTCTATGCC GTGGGGACGC CGCAGGGTTA TCCGTTGACG CTTTCGGAAG GGATCATCTC CGGTTTCAGG GAGGTAAGAG GGCATCGCTA CATGCAGACG ACGGCCCCGA TCTCTTCGGG ATCAAGCGGC GGCGGCCTGT TCGATGAAGA GGGCCTGCTG GTCGGCATGA CGACCTTTTT CCTGAAGGAT GCCCAGCAGA TGAATTTCGC CGCCCCTGTC GAGTGGCTTG CCGCTTTGCC GGAGGGTCGT GCCGGGAGAG TCGAAAATAA AAGGAAGGAT GAAATCCGGG GGATACGGGC CCGTGCCGAA CTGGGTGATG CTGACGCTCA GTATACACTG GGCGCCTGTT ACTCGGAGGG GGACGGAGTC CGAAAGGATC CCGCTGAAGC TGTAAGGTGG TACCGGCTGG CTGCCCGCCA GGGCAATGCC GATGCACGGA ACAGTCTCGG CTGGGCGTAC CGGGAAGGCA ATGGCGTGAA GCGGGATTAT GATCGGGCCC TGCTGTTGTT CCGTATGGCT GCTGAGCAGA ATGAGCAGTA TGCACAGAAT AATCTCGGCC TGATGTATAT GAACGGCGAG GGTGTGAAGC AGGACAATGC CGAGGCGTTT AAATGGTTCT GCATGTCTGC CGCCCAGGGC AACGGCTACG GCCGGTGCAA CATCGGAGAG ATGTATGTGA AGGGCCAGGT TGTGGAGCAG AATTATGAAG AGGCCATGAA GTGGTTCCGT CTGGCTGCCG AGAAGGATGG CAATGATGCT GCGTACTGGA TAGGCTGGCT CTATGAAGAG GGGAAGGGGG TACCGGCGGA TCCCGATGAA GCTGCCCGAT GGTATCGAAT TGCGGCGGGA AGTACCGATC CCAATGGCCT GCTCTCAATC GGGGAGATGT ATGAAAAAGG CCTCGGGGTT CCGGGCAGCA TATCGAACGC CGAAAAATGG TACAGGAAGG CTTGCCGTGC GGGTGAAAAA GATGCATGCG AAAGGCTGAA ACGACTGGCC GGGAAATAG
|
Protein sequence | MQRSTPMKKR TAALLIVSQA VHSGGGVFWI LFAILFSWPL SSLAAESKAA EVFQRVSKSV VVVHARNDSG DVLVMGSGVV LPGGTVATCH HVVSDARKIT VDYAGGEYPA VVRYSDIERD ICSLTVQGLK APAVSMGSMA ALAVGERVYA VGTPQGYPLT LSEGIISGFR EVRGHRYMQT TAPISSGSSG GGLFDEEGLL VGMTTFFLKD AQQMNFAAPV EWLAALPEGR AGRVENKRKD EIRGIRARAE LGDADAQYTL GACYSEGDGV RKDPAEAVRW YRLAARQGNA DARNSLGWAY REGNGVKRDY DRALLLFRMA AEQNEQYAQN NLGLMYMNGE GVKQDNAEAF KWFCMSAAQG NGYGRCNIGE MYVKGQVVEQ NYEEAMKWFR LAAEKDGNDA AYWIGWLYEE GKGVPADPDE AARWYRIAAG STDPNGLLSI GEMYEKGLGV PGSISNAEKW YRKACRAGEK DACERLKRLA GK
|
| |
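The GC content and protein sequence fields can be re-derived from the gene sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted as shown above, in space-separated blocks of 10 bases, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
fragment = "ATGCAAAGAA GCACCCCCAT GAAGAAGAGA"
print(translate(fragment))  # MQRSTPMKKR
```

The fragment translates to the first ten residues of the protein sequence listed above. Applying `gc_content` to the full gene sequence should land near the 59% reported in the record, assuming the database rounds to the nearest percent.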