Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0709 |
Symbol | |
ID | 4569883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 808188 |
End bp | 809585 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639765307 |
Product | TPR repeat-containing protein |
Protein accession | YP_911188 |
Protein GI | 119356544 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTGT CTGATTTTTT TGATGAAAAC CGTCATGAAT CATCCGGTTT ATCACAAAAA GAGCCGCCTG ATCTTGACGA TCTTGAAAGC ATGTATGATG CTGAGGAGCT GATTGATCTT ATCAGCCAGC TCAATGAAGA CGGATTTATC CAGGAGGCCC TTGCCGTTGC GCAAAGGCTT GAAGCCGTTT CGCCATACAA TGCCGAGACC TGGTTTCATC TTGGTAACTG TCTCACCGTT AACGGGTATT TCAACGATGC CCTTGAGGCA TTCAATCAGG CAGTGCTGCT CAGTCCCGCT GACAGCGAAA TGCGCCTCAA CTACGCTCTT GCGCACTTCA ATACGGGATC TCTTGACGAA GCTCTCGAAA TCCTTGAGGA TATGTATGTT GACTCATCGA TTGAACGGGA GTACTCCTAC TACCGGGGTA TCATTCTGCA GCGGCTTGAA CGGTTCACTG AATCAGAAAA AGATTTTGAA CACTGCCTCG AACTCGATCC TGATTTTTCT GACGCATGGT ATGAACTTGC CTACGGAAAA GATCTCCTGG GAAAACTCGA AGAGAGCACG GCCTGTTACA ACAAAGCACT GGACCACGAT CCCTACAATA TCAATGCATG GTACAACAAC GGTCTGGTAC TCAGTAAACT GAAACGATAC GATGAAGCGC TTCAGTGCTA CGATATGTCT CTCGCTCTTG CCGATGATTT CAGTTCCGCA TGGTATAACC GGGCCAATGT TCTTGCCATA ACGGGAAAAA TCGAAGAGGC GGCAGAAAGC TATGTGAAAA CTCTTGAGTT TGAACCTGAC GACCTCAATG CCCTGTACAA TCTTGGTATT GCCTACGAAG AACTTGAGGA GTACAGTGAA GCTATTCTCT GCTATCGACG CTGCATCGAG CTCAATAACG ATTTCCATGA TGCATGGTTT GCGCTTGCAT GCTGTTATGA AGCCATCGAA CAGTATAATG AGGGATCACT TGCCATTATT GAAGCTCTGA AGGCAATCCC TGACAGCATC GAGTTTCTGC TGCTTAAAGC TGAAATAGAG TATAATCTCA ATGAGCTCGA ACACTCTCTT GAAACCTATC GACATATCAT CACCCTTGAC CCTGAAAGTC CGCAGATATG GGTTGATTAT GCCATGGTCC TTCGCGAAGC AGGCTATAAC AACGAATCCA TTGAGGCCCT TCATCAGTCG CTGAAACTTC AGCCGCATTC GGCTGATGCC CATTTCGAGA TTGCTGCCGC CTATTTTGCC ATGGGGGACA AACTCAGCAC CCTGAAAGCG CTGAGCAAGG CATTCAAAAT CGACCCTGAT AAAAAACAAC TTTTTCAAAG CACCTTCCCG GAACTTTATC AGCAGGATTC CGTTAGAAAA ATGCTTGGCA TTTCCTGA
|
Protein sequence | MSLSDFFDEN RHESSGLSQK EPPDLDDLES MYDAEELIDL ISQLNEDGFI QEALAVAQRL EAVSPYNAET WFHLGNCLTV NGYFNDALEA FNQAVLLSPA DSEMRLNYAL AHFNTGSLDE ALEILEDMYV DSSIEREYSY YRGIILQRLE RFTESEKDFE HCLELDPDFS DAWYELAYGK DLLGKLEEST ACYNKALDHD PYNINAWYNN GLVLSKLKRY DEALQCYDMS LALADDFSSA WYNRANVLAI TGKIEEAAES YVKTLEFEPD DLNALYNLGI AYEELEEYSE AILCYRRCIE LNNDFHDAWF ALACCYEAIE QYNEGSLAII EALKAIPDSI EFLLLKAEIE YNLNELEHSL ETYRHIITLD PESPQIWVDY AMVLREAGYN NESIEALHQS LKLQPHSADA HFEIAAAYFA MGDKLSTLKA LSKAFKIDPD KKQLFQSTFP ELYQQDSVRK MLGIS
|
| |