Gene Information |
Locus tag | Cpha266_0791 |
Symbol | |
ID | 4570210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 901876 |
End bp | 905067 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639765386 |
Product | TPR repeat-containing protein |
Protein accession | YP_911267 |
Protein GI | 119356623 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
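The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the terminal stop codon; the record states neither convention, but the listed values are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 901876
END_BP = 905067
GENE_LENGTH_BP = 3192
PROTEIN_LENGTH_AA = 1063


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))      # 3192
print(coding_length_aa(GENE_LENGTH_BP))   # 1063
```

Both results match the Gene Length and Protein Length fields, which is what one would expect for an unspliced, non-pseudo CDS.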
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0118647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCGG TTCCTCCATT GTTCCTGACC CCGGATTCCG CTATCATCGA GCGCTATCCA GATCTTCTTA GGCAATCCGC CGAACTCTCG AAGGCATATG CAGATCGTCA CAGTCGTATC TCCGATGCGG CGTTGCAGGC CCTGGGCAGT GCGCTCTGGC AGGTGCTTGA TGCGGACGAG AAGCTGCAAC ACGCAAAAAA GCAGGCCGGA ACAGGGATAC TGCCACTCGT CATCGCAAGC GATGATCCGG CAATCCTGCA ACTGCCATGG GAGACACTGT ACCATCCCGA TTACGGATTT CTTGCCCGGC ATGAGGGGTT CACACTCTCC CGCACCATCC CTTCGATCAA GACAGGAGTG TCGGATATCG AACCCGGCCC CCTGAGAATC CTGCTCTTTT CCTCTCTGCC CGATAACCTT GGAGAAAAGA ATCAGCTTGA GATCGAGGTT GAACAGGGGA GAGTGCTTGA AGCCCTTGGC CAGTGGCGGC AGAGCGGACA TGTAGTGCTT GAAATGCCTG ATGATGGTCG GTTTAGCGAG TTCAAGAGGA TACTCAAATC ATTCAAACCG CATCTTGTCT GGCTCAGCGG CCACGGTATA TTCCAGAGCG ACCCGCTGAA CCACCATGAC AAGGGGTACT TCCTCTTTGA AAACGAGCAT GGTGATGGCG GTGAACTGGT CGATGAAGAT CAGCTTGCAG AAGCATTCAC AGGCATTGAT TTGCAGGGCA TCATTCTCTC AGCCTGTCAG ACAGGCAAAG CCGACTCGGC CAACCTGAAC AACGGCCTGA TGTACAAGCT TGCATGGAAA GGCGTTCCCC ATGTCATCGG GATGCGTGAA TCCATTCTTG ATCGTGCAGG CGTGCAGTTT GCCCAGGCAT TCTTTGAAGC CCTGATCGAT AAAAAAGGTA TAGCCCTTGC CCTCCAGGAG GCCCGCCGGG CAATCATTCT CCCCCTGAAA GACGATGAAG AGGCCAGAAC GTCGCTTGAC GCTGAACTCT CGCTCGGGCA GTGGTGCCTG CCGATGCTCC TTAGCAGAGA GCATAACCGG CCGATCATCG ATTGGCATTT CACGCCGCAG CCGATGCGTG CGGCAAATCT TCTGAACGAA AGCCTCGACA GGATTACCCT GCCTGCACAG TTTATCGGAC GGCGTCGGGA GTTACGGAAA CTGCAGCGGG AGTTCCGGGA AGGGAAGACA AACGTGCTGC TCTTCACCGG AGCTGGAGGC ATGGGTAAAA CAGCATTTGC CGGTAAACTC ATCAACAACC TCAAGGCGGA TGGCTTTGAA ATATTCGGGT TCTCGGCCAG AGCGGAACAC GACTGGCGGG ATACCCTTTT TCAGATGAAA CTGATGCTCG ACAAGGAGCG TATCGAAAAA TACACGCTCA TCGAAAAACA GTATCCCGAT CCGGCAAAAC AGGCAGCAGG GCTTCTGAAA CTGGTGCTTG AGCAGTTTCA GCGGAAAGTG GCGATATTCT TCGATAACCT CGAATCCGTG CAGGATACGG TCTCCCGAAC CATCATCGAC CCTGAGCTGC TGATCTGGAT CAATGCGGCA GTTGGCCTGA AAAAGGAAGG TCTGAGAGTC CTCCTAACCT CGCGATGGGC ACTGCCGGAG TGGAAAGAGC CGGTTTATCC ACTTGGAAAA CCGGTCTACC GTGACTTCCT TGCCGTAGCT CAGCAGCAGA AACTGCCAAA GAGCTTTCTC AGGGAGTCGA AACGGTTACG AAAAGCCTAT GATGTACTGA ACGGCAACTA CCGGGCACTG GAGTTCTTTT CGGCTGCATT GCAGAATATG 
GATGCCGGTG AAGAAGAGGT ATTTCTTCAG CAGTTGCAAA AAGCAGAAGC TGAAATCCAG GTTGACATGG CGCTCGAAAA GGTGTGGCGT CACAGAACAG CGGAGGAACA GGAACTGCTC AGACGCATGA CTGCTTTTGA GGTGCCTGTA GCTCTGGAGG GAGTGCAGAA AATCGCCATG CTCGATCCAC AGCAACCTGT CGAAGCCATG GAAACACTGC TCTCGGTATC GCTGATCGAG CGGTACTATA ATCCTAAATG GAAAACCGAT GAGTTTCTGG TGTCATCCCT TGTGCGAAGC TGGCTGGAAA AGAGAGGTAT AGCCAAACCC GTACCTGAGC TGTTGCAGAA GGCTGCAACC TATCACGAGT GGCTTCTGAT GCACGAACGA AACACGCTCG ATCAGGCAAT CACTACCCAT ACGGCGCTCA TGAACGCAGG GATGGACGAA AAGGCACATC GCATAACGCT TGACCGGATT GTCGGGCCGA TGAATATGGC AGGGATGTAC CAGACCCTGC TGCAGACATG GCTGCTTCCA GCCTGTAACT CCGATGACCA GCAAACTCTG GCGGTAGCTT TGGGGGAAAC AGGACGTCAA TATCTCGATT TGTGTGAATA TGAGACGGCT CTGGAGTACC TGAACCGGTC GCTTTCGATA AGGCAGGAAA TCGGTCACAA GTTAAGAGAA GGCGCGACGT TGAATAATAT TTCTCAGATA TATAAAGTCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAACCGCTC TCTTGCGATA TGCCAGGAGA TCGGCGACAA ACGGGGTGAA GGCACCACGC TCAATAATAT TTCGCTGATA TATAGTGCCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAAGCGATC CCTTGCGATC AGGCAGGAGA TCGGCGACAA AAAGGGCGAA GGTGCCACAC TCAATAATAT TTCGCAGATA TATCATGCCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAACCGCTC CCTTGCGATA TGCCAGGAGA TCGGCGACAA AAAGGGCGAA GGCACCACGC TCAATAATAT TTCGCTGATA TATAAAGTCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAAACGCTC CCTTGAGATC AGACAGGAGA TCGGCGACAG TGCGGGGTTA TGTGCAACAC TGTTCAACAT GGGTCATATT CATGCACAAA ACGAAGATCT GCCGAATGCG GTATCATCGT GGGTAACGTC TTACAGGATA GCGAGCAACA TCAATCTGGC GGAAGGATTA CAGGCACTGG CAGCACTTGC GCCTCAGCTT GGGTTGCCTG AAGGACTTGA GGGGTGGGAG ATGCTGGCGA AGCAGATGGA TGAGCAACAG AAACAATCAT AA
Protein sequence | MNPVPPLFLT PDSAIIERYP DLLRQSAELS KAYADRHSRI SDAALQALGS ALWQVLDADE KLQHAKKQAG TGILPLVIAS DDPAILQLPW ETLYHPDYGF LARHEGFTLS RTIPSIKTGV SDIEPGPLRI LLFSSLPDNL GEKNQLEIEV EQGRVLEALG QWRQSGHVVL EMPDDGRFSE FKRILKSFKP HLVWLSGHGI FQSDPLNHHD KGYFLFENEH GDGGELVDED QLAEAFTGID LQGIILSACQ TGKADSANLN NGLMYKLAWK GVPHVIGMRE SILDRAGVQF AQAFFEALID KKGIALALQE ARRAIILPLK DDEEARTSLD AELSLGQWCL PMLLSREHNR PIIDWHFTPQ PMRAANLLNE SLDRITLPAQ FIGRRRELRK LQREFREGKT NVLLFTGAGG MGKTAFAGKL INNLKADGFE IFGFSARAEH DWRDTLFQMK LMLDKERIEK YTLIEKQYPD PAKQAAGLLK LVLEQFQRKV AIFFDNLESV QDTVSRTIID PELLIWINAA VGLKKEGLRV LLTSRWALPE WKEPVYPLGK PVYRDFLAVA QQQKLPKSFL RESKRLRKAY DVLNGNYRAL EFFSAALQNM DAGEEEVFLQ QLQKAEAEIQ VDMALEKVWR HRTAEEQELL RRMTAFEVPV ALEGVQKIAM LDPQQPVEAM ETLLSVSLIE RYYNPKWKTD EFLVSSLVRS WLEKRGIAKP VPELLQKAAT YHEWLLMHER NTLDQAITTH TALMNAGMDE KAHRITLDRI VGPMNMAGMY QTLLQTWLLP ACNSDDQQTL AVALGETGRQ YLDLCEYETA LEYLNRSLSI RQEIGHKLRE GATLNNISQI YKVRGEYDTA LEYLNRSLAI CQEIGDKRGE GTTLNNISLI YSARGEYDTA LEYLKRSLAI RQEIGDKKGE GATLNNISQI YHARGEYDTA LEYLNRSLAI CQEIGDKKGE GTTLNNISLI YKVRGEYDTA LEYLKRSLEI RQEIGDSAGL CATLFNMGHI HAQNEDLPNA VSSWVTSYRI ASNINLAEGL QALAALAPQL GLPEGLEGWE MLAKQMDEQQ KQS
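The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: `clean`, `gc_content`, and `translate` are hypothetical helper names, and the codon table is built by hand. Translation table 11 assigns the same amino acids to internal codons as the standard code (it differs in the set of permitted start codons), and this gene starts with ATG, so no special start-codon handling is included.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAACCCGG TTCCTCCATT GTTCCTGACC"))  # MNPVPPLFLT
print(translate("AAACAATCAT AA"))                     # KQS
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.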