Gene Information |
Locus tag | Cpha266_0788 |
Symbol | |
ID | 4570207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 897813 |
End bp | 901118 |
Gene Length | 3306 bp |
Protein Length | 1101 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639765383 |
Product | TPR repeat-containing protein |
Protein accession | YP_911264 |
Protein GI | 119356620 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000957174 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATCAC CCCTGATCCT CATCCCGTCA GCAGACCTGC TCGACAACCA TCCGTATCTT TCCAGTCAGG CAAAAGAGCT TTCGCAAGCC TATGCTGAAA AAAAGGTCGT GACCGACAAC CAGTTGAAAC CAATCGGCAG TGCGCTCTGG TCGGCGCTTG GTGACGGGGT AGAGCTTCAG CAGGCGAAAC ACCAGGCCGG ACAGTCAATC CTGCCCATCG TCATCGAGAG CGACATCCCG GCAATCCTGC AACTGCCATG GGAGATGCTT TGGCATCCGG AATACGGGTT TCTTGCCCTG CACAAGGAGT TTACGCTTTC GCGCAGCAGC CCGGCCATCA AGGTGCACAT GCCGGATATC GAAACCGGAC CGCTGCGCAT CCTGCTTTTT TCATCGTTGC CCGACGATCT CGACGAAACC GATCAGTTGC AGATCGAGGA GGAGCAGGCT GGTGTGCTCG AAGCACTTGG CCCATGGTTG CAGAGTGGCC ATGTGGTGAT CGAAATGCCT GACGACGGAC GGTTCAGCCT GTTCGAAGAA CTACTGCACT CGTTCAGGCC TCACCTCGTC TGGCTCAGTG GTCACGGGGT ATTCAGTAAA GACCTGCTGA ACCATAACCA TAAAGGGTAT TTCCTGTTTG AGGACGAAGA GAGCGGCAAC GGGTCACTCG TTGACGAGAA TACCCTTGCC GGAGCATTCA GCGGCACCGC CGTGCAGGGG GTGATCCTCT CGGCCTGCCA GAGCGGCAAG GCAATTTCCT CCGACCTGAA CAACGGCCTG ATGTACGCGC TTGCGCAGAA GGGCATCCCC CATGTCATTG GAATGCGGGA GTCCATCTTT GACCGTGCTG GCGTTCAGTT CGCCAAAACT TTTTTTTCCG GTCTGCTGCA GAAGCGGGAG ATAGCACAGG CGCTGCAGCA AGCCCGTCAG GCCATCACGA TGCCAATGCG GGACGACGAG CATGCAAAAC GGTACCGCTA CGCCGACCTC TCGTTCGGTC AGTGGTGCCT GCCGATGCTG CTGAGCAGAG AGCATAACCG GTCGATCATC GATTGGCATT TCATGCCGCA GCCGATGGGT GCGGTGAACA GAAGGAATAG AAGTGTCAAG CAACTCTCAT TGCCGGAAAG ATTTATCGGG CGTCGTCGGG AATTGCGAAA GATCCAGCAA AACTTCAGAA ACAATCAGGA AAAAGTGTTA TTGCTCATTG GTGCAGGAGG TATGGGCAAA ACAGCGTTTG CCGGCAAACT CCTTGATACC CTCAAGTCTG ATGGTTATGA GGTGTTTTAT ATTTCAATAC ATCCGAATCA CGATTGGCGA AAAACAATAT CATCGCGAAT ACCATTTTCA CTCGATGATA AGAGACGACC GGTATACGAT AATGAAATCA GCGATATTCA CGACATTGTC GATCGGGCAG AATGCCTTTT TGTTTTGTTA CTTGAGCAAT TTGATGGAAA AGTAGCCCTG CTTTATGACA ACATTGAATC CGTTCAGGAC CCTCTCACGT GCGCTATTAC CGATAGAGAT TTACAACGAC TGATCGATTT GTCGCTTTCG ATGCAGGAAG ATGGTCTGCA TGTCCTCCTC ACCTCGCGAT GGGCACTGCC GGAGTGGAAA GAGCCGGTTC ATCCGCTTGG AAAACCGGTC TACCGTGACT TCCTTGCCGT AGCCCAGCAG CAGAAACTGC CAAAGAGCTT TCTCAGGGAG TCGAAACGGT TACGAAAAGC CTATGATGTA CTGAACGGTA ACTACCGGGC ACTGGAGTTC TTTTCGGCTG CATTGCAGAA TATGGATGCC 
GGTGAAGAAG AGGTATTTCT TCAGCAGTTG CAAAAAGCAG AAGCTGAAAT CCAGGTTGAC ATGGCGCTCG AAAAGGTGTG GCGTCACAGA ACAGCGGAGG AACAGGAGCT GCTCAGACGC ATGACTGCTT TTGAGGTGCC TGTAGCTCTG GAGGGAGTGC AGAAAATCGC CATGCTCGAT CCACAGCAAC CTGTCGAAGC CATGGAAACA CTGCTCTCGG TATCGCTGAT CGAGCGGTAC TATAATCCTA AATGGAAAAC CGATGAGTTT CTGGTGTCAT CCCTCGTGCG AAGCTGGCTG GAAAAACAGG GGGTAGCGAA ACCCGAACAG GAGCTGCTGC AACAAGCAGC AACGTATCAC GAATGGCTGC TTGAGTACGA ACGAAACACG CTCGACCAGG CAATCACTAC CCATACGGCG CTCATGAGCG CAGGTATGGA CGAAAAGGCT CATCGCATAA CGCTTGACTG GATTGTTGGG CCGATGAACA TGGCAGGGAT GTACCAGACC CTGCTGCAGA CATGGCTGCT TCCAGCCTGT AACTCCGCTG ATCAGCAAAC TCTGGCAGAA GCTTTGGGGC AGACCGGCAA ACAGTATCAC CATGTCGGGG AGTATGACAC GGCGCTGGAG TACCTCAAAC GCTCCCTTGC GATATGCGAG GAGATCGGTG ACAAAAAGGG TGAAGGCGCC ACGCTCAATA ATATCTCGCA GATATATGAT GCCCGAGGGG AGTATGACAC GGCGCTGGAG TACCTCAAAC GCTCCCTTGC GATCAGACAG GAGATCGGCG ACAAACAGGG CGAAGGTGTC ACGCTGAATA ATATTTCGCA GATATATCAT GCCCGAGGGG AGTATGACTC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC AATCAGGCAG GAGATCGGCG ACAAACAGGG CGAAGGCACC ACGCTCAATA ATCTTTCGGG AATATATCAT GCCCGAGGGG AGTATGACTC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC AATCAGGCAG GAGATCGGCG ACAAACAGGG CGAAGGCGCC ACGCTCAATA ATATTTCGCT GATATATAGA GTCCGAGGGG AGTATGACTC GGCTTTGGAG TACCTCAAGC GCTCCCTTGC GATTCAGCAG GAGATCGGCG ACAAACAGGG AGAAGGCACC ACGCTCAATA ATATTTCACT GATATATCAT GCCCGAGGTG ACTATGAGAC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC GATCAGGCAG GAGATCGGCG ACAGTTCGGG GTTATGTGCA ACACTGTTCA ACATGGGTCA TATCTATTAT CAAAATAAAG ATCTATCGAA TGCGGTATTA TCGTGGGTAA AGGTTTACAG GATAGCGAGT AACATCAATC TGGCGCAAGC CTTGCAGGCA CTGGCAGCTC TTGCGCCTCG ACATGGATTG CCTGAAGGAC TTGAGGGGTG GGAGATGCTG GCGAAGCAGA TGGATGAGCA ACAGAAACAA TCATAA
|
Protein sequence | MLSPLILIPS ADLLDNHPYL SSQAKELSQA YAEKKVVTDN QLKPIGSALW SALGDGVELQ QAKHQAGQSI LPIVIESDIP AILQLPWEML WHPEYGFLAL HKEFTLSRSS PAIKVHMPDI ETGPLRILLF SSLPDDLDET DQLQIEEEQA GVLEALGPWL QSGHVVIEMP DDGRFSLFEE LLHSFRPHLV WLSGHGVFSK DLLNHNHKGY FLFEDEESGN GSLVDENTLA GAFSGTAVQG VILSACQSGK AISSDLNNGL MYALAQKGIP HVIGMRESIF DRAGVQFAKT FFSGLLQKRE IAQALQQARQ AITMPMRDDE HAKRYRYADL SFGQWCLPML LSREHNRSII DWHFMPQPMG AVNRRNRSVK QLSLPERFIG RRRELRKIQQ NFRNNQEKVL LLIGAGGMGK TAFAGKLLDT LKSDGYEVFY ISIHPNHDWR KTISSRIPFS LDDKRRPVYD NEISDIHDIV DRAECLFVLL LEQFDGKVAL LYDNIESVQD PLTCAITDRD LQRLIDLSLS MQEDGLHVLL TSRWALPEWK EPVHPLGKPV YRDFLAVAQQ QKLPKSFLRE SKRLRKAYDV LNGNYRALEF FSAALQNMDA GEEEVFLQQL QKAEAEIQVD MALEKVWRHR TAEEQELLRR MTAFEVPVAL EGVQKIAMLD PQQPVEAMET LLSVSLIERY YNPKWKTDEF LVSSLVRSWL EKQGVAKPEQ ELLQQAATYH EWLLEYERNT LDQAITTHTA LMSAGMDEKA HRITLDWIVG PMNMAGMYQT LLQTWLLPAC NSADQQTLAE ALGQTGKQYH HVGEYDTALE YLKRSLAICE EIGDKKGEGA TLNNISQIYD ARGEYDTALE YLKRSLAIRQ EIGDKQGEGV TLNNISQIYH ARGEYDSALE YLKRSLAIRQ EIGDKQGEGT TLNNLSGIYH ARGEYDSALE YLKRSLAIRQ EIGDKQGEGA TLNNISLIYR VRGEYDSALE YLKRSLAIQQ EIGDKQGEGT TLNNISLIYH ARGDYETALE YLKRSLAIRQ EIGDSSGLCA TLFNMGHIYY QNKDLSNAVL SWVKVYRIAS NINLAQALQA LAALAPRHGL PEGLEGWEML AKQMDEQQKQ S
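The fields of this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, that the gene length includes the stop codon, and that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ only in the permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to use the
# same amino-acid assignments (it differs only in allowed start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# Values copied from the record above.
start_bp, end_bp = 897813, 901118

# Assumption: 1-based inclusive coordinates, so both end points are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

# First three 10-base blocks and the last five codons of the gene sequence.
first_blocks = "ATGCTATCAC CCCTGATCCT CATCCCGTCA"
last_codons = "CAGAAACAATCATAA"

if __name__ == "__main__":
    print(gene_length)              # 3306
    print(protein_length)           # 1101
    print(translate(first_blocks))  # MLSPLILIPS
    print(translate(last_codons))   # QKQS*
```

Under these assumptions the computed values agree with the listed Gene Length (3306 bp) and Protein Length (1101 aa), and the translated fragments match the first and last residues of the protein sequence. To check the listed GC content, pass the full gene sequence to `gc_percent`.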