Gene Cpha266_0788 details

Gene Information

Locus tag: Cpha266_0788
Symbol:
ID: 4570207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 897813
End bp: 901118
Gene Length: 3306 bp
Protein Length: 1101 aa
Translation table: 11
GC content: 53%
IMG OID: 639765383
Product: TPR repeat-containing protein
Protein accession: YP_911264
Protein GI: 119356620
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:
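
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 897813
end_bp = 901118

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp;", protein_length, "aa; leftover bases:", remainder)
```

Under these assumptions the result agrees with the listed Gene Length (3306 bp) and Protein Length (1101 aa).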


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000957174
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTATCAC CCCTGATCCT CATCCCGTCA GCAGACCTGC TCGACAACCA TCCGTATCTT 
TCCAGTCAGG CAAAAGAGCT TTCGCAAGCC TATGCTGAAA AAAAGGTCGT GACCGACAAC
CAGTTGAAAC CAATCGGCAG TGCGCTCTGG TCGGCGCTTG GTGACGGGGT AGAGCTTCAG
CAGGCGAAAC ACCAGGCCGG ACAGTCAATC CTGCCCATCG TCATCGAGAG CGACATCCCG
GCAATCCTGC AACTGCCATG GGAGATGCTT TGGCATCCGG AATACGGGTT TCTTGCCCTG
CACAAGGAGT TTACGCTTTC GCGCAGCAGC CCGGCCATCA AGGTGCACAT GCCGGATATC
GAAACCGGAC CGCTGCGCAT CCTGCTTTTT TCATCGTTGC CCGACGATCT CGACGAAACC
GATCAGTTGC AGATCGAGGA GGAGCAGGCT GGTGTGCTCG AAGCACTTGG CCCATGGTTG
CAGAGTGGCC ATGTGGTGAT CGAAATGCCT GACGACGGAC GGTTCAGCCT GTTCGAAGAA
CTACTGCACT CGTTCAGGCC TCACCTCGTC TGGCTCAGTG GTCACGGGGT ATTCAGTAAA
GACCTGCTGA ACCATAACCA TAAAGGGTAT TTCCTGTTTG AGGACGAAGA GAGCGGCAAC
GGGTCACTCG TTGACGAGAA TACCCTTGCC GGAGCATTCA GCGGCACCGC CGTGCAGGGG
GTGATCCTCT CGGCCTGCCA GAGCGGCAAG GCAATTTCCT CCGACCTGAA CAACGGCCTG
ATGTACGCGC TTGCGCAGAA GGGCATCCCC CATGTCATTG GAATGCGGGA GTCCATCTTT
GACCGTGCTG GCGTTCAGTT CGCCAAAACT TTTTTTTCCG GTCTGCTGCA GAAGCGGGAG
ATAGCACAGG CGCTGCAGCA AGCCCGTCAG GCCATCACGA TGCCAATGCG GGACGACGAG
CATGCAAAAC GGTACCGCTA CGCCGACCTC TCGTTCGGTC AGTGGTGCCT GCCGATGCTG
CTGAGCAGAG AGCATAACCG GTCGATCATC GATTGGCATT TCATGCCGCA GCCGATGGGT
GCGGTGAACA GAAGGAATAG AAGTGTCAAG CAACTCTCAT TGCCGGAAAG ATTTATCGGG
CGTCGTCGGG AATTGCGAAA GATCCAGCAA AACTTCAGAA ACAATCAGGA AAAAGTGTTA
TTGCTCATTG GTGCAGGAGG TATGGGCAAA ACAGCGTTTG CCGGCAAACT CCTTGATACC
CTCAAGTCTG ATGGTTATGA GGTGTTTTAT ATTTCAATAC ATCCGAATCA CGATTGGCGA
AAAACAATAT CATCGCGAAT ACCATTTTCA CTCGATGATA AGAGACGACC GGTATACGAT
AATGAAATCA GCGATATTCA CGACATTGTC GATCGGGCAG AATGCCTTTT TGTTTTGTTA
CTTGAGCAAT TTGATGGAAA AGTAGCCCTG CTTTATGACA ACATTGAATC CGTTCAGGAC
CCTCTCACGT GCGCTATTAC CGATAGAGAT TTACAACGAC TGATCGATTT GTCGCTTTCG
ATGCAGGAAG ATGGTCTGCA TGTCCTCCTC ACCTCGCGAT GGGCACTGCC GGAGTGGAAA
GAGCCGGTTC ATCCGCTTGG AAAACCGGTC TACCGTGACT TCCTTGCCGT AGCCCAGCAG
CAGAAACTGC CAAAGAGCTT TCTCAGGGAG TCGAAACGGT TACGAAAAGC CTATGATGTA
CTGAACGGTA ACTACCGGGC ACTGGAGTTC TTTTCGGCTG CATTGCAGAA TATGGATGCC
GGTGAAGAAG AGGTATTTCT TCAGCAGTTG CAAAAAGCAG AAGCTGAAAT CCAGGTTGAC
ATGGCGCTCG AAAAGGTGTG GCGTCACAGA ACAGCGGAGG AACAGGAGCT GCTCAGACGC
ATGACTGCTT TTGAGGTGCC TGTAGCTCTG GAGGGAGTGC AGAAAATCGC CATGCTCGAT
CCACAGCAAC CTGTCGAAGC CATGGAAACA CTGCTCTCGG TATCGCTGAT CGAGCGGTAC
TATAATCCTA AATGGAAAAC CGATGAGTTT CTGGTGTCAT CCCTCGTGCG AAGCTGGCTG
GAAAAACAGG GGGTAGCGAA ACCCGAACAG GAGCTGCTGC AACAAGCAGC AACGTATCAC
GAATGGCTGC TTGAGTACGA ACGAAACACG CTCGACCAGG CAATCACTAC CCATACGGCG
CTCATGAGCG CAGGTATGGA CGAAAAGGCT CATCGCATAA CGCTTGACTG GATTGTTGGG
CCGATGAACA TGGCAGGGAT GTACCAGACC CTGCTGCAGA CATGGCTGCT TCCAGCCTGT
AACTCCGCTG ATCAGCAAAC TCTGGCAGAA GCTTTGGGGC AGACCGGCAA ACAGTATCAC
CATGTCGGGG AGTATGACAC GGCGCTGGAG TACCTCAAAC GCTCCCTTGC GATATGCGAG
GAGATCGGTG ACAAAAAGGG TGAAGGCGCC ACGCTCAATA ATATCTCGCA GATATATGAT
GCCCGAGGGG AGTATGACAC GGCGCTGGAG TACCTCAAAC GCTCCCTTGC GATCAGACAG
GAGATCGGCG ACAAACAGGG CGAAGGTGTC ACGCTGAATA ATATTTCGCA GATATATCAT
GCCCGAGGGG AGTATGACTC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC AATCAGGCAG
GAGATCGGCG ACAAACAGGG CGAAGGCACC ACGCTCAATA ATCTTTCGGG AATATATCAT
GCCCGAGGGG AGTATGACTC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC AATCAGGCAG
GAGATCGGCG ACAAACAGGG CGAAGGCGCC ACGCTCAATA ATATTTCGCT GATATATAGA
GTCCGAGGGG AGTATGACTC GGCTTTGGAG TACCTCAAGC GCTCCCTTGC GATTCAGCAG
GAGATCGGCG ACAAACAGGG AGAAGGCACC ACGCTCAATA ATATTTCACT GATATATCAT
GCCCGAGGTG ACTATGAGAC GGCGCTGGAG TACCTCAAGC GCTCCCTTGC GATCAGGCAG
GAGATCGGCG ACAGTTCGGG GTTATGTGCA ACACTGTTCA ACATGGGTCA TATCTATTAT
CAAAATAAAG ATCTATCGAA TGCGGTATTA TCGTGGGTAA AGGTTTACAG GATAGCGAGT
AACATCAATC TGGCGCAAGC CTTGCAGGCA CTGGCAGCTC TTGCGCCTCG ACATGGATTG
CCTGAAGGAC TTGAGGGGTG GGAGATGCTG GCGAAGCAGA TGGATGAGCA ACAGAAACAA
TCATAA
 
Protein sequence
MLSPLILIPS ADLLDNHPYL SSQAKELSQA YAEKKVVTDN QLKPIGSALW SALGDGVELQ 
QAKHQAGQSI LPIVIESDIP AILQLPWEML WHPEYGFLAL HKEFTLSRSS PAIKVHMPDI
ETGPLRILLF SSLPDDLDET DQLQIEEEQA GVLEALGPWL QSGHVVIEMP DDGRFSLFEE
LLHSFRPHLV WLSGHGVFSK DLLNHNHKGY FLFEDEESGN GSLVDENTLA GAFSGTAVQG
VILSACQSGK AISSDLNNGL MYALAQKGIP HVIGMRESIF DRAGVQFAKT FFSGLLQKRE
IAQALQQARQ AITMPMRDDE HAKRYRYADL SFGQWCLPML LSREHNRSII DWHFMPQPMG
AVNRRNRSVK QLSLPERFIG RRRELRKIQQ NFRNNQEKVL LLIGAGGMGK TAFAGKLLDT
LKSDGYEVFY ISIHPNHDWR KTISSRIPFS LDDKRRPVYD NEISDIHDIV DRAECLFVLL
LEQFDGKVAL LYDNIESVQD PLTCAITDRD LQRLIDLSLS MQEDGLHVLL TSRWALPEWK
EPVHPLGKPV YRDFLAVAQQ QKLPKSFLRE SKRLRKAYDV LNGNYRALEF FSAALQNMDA
GEEEVFLQQL QKAEAEIQVD MALEKVWRHR TAEEQELLRR MTAFEVPVAL EGVQKIAMLD
PQQPVEAMET LLSVSLIERY YNPKWKTDEF LVSSLVRSWL EKQGVAKPEQ ELLQQAATYH
EWLLEYERNT LDQAITTHTA LMSAGMDEKA HRITLDWIVG PMNMAGMYQT LLQTWLLPAC
NSADQQTLAE ALGQTGKQYH HVGEYDTALE YLKRSLAICE EIGDKKGEGA TLNNISQIYD
ARGEYDTALE YLKRSLAIRQ EIGDKQGEGV TLNNISQIYH ARGEYDSALE YLKRSLAIRQ
EIGDKQGEGT TLNNLSGIYH ARGEYDSALE YLKRSLAIRQ EIGDKQGEGA TLNNISLIYR
VRGEYDSALE YLKRSLAIQQ EIGDKQGEGT TLNNISLIYH ARGDYETALE YLKRSLAIRQ
EIGDSSGLCA TLFNMGHIYY QNKDLSNAVL SWVKVYRIAS NINLAQALQA LAALAPRHGL
PEGLEGWEML AKQMDEQQKQ S
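
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. It is an illustration, not the database's own method: the helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is the standard one, which shares its amino acid assignments with translation table 11. Table 11 also allows alternative start codons, which this sketch ignores; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCTATCAC CCCTGATCCT CATCCCGTCA GCAGACCTGC TCGACAACCA TCCGTATCTT"
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

Applied to the first line of the gene sequence, the translation reproduces the first twenty residues of the protein sequence (MLSPLILIPS ADLLDNHPYL). Running gc_fraction on the full gene sequence gives a value to compare against the listed GC content.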