Gene Cpha266_0791 details


Gene Information

Locus tag: Cpha266_0791
Symbol:
ID: 4570210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 901876
End bp: 905067
Gene Length: 3192 bp
Protein Length: 1063 aa
Translation table: 11
GC content: 53%
IMG OID: 639765386
Product: TPR repeat-containing protein
Protein accession: YP_911267
Protein GI: 119356623
COG category:
COG ID:
TIGRFAM ID:
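
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 901876
end_bp = 905067
gene_length_bp = 3192
protein_length_aa = 1063

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bp holds N // 3 codons; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks hold for the values listed here: 3192 bp is 1064 codons, one of which is the stop.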


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0118647
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCGG TTCCTCCATT GTTCCTGACC CCGGATTCCG CTATCATCGA GCGCTATCCA 
GATCTTCTTA GGCAATCCGC CGAACTCTCG AAGGCATATG CAGATCGTCA CAGTCGTATC
TCCGATGCGG CGTTGCAGGC CCTGGGCAGT GCGCTCTGGC AGGTGCTTGA TGCGGACGAG
AAGCTGCAAC ACGCAAAAAA GCAGGCCGGA ACAGGGATAC TGCCACTCGT CATCGCAAGC
GATGATCCGG CAATCCTGCA ACTGCCATGG GAGACACTGT ACCATCCCGA TTACGGATTT
CTTGCCCGGC ATGAGGGGTT CACACTCTCC CGCACCATCC CTTCGATCAA GACAGGAGTG
TCGGATATCG AACCCGGCCC CCTGAGAATC CTGCTCTTTT CCTCTCTGCC CGATAACCTT
GGAGAAAAGA ATCAGCTTGA GATCGAGGTT GAACAGGGGA GAGTGCTTGA AGCCCTTGGC
CAGTGGCGGC AGAGCGGACA TGTAGTGCTT GAAATGCCTG ATGATGGTCG GTTTAGCGAG
TTCAAGAGGA TACTCAAATC ATTCAAACCG CATCTTGTCT GGCTCAGCGG CCACGGTATA
TTCCAGAGCG ACCCGCTGAA CCACCATGAC AAGGGGTACT TCCTCTTTGA AAACGAGCAT
GGTGATGGCG GTGAACTGGT CGATGAAGAT CAGCTTGCAG AAGCATTCAC AGGCATTGAT
TTGCAGGGCA TCATTCTCTC AGCCTGTCAG ACAGGCAAAG CCGACTCGGC CAACCTGAAC
AACGGCCTGA TGTACAAGCT TGCATGGAAA GGCGTTCCCC ATGTCATCGG GATGCGTGAA
TCCATTCTTG ATCGTGCAGG CGTGCAGTTT GCCCAGGCAT TCTTTGAAGC CCTGATCGAT
AAAAAAGGTA TAGCCCTTGC CCTCCAGGAG GCCCGCCGGG CAATCATTCT CCCCCTGAAA
GACGATGAAG AGGCCAGAAC GTCGCTTGAC GCTGAACTCT CGCTCGGGCA GTGGTGCCTG
CCGATGCTCC TTAGCAGAGA GCATAACCGG CCGATCATCG ATTGGCATTT CACGCCGCAG
CCGATGCGTG CGGCAAATCT TCTGAACGAA AGCCTCGACA GGATTACCCT GCCTGCACAG
TTTATCGGAC GGCGTCGGGA GTTACGGAAA CTGCAGCGGG AGTTCCGGGA AGGGAAGACA
AACGTGCTGC TCTTCACCGG AGCTGGAGGC ATGGGTAAAA CAGCATTTGC CGGTAAACTC
ATCAACAACC TCAAGGCGGA TGGCTTTGAA ATATTCGGGT TCTCGGCCAG AGCGGAACAC
GACTGGCGGG ATACCCTTTT TCAGATGAAA CTGATGCTCG ACAAGGAGCG TATCGAAAAA
TACACGCTCA TCGAAAAACA GTATCCCGAT CCGGCAAAAC AGGCAGCAGG GCTTCTGAAA
CTGGTGCTTG AGCAGTTTCA GCGGAAAGTG GCGATATTCT TCGATAACCT CGAATCCGTG
CAGGATACGG TCTCCCGAAC CATCATCGAC CCTGAGCTGC TGATCTGGAT CAATGCGGCA
GTTGGCCTGA AAAAGGAAGG TCTGAGAGTC CTCCTAACCT CGCGATGGGC ACTGCCGGAG
TGGAAAGAGC CGGTTTATCC ACTTGGAAAA CCGGTCTACC GTGACTTCCT TGCCGTAGCT
CAGCAGCAGA AACTGCCAAA GAGCTTTCTC AGGGAGTCGA AACGGTTACG AAAAGCCTAT
GATGTACTGA ACGGCAACTA CCGGGCACTG GAGTTCTTTT CGGCTGCATT GCAGAATATG
GATGCCGGTG AAGAAGAGGT ATTTCTTCAG CAGTTGCAAA AAGCAGAAGC TGAAATCCAG
GTTGACATGG CGCTCGAAAA GGTGTGGCGT CACAGAACAG CGGAGGAACA GGAACTGCTC
AGACGCATGA CTGCTTTTGA GGTGCCTGTA GCTCTGGAGG GAGTGCAGAA AATCGCCATG
CTCGATCCAC AGCAACCTGT CGAAGCCATG GAAACACTGC TCTCGGTATC GCTGATCGAG
CGGTACTATA ATCCTAAATG GAAAACCGAT GAGTTTCTGG TGTCATCCCT TGTGCGAAGC
TGGCTGGAAA AGAGAGGTAT AGCCAAACCC GTACCTGAGC TGTTGCAGAA GGCTGCAACC
TATCACGAGT GGCTTCTGAT GCACGAACGA AACACGCTCG ATCAGGCAAT CACTACCCAT
ACGGCGCTCA TGAACGCAGG GATGGACGAA AAGGCACATC GCATAACGCT TGACCGGATT
GTCGGGCCGA TGAATATGGC AGGGATGTAC CAGACCCTGC TGCAGACATG GCTGCTTCCA
GCCTGTAACT CCGATGACCA GCAAACTCTG GCGGTAGCTT TGGGGGAAAC AGGACGTCAA
TATCTCGATT TGTGTGAATA TGAGACGGCT CTGGAGTACC TGAACCGGTC GCTTTCGATA
AGGCAGGAAA TCGGTCACAA GTTAAGAGAA GGCGCGACGT TGAATAATAT TTCTCAGATA
TATAAAGTCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAACCGCTC TCTTGCGATA
TGCCAGGAGA TCGGCGACAA ACGGGGTGAA GGCACCACGC TCAATAATAT TTCGCTGATA
TATAGTGCCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAAGCGATC CCTTGCGATC
AGGCAGGAGA TCGGCGACAA AAAGGGCGAA GGTGCCACAC TCAATAATAT TTCGCAGATA
TATCATGCCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAACCGCTC CCTTGCGATA
TGCCAGGAGA TCGGCGACAA AAAGGGCGAA GGCACCACGC TCAATAATAT TTCGCTGATA
TATAAAGTCC GAGGGGAGTA TGACACGGCG CTGGAGTACC TCAAACGCTC CCTTGAGATC
AGACAGGAGA TCGGCGACAG TGCGGGGTTA TGTGCAACAC TGTTCAACAT GGGTCATATT
CATGCACAAA ACGAAGATCT GCCGAATGCG GTATCATCGT GGGTAACGTC TTACAGGATA
GCGAGCAACA TCAATCTGGC GGAAGGATTA CAGGCACTGG CAGCACTTGC GCCTCAGCTT
GGGTTGCCTG AAGGACTTGA GGGGTGGGAG ATGCTGGCGA AGCAGATGGA TGAGCAACAG
AAACAATCAT AA
 
Protein sequence
MNPVPPLFLT PDSAIIERYP DLLRQSAELS KAYADRHSRI SDAALQALGS ALWQVLDADE 
KLQHAKKQAG TGILPLVIAS DDPAILQLPW ETLYHPDYGF LARHEGFTLS RTIPSIKTGV
SDIEPGPLRI LLFSSLPDNL GEKNQLEIEV EQGRVLEALG QWRQSGHVVL EMPDDGRFSE
FKRILKSFKP HLVWLSGHGI FQSDPLNHHD KGYFLFENEH GDGGELVDED QLAEAFTGID
LQGIILSACQ TGKADSANLN NGLMYKLAWK GVPHVIGMRE SILDRAGVQF AQAFFEALID
KKGIALALQE ARRAIILPLK DDEEARTSLD AELSLGQWCL PMLLSREHNR PIIDWHFTPQ
PMRAANLLNE SLDRITLPAQ FIGRRRELRK LQREFREGKT NVLLFTGAGG MGKTAFAGKL
INNLKADGFE IFGFSARAEH DWRDTLFQMK LMLDKERIEK YTLIEKQYPD PAKQAAGLLK
LVLEQFQRKV AIFFDNLESV QDTVSRTIID PELLIWINAA VGLKKEGLRV LLTSRWALPE
WKEPVYPLGK PVYRDFLAVA QQQKLPKSFL RESKRLRKAY DVLNGNYRAL EFFSAALQNM
DAGEEEVFLQ QLQKAEAEIQ VDMALEKVWR HRTAEEQELL RRMTAFEVPV ALEGVQKIAM
LDPQQPVEAM ETLLSVSLIE RYYNPKWKTD EFLVSSLVRS WLEKRGIAKP VPELLQKAAT
YHEWLLMHER NTLDQAITTH TALMNAGMDE KAHRITLDRI VGPMNMAGMY QTLLQTWLLP
ACNSDDQQTL AVALGETGRQ YLDLCEYETA LEYLNRSLSI RQEIGHKLRE GATLNNISQI
YKVRGEYDTA LEYLNRSLAI CQEIGDKRGE GTTLNNISLI YSARGEYDTA LEYLKRSLAI
RQEIGDKKGE GATLNNISQI YHARGEYDTA LEYLNRSLAI CQEIGDKKGE GTTLNNISLI
YKVRGEYDTA LEYLKRSLEI RQEIGDSAGL CATLFNMGHI HAQNEDLPNA VSSWVTSYRI
ASNINLAEGL QALAALAPQL GLPEGLEGWE MLAKQMDEQQ KQS
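
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool the database itself uses. It assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function names `translate` and `gc_fraction` are hypothetical helpers written for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAACCCGG TTCCTCCATT GTTCCTGACC CCGGATTCCG CTATCATCGA GCGCTATCCA"
last_line = "AAACAATCAT AA"

print(translate(first_line))  # MNPVPPLFLTPDSAIIERYP
print(translate(last_line))   # KQS
```

The two printed fragments agree with the start (MNPVPPLFLT PDSAIIERYP) and end (KQS) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.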