Gene Cpha266_0209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0209 
Symbol 
ID4570590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp233837 
End bp235177 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content49% 
IMG OID639764809 
Productpentapeptide repeat-containing protein 
Protein accessionYP_910700 
Protein GI119356056 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA TTTGGCACCT TGCAGGTATC GTTCTTTTAA CCGTTTCGGG TTTTCTTCCA 
GTTTCATCAG CCATCGGGTT TGATCCTGAT GCCGTTAAGC TGCTGGTGAA AAGTCCGAAA
GAGTGGAACG CCTTTCGCCA GCAGCATTCC ATGCAGCCTG TTGATCTTGA CAAGGCAAAA
CTTGAAGATG CCGATCTTGA GGGGGCGAAC CTCAGCAACA GTTCGCTTGT CAGAGCCGAG
TTGAGCGGGG CTAATCTGAA CAATGCCGAT TTACGAGGAT CAAACCTTCA GCAGGCGTTT
ATTAAAAAAG CGGATCTTAA AGGGGCCGAC TTGCGTGAGG CTTACCTTGT CAAGGTGAAT
CTTAAAGAAG CATTCATGGA GAAGTCCATG CTTCAAAAAG CGAATCTTCA AAGCGCTAAT
CTCAGATGGA CAAGATTTCA CAGGGCAGAT CTTGCAGGAT CGAACCTTCA GGATGCCGTT
CTGTTTGAAA CCAGCTTTGT TGATGCTGAT CTTCGGGGCG CCAATCTCAA GGGTGCGCTC
TATGTTGGAA ATGCCAATTT CAGCGGAGCA AAAATATCCA GTAACACCAT TACGCCTTCT
GGTGAAAAAG CAACAGCGTC TTGGGCTGCG ATACGCGATG CTGAATATAT CAAAGAAGCG
GATGCGGCAA TGCCGGTTTA TGCTTCCCTT CCGGTTATGG TATTTGCATC CCCATCGGCA
GGCCTGAAGT CCGCATCCGC TTCCTCCGGT CAGGCAGTGA TGGGCAGCAA ACAGCAGCAA
GCGCTGATGG TTGATGATGT CGAAACCTGG AATACCATGC GTGCCGCCAA TCCTGAGCTG
AAAATAGAGT TGAAAGAGGA AAAACTTGAA AATGCAAGGC TGAAGGGAGT GAATTTGCAG
AAAGCTTCGA TGCCTGGTGC CGATTTCGAG GATGCAAATC TTGACGAGGC CATGATGGAG
GGGGCCGATC TGAGTAAAGC CGATTTCCAG AAAGCGGATA TGAAAAAAGT TAAACTTCAG
GGGGCAAATC TTTCCGGAGC GAATCTTGAC CGTTCATTCA TGGAAGGGGC AGATCTGCGC
AATGCCAATC TCAGCGGAGC GAACCTGTTC GGAGCTATGC TTAAGGATGC CAATCTCAGC
GGCGCTAATC TCAGCGGAGC ATCCCTGTTT GAGACAGATC TTGAGGGAGC AAATCTTTCC
GGAGCGAATC TTAAGGGAGC AAATCTCGTG GAGCCTAACC TGAAAAATGC GATCATCTCA
CCGGACACCA TTCTTCCATC CGGCAAGAAT GCGACGCAGA GTTGGGCGGT TATAAAGGGA
GCGACATTTG TTAAGCCTTA G
 
Protein sequence
MKQIWHLAGI VLLTVSGFLP VSSAIGFDPD AVKLLVKSPK EWNAFRQQHS MQPVDLDKAK 
LEDADLEGAN LSNSSLVRAE LSGANLNNAD LRGSNLQQAF IKKADLKGAD LREAYLVKVN
LKEAFMEKSM LQKANLQSAN LRWTRFHRAD LAGSNLQDAV LFETSFVDAD LRGANLKGAL
YVGNANFSGA KISSNTITPS GEKATASWAA IRDAEYIKEA DAAMPVYASL PVMVFASPSA
GLKSASASSG QAVMGSKQQQ ALMVDDVETW NTMRAANPEL KIELKEEKLE NARLKGVNLQ
KASMPGADFE DANLDEAMME GADLSKADFQ KADMKKVKLQ GANLSGANLD RSFMEGADLR
NANLSGANLF GAMLKDANLS GANLSGASLF ETDLEGANLS GANLKGANLV EPNLKNAIIS
PDTILPSGKN ATQSWAVIKG ATFVKP