Gene Cpha266_0306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0306 
Symbol 
ID4570599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp340193 
End bp341449 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content53% 
IMG OID639764906 
Producthypothetical protein 
Protein accessionYP_910792 
Protein GI119356148 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0825598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCC TTGAAAAATT ACAGATTCTG TCCGGAGCAG CGCGTTACGA CGCCTCGTGT 
TCATCCAGCG GTAGCAAACG AGAAGGATCT TCGAGCGGCC TTGGCAACAC TTCGTCGAGC
GGTATATGCC ACTCCTGGTC GGATGACGGG CGGTGTATTT CGCTGTTGAA AATTCTCCTC
TCCAATGACT GTCGTTACGA TTGCGCCTAC TGTGTCAACA GGATATCCAA TCCGGTTCAG
AGAGCCTCTT TCACTGCACG GGAAGTGGTC GATCTCACTA TGGAGTTTTA TCGGCGTAAC
TATATCGAGG GTCTCTTTTT AAGCTCGGCA GTCATGCAGA GCCCCGATCA CACCATGGAG
CGGATGGTCA GCGTTGCCGA AACGCTTCGT ATCGATGAAA AATTCGGCGG CTACATACAT
CTGAAAATCA TTCCGGGCAG CAGCAGCGAA CTGGTGCGGA AGGCGGGACT CTATGCCGAT
CGCATCAGCG TCAATATCGA GCTCCCCTCC GAGACGGCTT TACAGCGTCT TGCGCCACAG
AAACAGAAAG CCGGCATTCT TGAGCCAATG GCCTTTATCG GACGGGAGAT AAAAGGATCT
CTTCTTGAGC GGCAGAGAGG TCGCAACGCG ACGCCACGGT TTGCTCCTGC CGGACAGAGC
ACTCAGATGA TTATCGGAGC AAGCCCCGAA AGCGATTTTC AGATACTCAA GCTTTCACAG
GGGCTCTACA AAAAAATGAA TCTTAAACGG GTCTATTATT CGGCTTTTAT TCCGGTCAAT
GAGGACAGTC GTCTTCCCGT GCTCGCCTCG CCGCCGCTCC TTCGCGAACA CAGGCTCTAT
CAGGCCGACT GGCTGCTGCG CTTTTACGGT TTTACCGCAG AAGAGATTCT TTCAGACGAA
GCGCCCAACC TTGACGAAAC ATTTGATCCC AAAACAGCCT GGGCTCTTCG CAATCCCGGG
TTTTTTCCTG TAGAGATCAA TCGCGCAGAC TATAGCGTTC TCCTTCGTGT TCCAGGTATA
GGGGTCACTT CGGCCAGGCG TATTGTTGCC GCTCGTCGGT TTGCCTCCAT TACCCCTGAA
GGAATGAAAA AGATCGGAGT GGTCATGAAA CGGGCGAAAT ATTTTATCAC CTGCTCCGGC
AGGCCTTTTG AAAATACAGA CCGGCAACCG GCCCTTCTGA AGAGCCGGCT CCTGCTTGCC
GGGGGCGTCG CTCCGGAACC TCCGAAGCAG CTTGTGCTGC CCGGCCTTTT TGCCTGA
 
Protein sequence
MNTLEKLQIL SGAARYDASC SSSGSKREGS SSGLGNTSSS GICHSWSDDG RCISLLKILL 
SNDCRYDCAY CVNRISNPVQ RASFTAREVV DLTMEFYRRN YIEGLFLSSA VMQSPDHTME
RMVSVAETLR IDEKFGGYIH LKIIPGSSSE LVRKAGLYAD RISVNIELPS ETALQRLAPQ
KQKAGILEPM AFIGREIKGS LLERQRGRNA TPRFAPAGQS TQMIIGASPE SDFQILKLSQ
GLYKKMNLKR VYYSAFIPVN EDSRLPVLAS PPLLREHRLY QADWLLRFYG FTAEEILSDE
APNLDETFDP KTAWALRNPG FFPVEINRAD YSVLLRVPGI GVTSARRIVA ARRFASITPE
GMKKIGVVMK RAKYFITCSG RPFENTDRQP ALLKSRLLLA GGVAPEPPKQ LVLPGLFA