Gene Cpha266_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0830 
Symbol 
ID4570367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp942706 
End bp944067 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content50% 
IMG OID639765428 
Productzeta-carotene desaturase 
Protein accessionYP_911305 
Protein GI119356661 
COG category[S] Function unknown 
COG ID[COG3349] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02731] phytoene desaturase
[TIGR02732] carotene 7,8-desaturase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.762196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAACAG CTATTTTCGG AGCAGGAGTT GCAGGGCTCA GTGCAGCAAT AGAACTGGTC 
GAACGCGGCC ATACCGTCGA ACTATATGAA AAACGAAAGG TACTCGGCGG CAAGGTTTCT
GTATGGAAAG ATAACGACGG CGATTCCATC GAGTCGGGCC TGCATATCGT TTTTGGTGGG
TACGCCCAGC TTCAGAACTA TCTGAAACGG GTTGGAGCTG CCGATAACTA CTCATGGAAA
GCCCACTCCC TGATCTATGC CGAACCGGAC GGAAAACAAT CCTTCTTCAA AAAAGCAAAC
CTTCCAAGCC CCTGGGCCGA AGTGGTTGGA GGACTTCAGA CTGATTTTCT CACCATGTGG
GACAAGATTT CGCTCCTCAA GGGGCTTTAC CCTGCTCTGG CCGGTAACGA AGCGTATTTC
CGCAGCCAGG ATCATATGAC CTATTCGGAA TGGCACAGAA AAAGAGGCGC CTCTGAACAC
TCGCTTCAGA AACTATGGCG CGCTATTGCC CTTGCCATGA ACTTCATTGA ACCGAATGTG
ATCAGCGCTC GTCCGATGAT CACCATTTTC AAGTATTTCG GTACCGATTA TGAAGCGACG
AAATTTGCGT TTTTCAAAAA GAACCCGGGC GATTCAATGA TTGAGCCGAT GCGCCAGTAT
ATCCAGAGCA AAGGCGGCCG GATTTTCGTT GATGCAAAGC TTAACCGGTT CGAACTCAAC
AGCGATGAAA CCGTCAAACA TGCCGTTCTT CAGGATGGGC AGATCATCGA AGCCGATGCG
TTCATTTCAG CCCTTCCCGT GCACACGGTT AAAAAAATCA TTCCCAGGCC ATGGCTTGCC
CACAAGTATT TCAGGAACCT CCATGAATTT CAGGGAAGCC CTGTTGCAAA CTGCCAGCTC
TGGTTTGACC GAAAAATCAC TGATACGGAT AATCTCATGT TTTCACAGGG AACGATTTTT
GCAACCTTTG CCGATGTTTC GATCACCTGT CCTGATGATT TTCAGAAGGG AAACGGCACA
GCAAATGGAG GCAGCGTCAT GAGCCTTGTG CTTGCGCCGG CACACCAACT GATGGATATG
CCTAACGAGG TCATAACGGA ACTGGTCATG AACGACATTC ACGACCGCTT TCCGGCATCA
CGCCAGGCCA AGCTCCTGAA ATCAACCATC GTCAAGATTC CTCAGTCAGT ATATAAGGCT
GTACCCGATG TCGACAAGTT CCGCCCCGAC CAGATAAGCC CGGTGAAAAA TTTCTTCCTC
GCAGGTGACT ATACCGATCA GCACTATCTC GCATCAATGG AAGGCGCTGC CCTGAGCGGA
AAACTGGTGG CCGAAAAACT TCACGCAAAA TTCGGATCCT GA
 
Protein sequence
MKTAIFGAGV AGLSAAIELV ERGHTVELYE KRKVLGGKVS VWKDNDGDSI ESGLHIVFGG 
YAQLQNYLKR VGAADNYSWK AHSLIYAEPD GKQSFFKKAN LPSPWAEVVG GLQTDFLTMW
DKISLLKGLY PALAGNEAYF RSQDHMTYSE WHRKRGASEH SLQKLWRAIA LAMNFIEPNV
ISARPMITIF KYFGTDYEAT KFAFFKKNPG DSMIEPMRQY IQSKGGRIFV DAKLNRFELN
SDETVKHAVL QDGQIIEADA FISALPVHTV KKIIPRPWLA HKYFRNLHEF QGSPVANCQL
WFDRKITDTD NLMFSQGTIF ATFADVSITC PDDFQKGNGT ANGGSVMSLV LAPAHQLMDM
PNEVITELVM NDIHDRFPAS RQAKLLKSTI VKIPQSVYKA VPDVDKFRPD QISPVKNFFL
AGDYTDQHYL ASMEGAALSG KLVAEKLHAK FGS