Gene Cpha266_1173 details

Gene Information

Locus tag: Cpha266_1173
Symbol:
ID: 4570750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1322177
End bp: 1323562
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 48%
IMG OID: 639765766
Product: zeta-carotene desaturase
Protein accession: YP_911634
Protein GI: 119356990
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02731] phytoene desaturase
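
The coordinate and length fields in this record agree with each other, which a short sketch can confirm. The variable names are illustrative and not part of the record. It assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record; the variable names are hypothetical.
start_bp = 1322177
end_bp = 1323562

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1386 461
```

Both results match the Gene Length (1386 bp) and Protein Length (461 aa) fields.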


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000320899
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGTACAC ATAACAAAAC AGCTCTGATT CTCGGGGGCG GACTTGCAGG TCTTACTGCA 
GCAAAGCGTT TGACTGACAG GGGTTTTCAG GTCAGGGTTC TTGAGAAAAG GGAGATTTTC
GGTGGAAAGG TGTCGTCATG GAAAGATGAA GAGGGCGACT GGATTGAGTC GGGAACTCAC
TGTTTTTTCG GCGCCTATTC GGTTCTCTAT GATCTCATGA AGGAGATCGA TACCTATCAC
GCAGTGCTCT GGAAAGAGCA CAAGCTAACC TATACTCTGG CTGAAGGAGA TCGGTTTACG
TTTAATACGT GGGATCTTCC AAGTCCGCTT CACCTTCTTC CGGCCATTTT GAAAAATGGC
TATTTTTCTT TTGGCGAGAT GGCTTCTTTT TCGAAATCGC TGATTCCGCT GGCCCTGCAG
CAGCAGCAAT ATCCTCCTTC TCAGGATCAT CTTACCTTTG CAGAATGGGC TGAAGAGAAA
AAGTTCGGGC ATCGTCTGAT GCAGAAAATG TTCAGACCGA TGGCGCTTGC GCTGAAATTT
ATTCCCCCGG AGGAGATCTC GGCCAAAATC ATTCTCGATG TTACCGAAAC CTTTTACCGC
CTTCCGAACG CATCTCGGAT GGGCTTTCTC AAAGGTTCGC CCCAGGAATA TCTTCATCAA
CCCCTTCTTG ATTATGCAAC TGCAAAAGGG GCCGTCTTCA AAAACAAGAC GGCAATAGAA
GAGCTTTTGT ATGACGGCGG TGAAATCAGG GGCGTTCATC TTCGTAATGG CGAGATACTT
ACGGCTGATT ACTACCTGTC GGCACTGCCG ATCAATGATC TGAACAAGGT ATTGCCCGAG
GAGCTTAAAA AGCATGATCG ATTTTTCTCG GTGCTTGGCA ATCTTGAAGG CGTTCCTGTT
ATTTCTGTAC AGATATGGTA TGACAAAGAG ATTACGCCCG TTGATAATGT GCTTTTCAGC
CCTGATGGCA TTATTCCGGT CTATGCCAAT CTGGCAAAAA CGACGCCTGA GTACCAGACA
CTCAGGGGTG AGCCGTTCAG CGGAAAAACA CGTTTTGAGT TCTGTGTCGC GCCGGCACGA
AATCTTATGG GCCTGACGAA AGAGGAGATT ATTCACCAGG TCGATCTCAG CGTCAGAAAC
TGTTATCCGA AATCGTCGGC TGGTGCAAGG ATATTGAAAG CAACCGTCGT GAAGATTCCG
CACTCCGTCT ATGCGCCGTT GCCCAATATG GAGCAGTATC GGCCAACGCA AAGAACGCCT
GTGCGCAACC TGTTTCTGGC TGGCGGGTTT ACCCGGCAGC TTTATTATGA TTCAATGGGT
GGTGCGGTCA TGAGCGCAAA TCTTGCCGTA GAGGGTATTC TGAAAGCATC GGGAGTTATG
GATTAA
 
Protein sequence
MSTHNKTALI LGGGLAGLTA AKRLTDRGFQ VRVLEKREIF GGKVSSWKDE EGDWIESGTH 
CFFGAYSVLY DLMKEIDTYH AVLWKEHKLT YTLAEGDRFT FNTWDLPSPL HLLPAILKNG
YFSFGEMASF SKSLIPLALQ QQQYPPSQDH LTFAEWAEEK KFGHRLMQKM FRPMALALKF
IPPEEISAKI ILDVTETFYR LPNASRMGFL KGSPQEYLHQ PLLDYATAKG AVFKNKTAIE
ELLYDGGEIR GVHLRNGEIL TADYYLSALP INDLNKVLPE ELKKHDRFFS VLGNLEGVPV
ISVQIWYDKE ITPVDNVLFS PDGIIPVYAN LAKTTPEYQT LRGEPFSGKT RFEFCVAPAR
NLMGLTKEEI IHQVDLSVRN CYPKSSAGAR ILKATVVKIP HSVYAPLPNM EQYRPTQRTP
VRNLFLAGGF TRQLYYDSMG GAVMSANLAV EGILKASGVM D
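
The gene and protein sequences above can be cross-checked by translating the DNA, as in the minimal sketch below. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of the source database. The sketch assumes the sequence is given in its coding orientation, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons).

```python
# Codon layout with bases ordered T, C, A, G at each position.
# Amino-acid assignments are the same in NCBI tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGAGTACAC ATAACAAAAC AGCTCTGATT"
print(translate(first_blocks))  # prints: MSTHNKTALI
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` gives a value to compare against the GC content field.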