Gene Cag_1888 details


Gene Information

Locus tag: Cag_1888
Symbol:
ID: 3746787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2399583
End bp: 2401100
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 50%
IMG OID: 637774425
Product: carotenoid isomerase, putative
Protein accession: YP_380181
Protein GI: 78189843
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02730] carotene isomerase; [TIGR02734] phytoene desaturase
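
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the terminal stop codon is counted in the gene length but not in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2399583
END_BP = 2401100


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1518 bp
    print(protein_length(length), "aa")  # 505 aa
```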


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGATA AAAGAGCTGA CGTTATTGTG ATTGGGGCAG GCATTGGTGG CTTAACTACC 
GCCGCCTTGC TTCAAGAGAG GGGAATTCAA ACCGTGGTGT TTGAAAAAAA TCGCTATGCT
GGTGGCAGTT GCTCGGCGTT TCGCCGCGAA GGCTACACCT TTGATGCTGG GGCATCGGTT
TTTTATGGTT TTGGCGATAA TGCTTCAAGC GGAACGCTTA ATTTACATAC TCGCATTTTT
CGCAAGTTTG GGATTAAGGT GGCAACGGTG CCCGATTCTG TGCAAATTCA CTACCATCTT
CCCAATGGAT TTTCGGTAGC CGCAAGCCAT AATCGGCAGC AATTTTTAGC TGCGCTGAAA
GCCCGTTTTC CGCATGAAGC TGAGGGAATT GAGCGCTTTT ACGAGGAGCT AACAGCCGTC
TGCGATATTT TACGCGCTAT GCCTGCTGGC TCGTTAGAAG ATGTTGTTCA CTTAGCCTCA
GTTGGCGCTG CTCATCCGTT AAAAACTGTT GCATTGGCGC TGAAAAGTTT TCGCTCCATG
GGCAAAACAG CTCGCCGCTA CATTCGAGAT GAAGAGCTGC TCCGCTTTAT TGACATTGAA
GCTTATTCGT GGGCGGTGCA GGATGCTACG GCAACACCGC TTGTGAATGC AGGCATCTGC
CTTGCCGATC GCCATTATGG TGGCATTAAC TATCCCATCG GCGGTTCAGG CGCAATTCCC
GAAGGACTTT GCAAAGCCTT TCAGCAACAC GGTGGTACGC TTAACTATCA AGCTGAGGTG
GTAGAAATTC TCTTGGAAGC GGGTGAAGCG CGAGGTGTGC GGTTAGCGGA TGGCACGGTG
CATTATGCCA AAGTGGTTAT TAGTAATGCA ACCATTTGGG ACACCTTTAA TCGGATGGTA
AAGGATGTGC GTTACCGTGT TGAGGAGGAT CGCTTTTTGC GGGCACCAAG TTGGTTTCAG
CTTTTTCTTG GTGTGGATAG CCGTGTAATT CCAGAAGGCT TTAACGTTCA CCATATTATT
GTGGATAATT GGCAAAGTTA CCAGCAGCTT GGCGGCACTC TTTATTTTTC GGCTCCCACC
ATTCTTGATC CATCATTAGC ACCTGCAGGG CATCATATTA TTCATGCTTT TGTTACCGAT
GAGGTTGCAT GTTGGAGCAA CTACGAGCGT GGTAGTAGCG CCTACCGTGC AGCCAAAGAG
GAAAAAGCAG CCGCTCTTAT TGCTCGCATT GAGCGCATTG TGCCTGAGCT TTCGTCGGCG
ATTAAGCTGA AAGTGCTTGC TACGCCGCTC ACGCATGAAC GCTACCTCAA TCGTTATAAA
GGCTCGTATG GAGCACTGTT AAAGCCCGGT CAAACCATTT TGCAAAAGCC ACAAAACACA
ACGCCCGTGC GCAATTTATA TGCGGTGGGC GATAGTACTT TTCCCGGGCA GGGAGTAATT
GCGGTTACCT ATTCGGGTGT TTCGTGCGCT TCGTATGTGG CGCGCCGTTT TGGCAAACCG
CTTGAGGAGT TGGGGTAG
 
Protein sequence
MADKRADVIV IGAGIGGLTT AALLQERGIQ TVVFEKNRYA GGSCSAFRRE GYTFDAGASV 
FYGFGDNASS GTLNLHTRIF RKFGIKVATV PDSVQIHYHL PNGFSVAASH NRQQFLAALK
ARFPHEAEGI ERFYEELTAV CDILRAMPAG SLEDVVHLAS VGAAHPLKTV ALALKSFRSM
GKTARRYIRD EELLRFIDIE AYSWAVQDAT ATPLVNAGIC LADRHYGGIN YPIGGSGAIP
EGLCKAFQQH GGTLNYQAEV VEILLEAGEA RGVRLADGTV HYAKVVISNA TIWDTFNRMV
KDVRYRVEED RFLRAPSWFQ LFLGVDSRVI PEGFNVHHII VDNWQSYQQL GGTLYFSAPT
ILDPSLAPAG HHIIHAFVTD EVACWSNYER GSSAYRAAKE EKAAALIARI ERIVPELSSA
IKLKVLATPL THERYLNRYK GSYGALLKPG QTILQKPQNT TPVRNLYAVG DSTFPGQGVI
AVTYSGVSCA SYVARRFGKP LEELG
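
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it only uses the first 60 and last 18 nucleotides of the gene sequence, with spaces removed, as input. The helper names `translate` and `gc_content` are illustrative.

```python
from itertools import product

# Codon table built from the usual compact encoding (bases in TCAG order).
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First and last lines of the gene sequence in this record, spaces removed.
FIRST_60_NT = "ATGGCAGATAAAAGAGCTGACGTTATTGTGATTGGGGCAGGCATTGGTGGCTTAACTACC"
LAST_18_NT = "CTTGAGGAGTTGGGGTAG"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate(FIRST_60_NT))  # MADKRADVIVIGAGIGGLTT
    print(translate(LAST_18_NT))   # LEELG*
    print(round(gc_content(FIRST_60_NT), 1))
```

The two translated fragments match the start and end of the protein sequence listed above; the trailing `*` is the TAG stop codon, which is why the protein is one residue shorter than the codon count.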