Gene Cagg_2970 details


Gene Information

Locus tag: Cagg_2970
Symbol:
ID: 7266501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3640632
End bp: 3642113
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 52%
IMG OID: 643567792
Product: phytoene desaturase
Protein accession: YP_002464266
Protein GI: 219849833
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02734] phytoene desaturase
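
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, not stated in the record) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3640632
END_BP = 3642113
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length (1482 bp) and protein length (493 aa) are consistent with the coordinates.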


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000576965
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCCAAA AGGAGATTGT CGTAATTGGG AGTGGGTTTG GGGGGCTTAG TGCTGCTATT 
CGGCTTGCGG CGCAGGGCCA TTCGGTGACG ATCCTCGAAC AACGTGATCG CCCCGGTGGT
CGCGCCTATG TCTATCAAAC GAAAGGCTAT ACCTTTGATA GTGGACCGAC GGTGATTACT
GCGCCGTTTA TGTTTGATGA ACTATGGCAG TTAGCCGGTA AGCGACGCGA AGATTATGTG
ACGTTCGAGC CATGCCGACC GTATTATCGT CTCTTCAACC ATGAGGGCCG CTACCTTGAA
TATGGTGATG ATGAACAGGC TTTGCTTGAG CAGATCCGCC GGTGGAATCC GGCTGATGTG
GAAGGGTATC GCCGTTTTAT TGCTAGCACT CGCCCAATCT TTGAAAAGGG TTTTAGCTTG
ATCGATAAGC CGTTTCTTCA TTTTCGCGAT ATGTTACGGG TGACCCCCGA CCTGATCCGG
CTCAAATCGT ATCAAAGTGT TTATCAGTTT GTCTCACAGT TTTTCCAAGA TGATTTTCTC
CGCCGTTGCT TCTCGTTCCA TCCACTCTTT ATCGGGGGGA ATCCGTTTGA CTCAACGTCG
ATCTATGCGA TGGTGCATTA CCTTGAGCGG CGGTGGGGCG TTTATCATGC TCGTGGTGGT
ACCGGTGCTA TTGTAAAAGC AATGGTGCAA CTCTTTACTG AGTTGGGTGG TACACTCGAG
TTGAACGCTA AGGCGGTAGA GATTGTGATC AATGGTCGTC GTGCCAGCGC CGTGCGTACT
CAAGATGGTC GTCTGTTCCC GGCCGATATT GTGGTCTCGA ATGCTGATGT GCCACAGACG
TATATGAGCT TAATCCCACC ACGGTATCGC AAAGTGCAGA CTGACCGCCG CCTGCGTCGG
ATGCGCTACA GTATGTCGCT CGCCGTTATC TATCTCGGTA TTAATCGTCG TTATGATGAT
GGTCGGTTGG TCCAGCACAA TATCATCTTT AGTGAACGGT ATAAGGGATT ACTCGACGAT
ATCTTCAATC GAAAACGGCT TGCCGACGAT TTCTCGCTCT ATCTCCACCG TCCGTCGCAT
AACGATCCAA CGTTAGCTCC GCCTGGTCAC GAGGCGCTCT ATGTCCTAAC GCCGGTGCCT
AATCTGGCGG CGAACATCGA CTGGGCGACG GCTGGGCCTC GCCTACGCGA GGCAATTCTT
ACCTTCCTTG AAGAGCATTA CATGCCTGAT CTGCGGCGCC ATATCGTGGT TGAACATATG
GTCGATCCGC GCTACTACCG TGATGCTCTG AATAGCTACC TCGGTGCCGG CTTTTCGATC
CAACCGCTCT TAACCCAATC TGCGTGGTTC CGTCCACATA ACCGTTCTGA AGATATCGAT
AATCTGTATC TGGTAGGAGC AGGTACTCAT CCCGGTGCCG GCTTACCCGG TGTGATTGCT
TCGGGTGCGA TTGTTGCTCA TTTAGTGGCT CAAGAGGCCT GA
 
Protein sequence
MTQKEIVVIG SGFGGLSAAI RLAAQGHSVT ILEQRDRPGG RAYVYQTKGY TFDSGPTVIT 
APFMFDELWQ LAGKRREDYV TFEPCRPYYR LFNHEGRYLE YGDDEQALLE QIRRWNPADV
EGYRRFIAST RPIFEKGFSL IDKPFLHFRD MLRVTPDLIR LKSYQSVYQF VSQFFQDDFL
RRCFSFHPLF IGGNPFDSTS IYAMVHYLER RWGVYHARGG TGAIVKAMVQ LFTELGGTLE
LNAKAVEIVI NGRRASAVRT QDGRLFPADI VVSNADVPQT YMSLIPPRYR KVQTDRRLRR
MRYSMSLAVI YLGINRRYDD GRLVQHNIIF SERYKGLLDD IFNRKRLADD FSLYLHRPSH
NDPTLAPPGH EALYVLTPVP NLAANIDWAT AGPRLREAIL TFLEEHYMPD LRRHIVVEHM
VDPRYYRDAL NSYLGAGFSI QPLLTQSAWF RPHNRSEDID NLYLVGAGTH PGAGLPGVIA
SGAIVAHLVA QEA
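
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the Python below joins the blocks into one string, computes the GC fraction, and translates the gene sequence. It assumes the amino acid assignments of translation table 11 match the standard genetic code for internal codons (table 11 differs mainly in which start codons it permits). The helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are those of the
# standard code, assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, as printed.
first_line = "ATGACCCAAA AGGAGATTGT CGTAATTGGG"
print(translate(join_blocks(first_line)))  # prints MTQKEIVVIG
```

The translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full joined gene sequence gives a value that can be compared with the GC content field in the record.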