Gene Cagg_0851 details

Gene Information

Locus tag: Cagg_0851
Symbol:
ID: 7268303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1059006
End bp: 1060487
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 61%
IMG OID: 643565699
Product: chlorophyllide reductase subunit Z
Protein accession: YP_002462208
Protein GI: 219847775
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01278] light-independent protochlorophyllide reductase, B subunit
            [TIGR02014] chlorophyllide reductase subunit Z
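
The start, end, gene length and protein length values in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `cds_span` and `expected_protein_length` are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon that is not counted in the protein length.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1059006, 1060487
gene_length_bp, protein_length_aa = 1482, 493

print(cds_span(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which suggests the listed coordinates and lengths are mutually consistent.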


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00549555
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATTA CGCTGATCCG TGATATTTCC GATACCAGTA GCTACTGGGG TGTGTCGTGG 
GTGTTCGGCT GCTTCCCCGA CGTTCATATC GTCTGCGATG CACCGATCGG CTGCTACAAC
CTGCTCGGTA TGGCAGTGAC CGACTATACC GATGCGCTAC CCCACATGGC AAACCTCACC
CCGACCTCGA TCCGCGAGGA GGATGTGATC AACGGTACGG CCAAGGCGCT GATCCGTACC
ATCGACGATC TGCGCACGAT GGGGATGCTG GCAGGCAAAC GGCTGCTGGT CGTTTCGACC
GCCGAGAGCG AGATGATCAG CGCCGATCAC GCTCAACTAG TAGCGCAGAT CGATCCCGAA
GCGCGGTTCT TCTGGAGCCA ATCACTCGAA CAGGATGAGT GGACGGGACG CGAGCGAGCG
TTATTGTTTG CGTGGGAACA GTACGGTAAA CCATTTGTGC CGGCAGATGT GCAACCACGT
CCGCGCACGG TCAATATCAT CGGCCCCTCG TTGGGATGTT TTAACGCACC TAGCGACCTC
TACGAACTCA AGCGGTTGAT CACCGGCATC GGCGCCGAGA TTAACCTCGT CTACCCCTAC
GAAGGCAGTA TCGCTACCAC CCCCAAGCTG GCGGAAGCGG CAGTCAACGT CGTGATGTAC
CGCGAGTTTG GTCAAGGTCT AGCCGAAGCA TTGGGCCGGC CCTACCTCTT CGCGCCGTTC
GGGGTCTTTG GCACCACTGC GTTTCTGCGC GAACTAGGCC AACTCCTTGG GATTGAGCCG
GAGCGTGTCG AAGCCTTCAT CAACCACGAA AAGCGCACTA CACTGCAACC GGTGTGGGAT
CTGTGGCGCG GCCCGCAGAG TGACTGGTTT GCGACGGTTG ATTGCGCCAT TGTCGCGGCG
CGCAGCTACG CCGACGGGTT GCGCAGCTTT CTCGGCGACG AGCTGGGGAT GAAGATCGCG
TGGATCTCGG GGCGACCCCG CCGCGACGAC GAGCCGGATA ATATCGAGAT TCGGAAGCGG
TTGCACGCCA AGGCGCCGGC GTTCGTGTTC GGTAGCATTA ACGAGAAGAT CTATCTGGCC
GAAGCTAATG CGCGCGGTAC GCACTATATC CCGGCCACCT TCCCCGGCCC GGTGGTGCGG
CGCACAACGG GCACGCCGTT TATGGGGTAT GCCGGCGCGG CCAACCTGAT GCAAGAGCTG
GTCAACCGCT TCTACGAGAC GGTGATCAAC TTCTTGCCGG TCGAGACGGT AACACCGGCA
GCAGGTGGGC CACCGCAGCC AACCTCTGCC GAAACGATAC CGTGGACGAA AGAGGCGACC
GACCGGCTCA ACGCTGCGCT CGACGCGGTG CCCTACCTTG CCCGCATCAG CGCCAGCCGT
TCGCTGCGCG CCGCCGCCGA GCAAGCAGCG CGGGCGCGCG GGTTGAAAGA AGTCACGCTC
GAAATCATCG AAGCGGCAAT TGCTCAGGGC GCGACCTCGT GA
 
Protein sequence
MSITLIRDIS DTSSYWGVSW VFGCFPDVHI VCDAPIGCYN LLGMAVTDYT DALPHMANLT 
PTSIREEDVI NGTAKALIRT IDDLRTMGML AGKRLLVVST AESEMISADH AQLVAQIDPE
ARFFWSQSLE QDEWTGRERA LLFAWEQYGK PFVPADVQPR PRTVNIIGPS LGCFNAPSDL
YELKRLITGI GAEINLVYPY EGSIATTPKL AEAAVNVVMY REFGQGLAEA LGRPYLFAPF
GVFGTTAFLR ELGQLLGIEP ERVEAFINHE KRTTLQPVWD LWRGPQSDWF ATVDCAIVAA
RSYADGLRSF LGDELGMKIA WISGRPRRDD EPDNIEIRKR LHAKAPAFVF GSINEKIYLA
EANARGTHYI PATFPGPVVR RTTGTPFMGY AGAANLMQEL VNRFYETVIN FLPVETVTPA
AGGPPQPTSA ETIPWTKEAT DRLNAALDAV PYLARISASR SLRAAAEQAA RARGLKEVTL
EIIEAAIAQG ATS
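
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record: the helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its GC fraction is not expected to match the whole-gene value reported under Gene Information.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCATTA CGCTGATCCG TGATATTTCC GATACCAGTA GCTACTGGGG TGTGTCGTGG"

seq = join_blocks(first_line)
print(len(seq))          # number of bases on this line
print(gc_fraction(seq))  # GC fraction of this line only
```

Applying `join_blocks` to the full gene sequence and then `gc_fraction` would give a value to compare against the reported GC content.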