Gene Cagg_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1559 
Symbol 
ID7267336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1907124 
End bp1909436 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content61% 
IMG OID643566401 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_002462897 
Protein GI219848464 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.755878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTATG CAGCCCTGAT TGGCGCAGAA GTGAAACGCC GCGAAGATCC GCGTCTCATC 
CGGGGTCAGG GCACGTATGT GAGCGATTTG CGTCTACCGG GGATGTTATA CGTTGCCATT
GCCCGTAGTC CGTATACCCA CGCCCGGATT ATCGCCATCG ACAAAGCTGC CGCGCTGGCG
ATGCCTGATG TCGTAGCCGT CTATACCGGC ACCGACCTGC TCGCTTGTTG TCAGCCATTG
CCGTTGGCCA GCTCAGGCGA AGGCGGCGGC GGGCCACAGC GCTACACGGG CCGCACTCGC
TATGTGCTGG CGGTAGAGCG GGTGCGTCAC GCCGGTGAAG CGGTCGCCGC AGTGATCGCC
CGGACGCCAG AAGCCGCGCT CGATGCCGCG CTTGCCTTGC CCATTGAATG GGAGCCACTA
CCGGCAGTGG TTGATCCGCT CACCGCAATT GCCCCCGACG CCCCGATCAT CTTCGATGGT
CTGCCTGACA ATATTGACCA CCGTCGTCGC CGGCAAAAGG GTGATGTCGA GGCCGCCTTC
GCCACTGCTC ATCGGGTTAT CCGCCAACGA ATGGTCAATC AACGTCTGCT CGGTTTCCCG
ATGGAAGGAC GCGCAGTGGT TGCTGCGCCC GACCCCGCTA ACGATGGTGT GACGGTATGG
ACGAGTACGC AGACCCCGCA CCAGGTGCGT GGCGAGATCG CTAAAGTGGT CGGCCTTGAC
GAGAATCGGG TGCGCGTGAT TGCGCCTGAT GTCGGCGGTG GCTTCGGGGT TAAGATCGGT
ATCTACCCCG AAGAGGCGCT GCTGGCAGCG CTAGCCCGTC AGCTTAATAC ACCATTGCGT
TGGATCGAAC ACCGCCTCGA ACATGTACAG GCAACGACTC ACGGACGCGG GCAAGTGTGC
GATGTCGAGG CTGCCGTTAC CGCTGATGGC GAAGTGACTG CGCTGCGTAT GCAGATCGTA
GCCGATCTCG GCGCTTATCC TCTCGCCCCC GGTCTACCCG ATCTGACCAC TGCCATGGCT
ATTGGCGTCT ACAAAATCCC TGCCGTCGAT CTGGAAGCAA TTTGTGTTTA TACCAATACC
ACACCGGTCG CTGCCTACCG TGGTGCGGGT CGGCCCGAAG CTGCGTACTA TATCGAACGA
CTGATGGATC TGATTGCCGC TGAATTGCAT ATCGATCCCG CCGAGGTTCG TCGCCGTAAC
TTCATTCCCC CCGACGCCTT CCCGTACAAG ACGCCGACCG GCCTGACGTA TGATAGCGGC
GAGTACGATC GCGCCCTGAC TAAAGCTTTG ACATTATCGC GATACGAACA GTTACGCGCC
GAACAAGCTG CCCGCCGCGC CGCTGATGAC CGGATGTTGC TCGGCATCGG GATTGCCTGT
TATGTCGAGA TGTGCGGCTT CGGCCCCTAC GAAAGCGCTC AAATCAAGGT CGAACCGAGC
GGTACGGTGA CGGTGACGAC CGGCATCTCG CCGCACGGTC AGGGCACTGC CACCACCTTC
GCCCAGATCG TCGCCGACCA GATCGGGGCT GACTTTGAGC GGATTGTGGT TAAGCACAGC
GACACCGCGA TCACGCCGAT GGGTATCGGG ACGATGGGGT CACGGTCGTT GGCCGTTGGT
GGCGCAGCGC TCGTGCGGGC AGCGACAAAG GTACGCGAGA AAGCACGCCA GATTGCGGCA
GCCATGCTTG AAGCTAGTGT GGCCGATATT GAACTGCACG AGGGTCGCTA TCGGGTACGC
GGCGTGCCCG ACCGTGCCCT GACCCTAACC GAGATTGCCC GTCGCGCCTA CAGTAACAAA
CTCCCGCCAG ACCTCGATCC CGGTTTGGAA GCGGTCGATT ACTTCCGTCC ACCCGACCTG
ATCTATCCCT TTGGCGCGCA CGTCGCCGTG GTCGAAGTCG ATCGCGAAAC CGGCCACGTT
CGCATCCGCG AGTACTACTC GGTTGATGAT TGCGGGCCGC GCATTAGCCC ACTGATCGTT
ACCGGTCAGG TGCATGGTGG GTTGGCCCAA GGTATTGCTC AAGCGCTCCT CGAAGAGGTC
GTGTACGACG CAAACGGCCA ATTGCTCAGT GGTACCCTGA TGGATTACGC CTTACCGCGC
GCCGACTTCT TCCCACCCTT CACAGTTGAT AAGACCGAAA CGCCGACTCC GCTCAACCCG
CTCGGCGTCA AGGGTATCGG TGAAGCGGCA ACCATTGGTT CAACACCGGC TATTGCGAAC
GCGGTGATCG ACGCACTCGC ACCGTTTGGC GTGCGCCATC TTGATATTCC ACTCCGCTCA
GAAAAGATCT GGCGAGCAAT CCACGGCCGA TAA
 
Protein sequence
MAYAALIGAE VKRREDPRLI RGQGTYVSDL RLPGMLYVAI ARSPYTHARI IAIDKAAALA 
MPDVVAVYTG TDLLACCQPL PLASSGEGGG GPQRYTGRTR YVLAVERVRH AGEAVAAVIA
RTPEAALDAA LALPIEWEPL PAVVDPLTAI APDAPIIFDG LPDNIDHRRR RQKGDVEAAF
ATAHRVIRQR MVNQRLLGFP MEGRAVVAAP DPANDGVTVW TSTQTPHQVR GEIAKVVGLD
ENRVRVIAPD VGGGFGVKIG IYPEEALLAA LARQLNTPLR WIEHRLEHVQ ATTHGRGQVC
DVEAAVTADG EVTALRMQIV ADLGAYPLAP GLPDLTTAMA IGVYKIPAVD LEAICVYTNT
TPVAAYRGAG RPEAAYYIER LMDLIAAELH IDPAEVRRRN FIPPDAFPYK TPTGLTYDSG
EYDRALTKAL TLSRYEQLRA EQAARRAADD RMLLGIGIAC YVEMCGFGPY ESAQIKVEPS
GTVTVTTGIS PHGQGTATTF AQIVADQIGA DFERIVVKHS DTAITPMGIG TMGSRSLAVG
GAALVRAATK VREKARQIAA AMLEASVADI ELHEGRYRVR GVPDRALTLT EIARRAYSNK
LPPDLDPGLE AVDYFRPPDL IYPFGAHVAV VEVDRETGHV RIREYYSVDD CGPRISPLIV
TGQVHGGLAQ GIAQALLEEV VYDANGQLLS GTLMDYALPR ADFFPPFTVD KTETPTPLNP
LGVKGIGEAA TIGSTPAIAN AVIDALAPFG VRHLDIPLRS EKIWRAIHGR