Gene Cagg_3386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3386 
Symbol 
ID7267126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4103104 
End bp4106193 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content59% 
IMG OID643568195 
ProductFe-S-cluster-containing hydrogenase components 1-like protein 
Protein accessionYP_002464666 
Protein GI219850233 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.364426 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAGC ATCAATCTGA TCTCGAGGCC ATTCGCGCTC AGTTGCGCGA CGCGCGCGGA 
CCACAGTTCT GGCGTTCGCT CGACCAACTG GCCGATTCGC CGGCGTTCCG TGAATTAGTG
GAGCGCGAAT TTCCGCGCGG CGCGAGCGAG ATGAGCGATG GAATGAGCCG GCGCACGTTC
CTCAAGCTGA TGGGTGCCTC ACTAGCGCTA GCCGGTGTGA CGGCTTGTAC CTATCAGCCA
CGCCAGTATA TCGCGCCATT CGATCGCCAA CCCGAAGGGC GTATTCCCGG AGTACCACAA
TACTTCGCGT CGACACTGAC GCTCGGTGGC TACGGTACCG GCGTACTGGT ACGCGCGAAT
GAGGGCCGCC CGACGAAGGT TGAGGGGAAC CCGCGCCACC CGGCCAGTCT CGGCAGCACC
GATCTGTTTG CTCAGGCCGA GATTCTGACG ATGTACGACC CTGATCGCTC AACGACGGTG
CTGCGCCAAG GGGTACCAAG TACGTGGGCC GAATTTACCA CGACCCTCGC GAATGCATTG
ACGGCAGCGC AGGCAACACA AGGCGCTGGC GTGCGTCTCT TGACCACGAC GGTGACGTCG
CCGTCGCTGG CTGCCCAAAT TGAGCAGTTC TTGCAGGCTT ATCCACAGGC ACGCTGGTAT
CAGTACGAGC CGGTTAATCG CGATAATGTC GTGGAAGGCG CACGCCTTGC GTTTGGCCGT
GATGTTACCA CGCGCTACGA TTTGGCAGCC GCCCAGGTAA TTGTCAGCCT CGACGCCGAC
TTCCTCGCGC CCGGCCCCGG CTTCATCGCA TATGCCCGCG CCTTTGCCGA TGGCCGTAAG
GTGCGGAAAG ATAGCACCGG TATGAACCGG CTGTACGTGA TCGAGGCAAG CCCCTCGACG
ACCGGCACGG CAGCCGATCA CCGACTGGCG CTACGGGCCG ATGCGATCGC CGCGTTTGCC
GGCGCTCTGG CCCACGAACT CGGTATTGGT GGAGCACCGG CAACCCTTGC TGCGAAAGCT
GAAGAGTTCT TGAAGGCGAT AGCCAAAGAC CTTGAAGAGC ACCGTGGGCG CTCGGTAGTC
ATTGCCGGCG ACCAGCAGCC ACCTATTGTG CATGCCCTGG CCCACCTGAT CAACGCTGAG
CTGGGCAACG TCGGCAAGAC GGTCTTCTAT CACGAACCGG TGGAAGCCCG TCCGACTAAT
CAAACAAACG AGCTGGTAAC GCTGGTCAGC GAGATGGCTG CCGGTCGAGT GGAGCTGCTC
GTTATGATCG GCGGCAACCC GGTCTACAAC GCTCCTGGCG ACCTGCGTTT TGCCGAACGG
ATGGCCACGG TCCCGCTGAC CGTTCACTTA AGCCAGTTCG TCGACGAGAC TTCGGTACAG
GCGACGTGGC ATATTCCGCA GGCCCACCCA CTGGAAAGCT GGGGGGATGC GCGTGCCTTT
GACGGGACGG CCAGTATCGT GCAACCACTG ATTGAGCCAC TCTACGGCGG TAAAACGGCC
AACGAGTTGC TGGCAGCAAT GCTCGGTCAA CCCGATGCGG AAAGCTACGA TCTGGTGCGC
GGTTACTGGG AGGAACGGAT CGGCAATACC AATTGGAATG TGGCACTGGC CACCGGCGTG
ATCGCCGATA CATCTGCTCC GGTGATTAAT CCAACTCTCA ACGAAGCAGC GATTCGCGCC
ACTGCGATCC CCCAACCCGG TGACGGTGTT GAAATCGTCT TCCGGCCAGA TCCATCAGTT
TTCGATGGCT TCTATGCAAA TAACGGTTGG CTACAAGAGC TACCACGCCC GCTCACCAAG
CTGGTTTGGG ATAACGCCGC GTTGATGAGT CCACGGACTG CGATCAAGCT CCTTGGTTTA
CCCTTCAGTG CCGACCGACT GGTAGGCAAC GAAGCCGATG ACCGCGAGCG CCAACGCTAC
CTCGAACAAC TCTCGAAAGT CAACGGGACG ATTGCACGGA TCGAGTACCG TGGTGGAGTT
GTAGAACTGC CCATCTGGCT CCTCCCCGGC CACGCCGAAG ACTCGATTAC GCTGAACCTC
GGTTATGGCC GCACCAATGC GGGCCGGGTC GGCAATGGCG TGGGGATTAA TGTCTACCCC
ATCCGCACGA GCGATAGCCC ATGGTTTGGC GCCGGTGCGC GTGTCACCAA CACCGGCAGC
ACTTACTTGC TGGTCAGCAC TCAAGATCAC TGGACGCTCG AAGGACGCGA TATCTATCGC
GTTGGCGAGT TTAAGAAGTT CAAGGAAGAC CCCAAGTACA TCGCCAAAGA GGTATACAAA
GAGGAGTATG GTCGCGAAGC TCCCACGTAT CTCTCGTTAC AACCCGGCGA TAACTACGCC
GGACGCAACG CCTGGGGTAT GACCATCAAC CTCAATGCGT GTATCGGCTG CAATGCCTGC
GTTGTCGCTT GCCAAGCGGA AAACAACATC GCTGTCGTCG GTAAAGATCA AGTCTCACGC
GGTCGCGAAA TGCACTGGAT CCGCATCGAC CGGTACTTCG CGGGTGAAGA TCTCGACAAC
CCGGCCATCT ACATGATGCC TGTCAACTGT ATGCAGTGTG AAAAGGCACC GTGTGAGGTC
GTTTGCCCGG TTGCTGCCAC CGTGCATGAT TACGAGGGTC TGAACAACAT GGTGTATAAT
CGCTGTGTCG GCACGAAGTA TTGCTCGAAC AACTGCCCGT ATAAAGTACG GCGGTTCAAC
TTCTTGCAAT ACAGCGATAC GACAACCGAG ACCTTCAAGC TCGCGTTCAA CCCAGATGTG
ACGGTGCGTA TCCGAGGTGT GATGGAGAAG TGTACCTACT GTGTGCAACG CATTAGCGGC
GCACGCATTG CCGCCAAACG CGCTGCGGTA CAGGCTGGAC AATCGTCGTA TGTCATCAGC
GATGGCGCCA TTCAAACCGC TTGTGAACAG GCATGTCCGA CCGGTGCAAT CGTGTTCGGC
GACATCAACG ATCCGAGCAG CCGTGTCGCA AAGTGGAAGG CGGAAGGTCA CAACTATAGC
CTCCTCGGCT TCCTCAACAC CTTACCGCGC ACGACATATC TGGCCCGTGT CCGCAACCCG
TCTGAAGATC TAGAAAAGGT GGAAGGCTAG
 
Protein sequence
MTQHQSDLEA IRAQLRDARG PQFWRSLDQL ADSPAFRELV EREFPRGASE MSDGMSRRTF 
LKLMGASLAL AGVTACTYQP RQYIAPFDRQ PEGRIPGVPQ YFASTLTLGG YGTGVLVRAN
EGRPTKVEGN PRHPASLGST DLFAQAEILT MYDPDRSTTV LRQGVPSTWA EFTTTLANAL
TAAQATQGAG VRLLTTTVTS PSLAAQIEQF LQAYPQARWY QYEPVNRDNV VEGARLAFGR
DVTTRYDLAA AQVIVSLDAD FLAPGPGFIA YARAFADGRK VRKDSTGMNR LYVIEASPST
TGTAADHRLA LRADAIAAFA GALAHELGIG GAPATLAAKA EEFLKAIAKD LEEHRGRSVV
IAGDQQPPIV HALAHLINAE LGNVGKTVFY HEPVEARPTN QTNELVTLVS EMAAGRVELL
VMIGGNPVYN APGDLRFAER MATVPLTVHL SQFVDETSVQ ATWHIPQAHP LESWGDARAF
DGTASIVQPL IEPLYGGKTA NELLAAMLGQ PDAESYDLVR GYWEERIGNT NWNVALATGV
IADTSAPVIN PTLNEAAIRA TAIPQPGDGV EIVFRPDPSV FDGFYANNGW LQELPRPLTK
LVWDNAALMS PRTAIKLLGL PFSADRLVGN EADDRERQRY LEQLSKVNGT IARIEYRGGV
VELPIWLLPG HAEDSITLNL GYGRTNAGRV GNGVGINVYP IRTSDSPWFG AGARVTNTGS
TYLLVSTQDH WTLEGRDIYR VGEFKKFKED PKYIAKEVYK EEYGREAPTY LSLQPGDNYA
GRNAWGMTIN LNACIGCNAC VVACQAENNI AVVGKDQVSR GREMHWIRID RYFAGEDLDN
PAIYMMPVNC MQCEKAPCEV VCPVAATVHD YEGLNNMVYN RCVGTKYCSN NCPYKVRRFN
FLQYSDTTTE TFKLAFNPDV TVRIRGVMEK CTYCVQRISG ARIAAKRAAV QAGQSSYVIS
DGAIQTACEQ ACPTGAIVFG DINDPSSRVA KWKAEGHNYS LLGFLNTLPR TTYLARVRNP
SEDLEKVEG