Gene Information |
Locus tag | Cagg_3386 |
Symbol | |
ID | 7267126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4103104 |
End bp | 4106193 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643568195 |
Product | Fe-S-cluster-containing hydrogenase components 1-like protein |
Protein accession | YP_002464666 |
Protein GI | 219850233 |
COG category | [C] Energy production and conversion |
COG ID | [COG0437] Fe-S-cluster-containing hydrogenase components 1 |
TIGRFAM ID | |
|
|
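The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both assumptions match the reported numbers but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information table.
length = gene_length_bp(4103104, 4106193)
print(length)                     # 3090, matches "Gene Length"
print(protein_length_aa(length))  # 1029, matches "Protein Length"
```

The strand field does not affect this calculation, since the length is the same on either strand.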
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.364426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGC ATCAATCTGA TCTCGAGGCC ATTCGCGCTC AGTTGCGCGA CGCGCGCGGA CCACAGTTCT GGCGTTCGCT CGACCAACTG GCCGATTCGC CGGCGTTCCG TGAATTAGTG GAGCGCGAAT TTCCGCGCGG CGCGAGCGAG ATGAGCGATG GAATGAGCCG GCGCACGTTC CTCAAGCTGA TGGGTGCCTC ACTAGCGCTA GCCGGTGTGA CGGCTTGTAC CTATCAGCCA CGCCAGTATA TCGCGCCATT CGATCGCCAA CCCGAAGGGC GTATTCCCGG AGTACCACAA TACTTCGCGT CGACACTGAC GCTCGGTGGC TACGGTACCG GCGTACTGGT ACGCGCGAAT GAGGGCCGCC CGACGAAGGT TGAGGGGAAC CCGCGCCACC CGGCCAGTCT CGGCAGCACC GATCTGTTTG CTCAGGCCGA GATTCTGACG ATGTACGACC CTGATCGCTC AACGACGGTG CTGCGCCAAG GGGTACCAAG TACGTGGGCC GAATTTACCA CGACCCTCGC GAATGCATTG ACGGCAGCGC AGGCAACACA AGGCGCTGGC GTGCGTCTCT TGACCACGAC GGTGACGTCG CCGTCGCTGG CTGCCCAAAT TGAGCAGTTC TTGCAGGCTT ATCCACAGGC ACGCTGGTAT CAGTACGAGC CGGTTAATCG CGATAATGTC GTGGAAGGCG CACGCCTTGC GTTTGGCCGT GATGTTACCA CGCGCTACGA TTTGGCAGCC GCCCAGGTAA TTGTCAGCCT CGACGCCGAC TTCCTCGCGC CCGGCCCCGG CTTCATCGCA TATGCCCGCG CCTTTGCCGA TGGCCGTAAG GTGCGGAAAG ATAGCACCGG TATGAACCGG CTGTACGTGA TCGAGGCAAG CCCCTCGACG ACCGGCACGG CAGCCGATCA CCGACTGGCG CTACGGGCCG ATGCGATCGC CGCGTTTGCC GGCGCTCTGG CCCACGAACT CGGTATTGGT GGAGCACCGG CAACCCTTGC TGCGAAAGCT GAAGAGTTCT TGAAGGCGAT AGCCAAAGAC CTTGAAGAGC ACCGTGGGCG CTCGGTAGTC ATTGCCGGCG ACCAGCAGCC ACCTATTGTG CATGCCCTGG CCCACCTGAT CAACGCTGAG CTGGGCAACG TCGGCAAGAC GGTCTTCTAT CACGAACCGG TGGAAGCCCG TCCGACTAAT CAAACAAACG AGCTGGTAAC GCTGGTCAGC GAGATGGCTG CCGGTCGAGT GGAGCTGCTC GTTATGATCG GCGGCAACCC GGTCTACAAC GCTCCTGGCG ACCTGCGTTT TGCCGAACGG ATGGCCACGG TCCCGCTGAC CGTTCACTTA AGCCAGTTCG TCGACGAGAC TTCGGTACAG GCGACGTGGC ATATTCCGCA GGCCCACCCA CTGGAAAGCT GGGGGGATGC GCGTGCCTTT GACGGGACGG CCAGTATCGT GCAACCACTG ATTGAGCCAC TCTACGGCGG TAAAACGGCC AACGAGTTGC TGGCAGCAAT GCTCGGTCAA CCCGATGCGG AAAGCTACGA TCTGGTGCGC GGTTACTGGG AGGAACGGAT CGGCAATACC AATTGGAATG TGGCACTGGC CACCGGCGTG ATCGCCGATA CATCTGCTCC GGTGATTAAT CCAACTCTCA ACGAAGCAGC GATTCGCGCC ACTGCGATCC CCCAACCCGG TGACGGTGTT GAAATCGTCT TCCGGCCAGA TCCATCAGTT TTCGATGGCT TCTATGCAAA TAACGGTTGG CTACAAGAGC TACCACGCCC GCTCACCAAG 
CTGGTTTGGG ATAACGCCGC GTTGATGAGT CCACGGACTG CGATCAAGCT CCTTGGTTTA CCCTTCAGTG CCGACCGACT GGTAGGCAAC GAAGCCGATG ACCGCGAGCG CCAACGCTAC CTCGAACAAC TCTCGAAAGT CAACGGGACG ATTGCACGGA TCGAGTACCG TGGTGGAGTT GTAGAACTGC CCATCTGGCT CCTCCCCGGC CACGCCGAAG ACTCGATTAC GCTGAACCTC GGTTATGGCC GCACCAATGC GGGCCGGGTC GGCAATGGCG TGGGGATTAA TGTCTACCCC ATCCGCACGA GCGATAGCCC ATGGTTTGGC GCCGGTGCGC GTGTCACCAA CACCGGCAGC ACTTACTTGC TGGTCAGCAC TCAAGATCAC TGGACGCTCG AAGGACGCGA TATCTATCGC GTTGGCGAGT TTAAGAAGTT CAAGGAAGAC CCCAAGTACA TCGCCAAAGA GGTATACAAA GAGGAGTATG GTCGCGAAGC TCCCACGTAT CTCTCGTTAC AACCCGGCGA TAACTACGCC GGACGCAACG CCTGGGGTAT GACCATCAAC CTCAATGCGT GTATCGGCTG CAATGCCTGC GTTGTCGCTT GCCAAGCGGA AAACAACATC GCTGTCGTCG GTAAAGATCA AGTCTCACGC GGTCGCGAAA TGCACTGGAT CCGCATCGAC CGGTACTTCG CGGGTGAAGA TCTCGACAAC CCGGCCATCT ACATGATGCC TGTCAACTGT ATGCAGTGTG AAAAGGCACC GTGTGAGGTC GTTTGCCCGG TTGCTGCCAC CGTGCATGAT TACGAGGGTC TGAACAACAT GGTGTATAAT CGCTGTGTCG GCACGAAGTA TTGCTCGAAC AACTGCCCGT ATAAAGTACG GCGGTTCAAC TTCTTGCAAT ACAGCGATAC GACAACCGAG ACCTTCAAGC TCGCGTTCAA CCCAGATGTG ACGGTGCGTA TCCGAGGTGT GATGGAGAAG TGTACCTACT GTGTGCAACG CATTAGCGGC GCACGCATTG CCGCCAAACG CGCTGCGGTA CAGGCTGGAC AATCGTCGTA TGTCATCAGC GATGGCGCCA TTCAAACCGC TTGTGAACAG GCATGTCCGA CCGGTGCAAT CGTGTTCGGC GACATCAACG ATCCGAGCAG CCGTGTCGCA AAGTGGAAGG CGGAAGGTCA CAACTATAGC CTCCTCGGCT TCCTCAACAC CTTACCGCGC ACGACATATC TGGCCCGTGT CCGCAACCCG TCTGAAGATC TAGAAAAGGT GGAAGGCTAG
|
Protein sequence | MTQHQSDLEA IRAQLRDARG PQFWRSLDQL ADSPAFRELV EREFPRGASE MSDGMSRRTF LKLMGASLAL AGVTACTYQP RQYIAPFDRQ PEGRIPGVPQ YFASTLTLGG YGTGVLVRAN EGRPTKVEGN PRHPASLGST DLFAQAEILT MYDPDRSTTV LRQGVPSTWA EFTTTLANAL TAAQATQGAG VRLLTTTVTS PSLAAQIEQF LQAYPQARWY QYEPVNRDNV VEGARLAFGR DVTTRYDLAA AQVIVSLDAD FLAPGPGFIA YARAFADGRK VRKDSTGMNR LYVIEASPST TGTAADHRLA LRADAIAAFA GALAHELGIG GAPATLAAKA EEFLKAIAKD LEEHRGRSVV IAGDQQPPIV HALAHLINAE LGNVGKTVFY HEPVEARPTN QTNELVTLVS EMAAGRVELL VMIGGNPVYN APGDLRFAER MATVPLTVHL SQFVDETSVQ ATWHIPQAHP LESWGDARAF DGTASIVQPL IEPLYGGKTA NELLAAMLGQ PDAESYDLVR GYWEERIGNT NWNVALATGV IADTSAPVIN PTLNEAAIRA TAIPQPGDGV EIVFRPDPSV FDGFYANNGW LQELPRPLTK LVWDNAALMS PRTAIKLLGL PFSADRLVGN EADDRERQRY LEQLSKVNGT IARIEYRGGV VELPIWLLPG HAEDSITLNL GYGRTNAGRV GNGVGINVYP IRTSDSPWFG AGARVTNTGS TYLLVSTQDH WTLEGRDIYR VGEFKKFKED PKYIAKEVYK EEYGREAPTY LSLQPGDNYA GRNAWGMTIN LNACIGCNAC VVACQAENNI AVVGKDQVSR GREMHWIRID RYFAGEDLDN PAIYMMPVNC MQCEKAPCEV VCPVAATVHD YEGLNNMVYN RCVGTKYCSN NCPYKVRRFN FLQYSDTTTE TFKLAFNPDV TVRIRGVMEK CTYCVQRISG ARIAAKRAAV QAGQSSYVIS DGAIQTACEQ ACPTGAIVFG DINDPSSRVA KWKAEGHNYS LLGFLNTLPR TTYLARVRNP SEDLEKVEG
|
| |
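The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC content. The following is a hedged sketch rather than the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG although the gene lies on the minus strand of the replicon), and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons that are read as M at the first position. That case is not handled here because this gene starts with ATG. The helper names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGACACAGC ATCAATCTGA TCTCGAGGCC"))  # MTQHQSDLEA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after the same whitespace cleanup) checks the two fields against each other, and `gc_percent` on the full gene sequence can be compared with the reported GC content.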