Gene Information |
Locus tag | Cagg_0471 |
Symbol | |
ID | 7266639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 581974 |
End bp | 583611 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643565334 |
Product | nickel-dependent hydrogenase large subunit |
Protein accession | YP_002461848 |
Protein GI | 219847415 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
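The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 581974
END_BP = 583611
GENE_LENGTH_BP = 1638
PROTEIN_LENGTH_AA = 545


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1638
print(expected_protein_length(GENE_LENGTH_BP))  # 545
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.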
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.980581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.166741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAATA GTATCGAGAA TCAGGCAACG ATCACCGGGC GCGATCTCCG CATCAGCCCG CTTGGCCGCG TCGAGGGAGA CCTCGACTTG CGGGTCACAA TCCGCGATGG GGTTGTGACC AGCGCATGGA CTGAGGCCTC GATGTTTCGC GGCTTTGAGA TCATTCTGAA GGGGAAGGAT CCGCAGGCCG GCTTGATCGT TACGCCGCGT ATTTGCGGCA TCTGCGGCGG CAGCCACTTG ACCAAAGCGG TCTATGCGCT CGATACGGCA TGGCAAACCG AATTACCGCC CAATGCGACC CTAATTCGCA ATATTGCGCA AGCTTGCGAG ACGCTGCAAA GCATTCCGCG CTGGTTCTAC GCTCTGTTTG CGATCGACCT AACCAACAAG AAGTACGCCC ACCTGCCCGA ATACGATGAG GCGGTGCGCC GGTTCGCCCC CTTTGTCGGC ACGAGCTATG AGCACGGCGT GACTCTCTCG AATAAGCCGG TCGAGATTTA CGCCATCTTC GGCGGCCAGT GGCCCCACTC CAGCTTCATG ATCCCCGGCG GCGTGATGTG CGCGCCGACG CTGGCTGATG TCACCCGAGC CATCGCCATC CTCGAATACT GGAAGGATGA ATGGCTGGAG AAGAAGTGGC TCGGTTGCTC GGTCGATCGC TGGCTGCAGA ACAAGAGCTG GGCCGATGTG ATGGAGTGGA TGAACGAGAA CGAGCGCCAC TACAACTCTG ACTGCGGCTT CTTCATCCGT TTTGCGATGG CCGCTGGTCT TGACAAGTAT GGCGCCGGTT GGAATAACTA CATTGCCACC GGTACCTACT TCCACCCCGA ACTGTACGCC CGCCCAACCA TCGAGGGGAG AAATGCGGCG CTGATCGCGC GCTCCGGCGT GTATGTCAAC GGCCAGTTCT ACGATTTCGA TCAGGCCAAC GTGCGCGAAG ACGTGACCCA CTCGTTCTAC GAGGGGAATC ACGCGCTGCA CCCGTTTGAG GGACGCACTG AGCCAATTGA TCCGGCGATC GGACATCGAC AAGGGAAGTA CTCGTGGGCG AAAGCGCCAC GCTATCTCAT CCCCGGCGTT GGCAGTCAGC CGGTCGAGGC CGGCCCACTG GCCCGCCAAG TCATCGCCGG TCGGCCCGGC GCCGCCAATT GGCAAGACTA CGACCCGCTC TTCCTCGATG CGGTGACGAC GGTCGGGCCG AGTGTGCTGG TGCGCGTGAT GGCCCGGATG CACGAAGCGC CCAAGTATTA CAAGCTGGTG CGCAAGTGGC TTGACCAGAT CAATCTGCAC GAGAAGTTTT ATATCAAACC GAAGGAGCTG CCCGAAGGGC GTGGGTTTGG TTCGACCGAA GCCGCTCGCG GCAGCCTCTC GGACTGGATC GTGCTTAAGG ATGGTAAGAT CGAGAACTAT CAGGTGGTGA CGCCGACGGC ATGGAACATC GGGCCGCGCG ATGGCCGCGA TGTCAATGGG CCGATGGAGC AAGCCTTCCT CGGCGCGCCG ATTGCCGATC CCAACGATCC GGTCGAACTC GGCCATGTGG CGCGGAGCTA CGACTCGTGC CTCGTCTGTA CAGTGCATGC CTACGACGAA AAGACCGGCA AGGAGTTGGC GCGGTTCCGC ATTGGTGAAG GCGCGTAG
|
Protein sequence | MVNSIENQAT ITGRDLRISP LGRVEGDLDL RVTIRDGVVT SAWTEASMFR GFEIILKGKD PQAGLIVTPR ICGICGGSHL TKAVYALDTA WQTELPPNAT LIRNIAQACE TLQSIPRWFY ALFAIDLTNK KYAHLPEYDE AVRRFAPFVG TSYEHGVTLS NKPVEIYAIF GGQWPHSSFM IPGGVMCAPT LADVTRAIAI LEYWKDEWLE KKWLGCSVDR WLQNKSWADV MEWMNENERH YNSDCGFFIR FAMAAGLDKY GAGWNNYIAT GTYFHPELYA RPTIEGRNAA LIARSGVYVN GQFYDFDQAN VREDVTHSFY EGNHALHPFE GRTEPIDPAI GHRQGKYSWA KAPRYLIPGV GSQPVEAGPL ARQVIAGRPG AANWQDYDPL FLDAVTTVGP SVLVRVMARM HEAPKYYKLV RKWLDQINLH EKFYIKPKEL PEGRGFGSTE AARGSLSDWI VLKDGKIENY QVVTPTAWNI GPRDGRDVNG PMEQAFLGAP IADPNDPVEL GHVARSYDSC LVCTVHAYDE KTGKELARFR IGEGA
|
| |
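The relation between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon-to-amino-acid mapping from the compact NCBI layout (bases ordered T, C, A, G), which for translation table 11 assigns the same amino acids as the standard code; table 11 differs in its permitted start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). The helper names `clean` and `translate` are hypothetical, and only the first and last few groups of the sequence are used.

```python
# Amino acids for the 64 codons, in NCBI order (first, second, third base
# each cycling through T, C, A, G). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon.

    Any trailing bases that do not form a full codon are ignored.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups and last two groups of the gene sequence above.
print(translate("ATGGTCAATA GTATCGAGAA TCAGGCAACG"))  # MVNSIENQAT
print(translate("ATTGGTGAAG GCGCGTAG"))               # IGEGA
```

The two outputs match the first ten and the last five residues of the protein sequence in the record. Passing the full gene sequence to `translate` should reproduce the whole protein under the same assumptions.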