Gene Information |
Locus tag | Hhal_0352 |
Symbol | |
ID | 4711322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 407682 |
End bp | 408860 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 639854815 |
Product | 4Fe-4S ferredoxin iron-sulfur binding domain-containing protein |
Protein accession | YP_001001948 |
Protein GI | 121997161 |
COG category | [C] Energy production and conversion |
COG ID | [COG1143] Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) |
TIGRFAM ID | |
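The length fields above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption about the coordinate convention, though it agrees with the listed values) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(407682, 408860)
print(length, protein_length(length))  # 1179 392
```

Both results match the Gene Length and Protein Length fields.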
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0028801 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAGC GGCCCCACGG CGGTGCGGGC GGTTTCCGGG TGCTGCGCCC GGAAATCGGC AGGCTGGCGT TCCGCCCGGA GGCCTGCCTG CCCCGGCGCA CCCCGCTGTC CGCGTGTCAG GCCTGCGCCC GGGCGTGTCC GGTGGAGGCC CTGCACCTGG CGAACGGCCC GGCGCAGCTT ACGGACCACT GCCTGAGCTG CGGGCAGTGT GTCGCGGCCT GCCCCACGGG CGCCCTGCAG GTACCGGGCT TCGGCGTGGC GGTGGCCGAC GACGCGCCGG CGCGGCTCGA TTGCTACCGG GTCGATCGGA CGGACGCCGC GGACGTCCGG GTGCCTTGCC TGGGCGGGCT CGGCGCCGCC GATCTGCTCG AGCTCCATAC CGCCGGCGGC GGCCGGGGGC CCGTGCTGCT GGATCGGGGC TGGTGCGCCG ACTGCCCGCT GGGCGGCCCC GAACACCCGG CCCAGCGGGC CATCGAGCAG GCGGCTGATC TGCTGGCCGG GGGCCGGCCC GCCGAAGCCC TTCAGCCCCG CATCGAGACC GCCCCGCTCC CTTCCGCCCG CGCGCAGGCC CCGCTGCCCG GGGTTGCCGA CGAGGCGCCG GTGAGCCGCC GCGACCTGCT GCGCCGTCTG GCCGGCCAGG CCGGCGCCAC CGCCCGGGTC CTGGACGACG ACGCCACCCC CGCGCACACC GCCCCGCTGC GCCACAAGAT CACGCCCCGA CCGCGTACCC GGATGCTCAA CGCCCTGGCG GCGCTCGGCG TCACGCCACC CCCGGCGCTC ACCCCCCGGG TGACCATCCA CGGTGGCTGC CAGGATCACG GCGTCTGCGC CGCACTCTGC CCCACCGGGG CCCTCCACCG GGGCGAGGAG GCCGACGGCA GCGGGCTGGA TTTCGACCCG GCCGCGTGCA TCGCCTGCCA ACTCTGCCAG CAGGCCTGCC CCGAACAGGC CCTCACCGTC ACCGCGACGG GCGGCGCCCC GGCGCCGCAT CCGCTGACCC GGCATGCGCA GCGCCCCTGC GCCGACTGCC AACGCACCTT CCCGGCCGCG GCAGACGAGG CGCTCTGCCC GGCGTGCCGC AAGTCCCGGG AGTTCGCCCG GGACGCCGCC CAGCGATTCC GCCCGCCTCG CCCGGGGCGC GGCAGCGACG CAACCGATCC ACAGGAGGAG AGCGCATGA
|
Protein sequence | MPERPHGGAG GFRVLRPEIG RLAFRPEACL PRRTPLSACQ ACARACPVEA LHLANGPAQL TDHCLSCGQC VAACPTGALQ VPGFGVAVAD DAPARLDCYR VDRTDAADVR VPCLGGLGAA DLLELHTAGG GRGPVLLDRG WCADCPLGGP EHPAQRAIEQ AADLLAGGRP AEALQPRIET APLPSARAQA PLPGVADEAP VSRRDLLRRL AGQAGATARV LDDDATPAHT APLRHKITPR PRTRMLNALA ALGVTPPPAL TPRVTIHGGC QDHGVCAALC PTGALHRGEE ADGSGLDFDP AACIACQLCQ QACPEQALTV TATGGAPAPH PLTRHAQRPC ADCQRTFPAA ADEALCPACR KSREFARDAA QRFRPPRPGR GSDATDPQEE SA
|
| |
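The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that layout, compute a GC fraction, and translate the gene is given below. The helper names are hypothetical. The codon table is the standard one, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments and differs only in permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (assumed to match table 11 assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGCCCGAGC GGCCCCACGG CGGTGCGGGC")
print(translate(prefix))  # MPERPHGGAG
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.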