Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0273 |
Symbol | |
ID | 4711169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 310188 |
End bp | 311654 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639854733 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001001869 |
Protein GI | 121997082 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.280105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACT TGACGCAGGA AGAGGCCCAG GCGGTGATCG AGGAGGTCCT CGAGGTCTAC CCCGAGGCCA CCCGCGCGGA ACGGGACAAG CACCTGGCCG CCGTCGGCCC GGATCAGCAG CCTAGCAAGT GCGTGACCTC CAACCGCAAG TCGGTCCCCG GGGTGATGAC CCAGCGCGGC TGCGCCTACG CCGGCTCCAA GGGTGTGGTC TGGGGGCCGA TCAAGGACAT GGTGCACGTC TCCCACGGCC CGGTGGGTTG TGGTCAGCTC TCCCGGGCCG GCCGGCGCAA CTACTACACC GGCCATACCG GGGTGGACAC CTTCGGCACC ATCAACTTCA CCTCAGACTT CCAGGAGCGG GACATCGTCT TTGGTGGCGA CCCGAAGCTG GAGAAGATCG TCGACGAGAT CGAGATGCTC TTCCCGCTGA ACAAGGGCAT CACCGTGCAG TCGGAGTGCC CCATCGGGCT GATCGGCGAC GACATCGACT CCGTGGGTCG GCAGGCCACC GAGCGCCTCG GTAAGCCGGT GATCCCGGTG CGCTGCGAGG GGTTCCGCGG CGTGTCGCAG TCCCTCGGCC ACCACATCGC CAACGACACC ATCCGCGACC ACGTCCTGGA GAACCGCGAG GGCGACGGCC GCCAGGCCGG GCCCTACGAC GTGGCCATCG TCGGCGACTA CAACATCGGT GGCGACTGCT GGGCCTCGCG CATCCTGCTC GAGGAGATGG GCCTGAACGT GGTGGCGCAG TGGTCCGGCG ACGGCACCCT GGCCGAGATG GAGAACACGC CCAAGGTCCA GCTCAATCTC CTGCACTGCT ACCGCTCCAT GAACTACATC TGCGAGCACA TGGAGAAGAC CCACGGCATC CCGTGGATGG AGTTCAACTT CTTCGGGCCG ACGCGGATCG CCCAGAGCCT ACGCGAGATC GCGGCGCAGT TCGACGAGAC CATCCAGGCC AACGCCGAGC AGGTCATCGC CAAGTACCAG CCGACCATGG AGGCGGTGAT CGCCAAGTAT CGGCCGCGGC TCGAGGGCAA GAAGGTGATG CTCTACGTCG GCGGGCTGCG CTCGCGCCAC GTCATCGGGG CCTACGAGGA CCTGGGCATG GAGGTCATCG GCACCGGCTA CGAGTTCGCC CACGACGACG ACTACGACCG CACCTACCCG GAGCTCAAGG AGGGGACGCT GGTCTACGAC GACGCCAACG CCTTCGAGCT GGAGCGTTTC ATCGAGCGGG CCCGACCGGA CCTGGTGGCC GCCGGGATCA AGGAGAAGTA CGTCTTCCAG AAGATGGGTC TGCCGTTCCG GCAGATGCAC TCCTGGGACT ACTCCGGGCC GTACCACGGC TACGACGGCT TCGCGATCTT CGCCCGGGAT ATGGACATGA CCCTGAACAA CCCGGTCTGG GATCGCATGA CTCCGCCGTG GAAAGCCACC GGTGAGCCGG CCTCCAAGGC GGCCTGA
|
Protein sequence | MSNLTQEEAQ AVIEEVLEVY PEATRAERDK HLAAVGPDQQ PSKCVTSNRK SVPGVMTQRG CAYAGSKGVV WGPIKDMVHV SHGPVGCGQL SRAGRRNYYT GHTGVDTFGT INFTSDFQER DIVFGGDPKL EKIVDEIEML FPLNKGITVQ SECPIGLIGD DIDSVGRQAT ERLGKPVIPV RCEGFRGVSQ SLGHHIANDT IRDHVLENRE GDGRQAGPYD VAIVGDYNIG GDCWASRILL EEMGLNVVAQ WSGDGTLAEM ENTPKVQLNL LHCYRSMNYI CEHMEKTHGI PWMEFNFFGP TRIAQSLREI AAQFDETIQA NAEQVIAKYQ PTMEAVIAKY RPRLEGKKVM LYVGGLRSRH VIGAYEDLGM EVIGTGYEFA HDDDYDRTYP ELKEGTLVYD DANAFELERF IERARPDLVA AGIKEKYVFQ KMGLPFRQMH SWDYSGPYHG YDGFAIFARD MDMTLNNPVW DRMTPPWKAT GEPASKAA
|
| |