Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0263 |
Symbol | |
ID | 4711145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 301899 |
End bp | 303278 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639854723 |
Product | nitrogenase MoFe cofactor biosynthesis protein NifE |
Protein accession | YP_001001859 |
Protein GI | 121997072 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01283] nitrogenase molybdenum-iron cofactor biosynthesis protein NifE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAACC AGGAGATCGC CGCCCTCCTC GACGAACCGG GCTGCGCCCA CAACCAGGGC AGCAAATCGG GCTGCGCGCG CCCCACGCCG GGCGCCGCCG CTGGTGGATG CGCCTTCGAC GGCGCCCAGA TCACCCTGCT GCCGGTGGCC GACGCCGCGC ACATCGTCCA CGGCCCCATC GGCTGCGCCG GCAGCTCCTG GGACAACCGC GGCACCCGCT CCACCGGCCC CACCCTGTGG CGCATCGGCA TGACCACCGA CCTCAGCGAG CAGGACGTGA TCATGGGCCG GGGCGAGGCG CGGCTGTTCC ACGCCATCAA GCAGGCGGTG GACGATCACG CCCCGGCGGC GGTCTTCGTC TACAACACCT GCGTCCCGGC GTTGACCGGC GACGACATCG AGGCGGTGTG CCGCAGCGCC GAGCGCCGCT GGGGGACCCC GGTGGTTCCG GTGGACTGCG CCGGCTTCTA CGGCAGCAAG AACCTCGGCA ACCGCATCGC CGGCGAGGCG GTGGTCCAGC ACATCGTCGG CACCCGCGAA CCGGAGCCGG TGCCGGCGGA GCGCCGGCCG CGGGATCACC AGGTCCACGA CGTCGGCCTG ATCGGCGAGT TCAACATCGC CGGCGAGTTC TGGAACGTGC TGCCGCTGCT CGACGAGCTC GGCCTGCGCC TGCTCGGCAG CCTCTCCGGC GACGCCCGCT ACCGCGAGCT GCAGACCCTG CACCGCGCCG AGGCCAACAT GCTGGTCTGC TCCAAGGCCC TGCTCAACGT GGCCCGCACC CTGGAAGAGC GCTACGGCAT CCCGTATTTC GAGGGCAGCT TCTACGGCGT CGCCGATACC TCCGCGGCCC TGCGCGGCTT CGCCGGGCTG ATCGACGACC CAGACCTGAG CGCACGCACC GAGCAGGTCA TCGCCCGCGA GGAGGCCCGT GCCGATGCCG CCCTGGAGCC CTACCGCGAG CGCCTGCGCG GCCGCCGGGC CCTGCTCTAC ACCGGCGGCG TGAAGAGCTG GTCGGTGGTC TCGGCGCTGC AGGACCTGGG CATGGAGGTG GTCGCCACCG GCACCCGCAA GTCCACCGAG GCGGACAAGG CGCGCATCCG CGAGCTGATG GGCGAGCAGG CGCAGATGCT GGAAAGCGGT GCGCCGCGGA CGCTGATCGA CACCGTCCGC GCCCACCAGG CCGACGTGCT CATCGCCGGC GGGCGCAACA TGTACACCGC CCTCAAGGCG CGGATCCCGT TCCTGGACAT CAACCAGGAG CGCCCCCACG CCTACGCCGG CTACACCGGC ATGGTGGAGC TGGCCCGCCA ACTGTGCCGC AGCATCGAGA GCCCGATCTG GCCGCAGGTG CGTGAGCCGG CTCCCTGGGA ACGGCAGTAA
|
Protein sequence | MRNQEIAALL DEPGCAHNQG SKSGCARPTP GAAAGGCAFD GAQITLLPVA DAAHIVHGPI GCAGSSWDNR GTRSTGPTLW RIGMTTDLSE QDVIMGRGEA RLFHAIKQAV DDHAPAAVFV YNTCVPALTG DDIEAVCRSA ERRWGTPVVP VDCAGFYGSK NLGNRIAGEA VVQHIVGTRE PEPVPAERRP RDHQVHDVGL IGEFNIAGEF WNVLPLLDEL GLRLLGSLSG DARYRELQTL HRAEANMLVC SKALLNVART LEERYGIPYF EGSFYGVADT SAALRGFAGL IDDPDLSART EQVIAREEAR ADAALEPYRE RLRGRRALLY TGGVKSWSVV SALQDLGMEV VATGTRKSTE ADKARIRELM GEQAQMLESG APRTLIDTVR AHQADVLIAG GRNMYTALKA RIPFLDINQE RPHAYAGYTG MVELARQLCR SIESPIWPQV REPAPWERQ
|
| |