Gene Information |
Locus tag | Hhal_0282 |
Symbol | |
ID | 4711184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 318575 |
End bp | 320152 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854742 |
Product | transcriptional regulator NifA |
Protein accession | YP_001001878 |
Protein GI | 121997091 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
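The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Hypothetical helper names; the numeric values are copied from the record above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(318575, 320152)
print(length_bp)                  # 1578
print(protein_length(length_bp))  # 525
```

Both results agree with the Gene Length (1578 bp) and Protein Length (525 aa) fields. The strand does not affect the calculation, since only the span of the interval is used.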
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.262632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC CCATGCCCGT CGGCCCCCGC GAGCGCGCCC GGGCCGAACT GGTCGAGGCC CAGCTCGAAG CGCTCTACCA GGTCAGCCGC GTCCTCTCCC GCAGCCTCGA CCTGCGCGAT ACGCTCCAGG CCGTGCTCGA AGTGCTCCAC GAGCACGCCG ACATGCACTC CGGGATGGTC GCCCTGCGCG ACCCGGTGCG CCGCGAGACC ATCGTCCGCG CGCTGCACGG TCAGACCCCC TCCAATGAGG TCATCTACCG TCCCGGCGAG GGGATCGTCG GCGAGATCCT CAACCAGGCC GAGCCGCTGA TCGTCCGGCG CATCTCCGCA GAGCCTCGGT TCCTCGATCG GCTCGGCCTC TACGATCCGG ATCTGCCGTT CATCGGCGTT CCCATCAACG GTCCCGGCGA ACACGACCCG CCGGTGGGCG TGCTCGCCGC GCAGCCCGCC GCCGACGAGG GCGCGTTGCT GGCGGAGCGT TCGCGGTTCA TGCAGATGGT CGCCAATCTG GTCGCCCAGA CCGTGCGCCT GGCCACCGAG GTCGAGCGCG AGCAGGCGGC CATTGTCGAT GAGCGCGATC GCCTGCGCCG GGAGGTGCAA GAGAGCTACG GCTTCGAGAA CGTCATCGGC CGCACCCCGG CCATCCGCCG CGTCTTCGAA CAGGTGCGCC GGGTCGCACA GTGGAGCACC ACGGTGCTGC TGCGCGGCGA GTCCGGCACC GGCAAGGAGC TGCTGGCCAA CGCCATCCAC TACAACTCGC CGCGGGCCGA CGCGCCCTTC GTCAAGCTCA ACTGCGCGGC GCTGCCCGAC AGCCTGCTCG AGAGCGAGCT CTTCGGCCAC GAGAAGGGGG CGTTCACCGG CGCCGTGCAG AGCCGGGTGG GGCGTTTCGA GCAGGCCGAC GGCGGCACCA TCTTCCTCGA CGAGATGGGC GAGATCTCGC CGGCCTTCCA GGCCAAGCTG CTGCGCATCC TCCAGGAGGG GGAGTTCGAG CGGGTGGGCA GCACCCGCAC ACGGCATGTG GATGTGCGCG TGCTCGCCGC CACCAACCGC GATCTGGAGG CGGCGGTCGC TGCCGGCGAG TTCCGCGAGG ACCTCTACTA CCGGCTCAAC GTGATGGCCA TCACCATGCC CGCGCTGCGC GAGCGGCTCG AGGATGTCCC GGAGCTGGCT CGCTTCCTGG TCGGACGCAT CGCCGACGTT CAGCAGCGGG CGCTGGATAT CTCCGACGAC GCCATCCGTA CGCTGATGCG CTACCACTGG CCGGGCAATG TCCGCGAGTT GCAGAACGCC CTGGAGCGGG CGGCGATCAT GAGCGAGGAC GGCCACATTG ATCACGAGGC GCTGCTCCTC GCCGGCATCC CCGAAGACGG TTCGGGGCCC GCCGGCAATG GAGTCTTGCC TGCCGGGCCG GCGGCGCCGG ATCCCAACGA CCCGGCGCTG GATGAGCGCG AGCGGGTGAT CGCCGCCCTT GAACGCACCG GCTGGGTGCA GGCCAAGGCG GCGCGCCTGC TGGGCATGAC GCCGCGGCAG ATCGCGTACC GGATCAAGAC CCTGGATATC CCGGTGAAGA GCCTGTGA
|
Protein sequence | MENPMPVGPR ERARAELVEA QLEALYQVSR VLSRSLDLRD TLQAVLEVLH EHADMHSGMV ALRDPVRRET IVRALHGQTP SNEVIYRPGE GIVGEILNQA EPLIVRRISA EPRFLDRLGL YDPDLPFIGV PINGPGEHDP PVGVLAAQPA ADEGALLAER SRFMQMVANL VAQTVRLATE VEREQAAIVD ERDRLRREVQ ESYGFENVIG RTPAIRRVFE QVRRVAQWST TVLLRGESGT GKELLANAIH YNSPRADAPF VKLNCAALPD SLLESELFGH EKGAFTGAVQ SRVGRFEQAD GGTIFLDEMG EISPAFQAKL LRILQEGEFE RVGSTRTRHV DVRVLAATNR DLEAAVAAGE FREDLYYRLN VMAITMPALR ERLEDVPELA RFLVGRIADV QQRALDISDD AIRTLMRYHW PGNVRELQNA LERAAIMSED GHIDHEALLL AGIPEDGSGP AGNGVLPAGP AAPDPNDPAL DERERVIAAL ERTGWVQAKA ARLLGMTPRQ IAYRIKTLDI PVKSL
|
| |
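The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal illustration, assuming the displayed gene sequence is the coding strand (it begins with ATG although the gene lies on the minus strand) and that whitespace between the 10-base groups is only for display. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, which does not matter here because the gene starts with ATG. The names `translate` and `gc_fraction` are hypothetical helpers, and only the first 30 and last 18 bases of the gene are used to keep the example short.

```python
# Standard codon-to-amino-acid assignments, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


gene_start = "ATGGAAAACC CCATGCCCGT CGGCCCCCGC"  # first 30 bases of the gene
gene_end = "CCGGTGAAGA GCCTGTGA"                 # last 18 bases of the gene

print(translate(gene_start))    # MENPMPVGPR
print(translate(gene_end))      # PVKSL
print(gc_fraction(gene_start))  # 0.7 (this 30-base fragment only)
```

The two translated fragments match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field of the record.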