Gene Information |
Locus tag | RSP_0547 |
Symbol | nifA |
ID | 3718056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 2283761 |
End bp | 2285506 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640071756 |
Product | NifA subfamily transcriptional regulator |
Protein accession | YP_353620 |
Protein GI | 77464116 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
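The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is complete with a single terminal stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2283761
END_BP = 2285506
GENE_LENGTH_BP = 1746
PROTEIN_LENGTH_AA = 581


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

The strand field does not affect the arithmetic: for a minus-strand gene the coordinates still delimit the same span on the replicon.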
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.202829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACGT CCGCGGCACG GTCCGGCGCA GTGGCCGAAC GGGGTGAGGA ATATCTGACC CTCGACGCGC TCTGCGAGAT CGCCAAGCTT CTGACCGGCG CGAGCGATCC GATCGCCTGC ATGCCTGCAG TGTTCGGGGT GCTGGGCGCC TTCATGGGTC TGCGGCACGG CGCGCTCGCC CTCCTGCAGG AGGGGGCGCA GGCCGAGACC CAGCGCAATG CCCGCCACGT CAATCCCTAT GTGATCGCGG CCACCGCCTC GGGCGTGCCG CCCGCCGGGG CCGAGGCCCG CGCGATCCCC GCGCAGGTGG CGCGGCATGT CTTCCGCAAC GGCGTGTCGC TCGTTTCCTG CGACATCCTC GAGGAGTTCG GCGCCGAGGC GCTGCCGCCG GGCCTCGGCG ACAGCCGGCA GGCGCTGGTG GCGGTGCCGA TCCGCGATCA GGCCAATTCG CCCTTCGTGC TGGGCGTGCT CTGCGCCTAC CGCAGCCTCA AGGACAATGG CGCGCGCTAC CTCGACACCG ACCTGCGCGT CCTGAACATG GTGGCGGCGG TGCTCGAACA GTCGATCCGC TTCCGCCGTC TCGTCGCCCG CGACCGCGAC CGGATCGTGC AGGAGGCGCG CGAGGCGATC CGGGTCGCGG CCGAGGCGAC CGCGGGCCCG CCCGTCGAGG CGCCGGCCGA GGCGCTCGAG GGGGTGATCG GCTCCTCCCC CGCCATCCAG CGCGTGATCG GCCAGATCCG CAAGGTGGCG GGCACCCACA CGCCCGTCCT GCTGCGCGGC GAGAGCGGCA CCGGCAAGGA GGTCTTCGCT CGCGCGCTCC ATGCGCTGTC CGAGCGGCGC GACAAGGCCT TCATCAAGGT CAACTGCGCG GCGCTGAGCC AGTCGCTGCT CGAATCCGAA CTGTTCGGAC ATGAGAAGGG CTCCTTCACC GGCGCCGTCC AGCAGAAGAA GGGCCGGTTC GAGATGGCCG AGGGCGGCAC GCTGTTTCTC GACGAGATCG GCGAAATCAG CCTCGAGTTT CAGGCCAAGC TCCTGCGCAT CCTGCAGGAG GGCGAGTTCG AGCGGGTGGG GGGCACGCGC ACGCTGCGCG TCGATGTCCG GCTGGTGACG GCCACGAACA AGGATCTCGA GCGGGCGGTG GCGAACGGCA CCTTCCGCGC CGACCTCTAT TTCCGCATCT GCGTGGTGCC CATCGTGCTG CCGCCGCTGC GCGACCGCAA GGAGGACATC GGCCTTCTGG CGCAGGGGCT GCTCGAGCGG TTCAACAAGC GCAACGGGAT GAAGAAGAAG CTGCATCCCT CGGCTGTGGC CGCGCTTGCC CAGTGCAACT TCCCCGGCAA CGTGCGCGAG CTCGAGAACT GCATCGCGCG TGTGGCGGCC CTCTCGCCCG AGACGGTGAT CCACGCCGAC GATCTGGCCT GCCACCACGA CCATTGCCTG TCGGCCGATC TCTGGCGGCT CCAGACCGGA TCGGCCTCGC CGGTGGGCGG GCTCGCGCAG GGGCCGCTGG AGCTGCCGGT TCTGGGCAGC CGCCCGCCCG CAGCCGCCCC CAGCGCGCCG CCACCCCCGC CGCCGACCGT CCCCTCCGCG CCGCTCGACG GCGAGGCGGC CGAGCGCGAG GCGCTGATCG AGGCGATGGA GCGAGCCGGC TGGGTGCAGG CCAAGGCCGC GCGCCTGCGC GGCATGACCC CGCGCCAGAT CGGCTATGCG CTGAAGAAAT ACAACATCCG GGTCGAGAAG TTCTAG
|
Protein sequence | MDTSAARSGA VAERGEEYLT LDALCEIAKL LTGASDPIAC MPAVFGVLGA FMGLRHGALA LLQEGAQAET QRNARHVNPY VIAATASGVP PAGAEARAIP AQVARHVFRN GVSLVSCDIL EEFGAEALPP GLGDSRQALV AVPIRDQANS PFVLGVLCAY RSLKDNGARY LDTDLRVLNM VAAVLEQSIR FRRLVARDRD RIVQEAREAI RVAAEATAGP PVEAPAEALE GVIGSSPAIQ RVIGQIRKVA GTHTPVLLRG ESGTGKEVFA RALHALSERR DKAFIKVNCA ALSQSLLESE LFGHEKGSFT GAVQQKKGRF EMAEGGTLFL DEIGEISLEF QAKLLRILQE GEFERVGGTR TLRVDVRLVT ATNKDLERAV ANGTFRADLY FRICVVPIVL PPLRDRKEDI GLLAQGLLER FNKRNGMKKK LHPSAVAALA QCNFPGNVRE LENCIARVAA LSPETVIHAD DLACHHDHCL SADLWRLQTG SASPVGGLAQ GPLELPVLGS RPPAAAPSAP PPPPPTVPSA PLDGEAAERE ALIEAMERAG WVQAKAARLR GMTPRQIGYA LKKYNIRVEK F
|
| |
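The sequences above are printed in space-separated blocks of 10, so they need normalizing before any computation. The following is a minimal sketch of how the gene sequence could be cleaned and checked against the GC content and Translation table fields. The helper names are hypothetical, and only short excerpts of the gene sequence are used here; pasting the full sequence into `normalize` would allow the reported 71% GC content to be compared directly.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq_text: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Excerpts copied from the start and end of the gene sequence.
    first_blocks = normalize("ATGGACACGT CCGCGGCACG")
    last_blocks = normalize("GGTCGAGAAG TTCTAG")
    print(first_blocks.startswith("ATG"))  # True
    print(ends_with_stop(last_blocks))     # True
    print(gc_fraction(first_blocks))
```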