Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2198 |
Symbol | |
ID | 4895760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2328638 |
End bp | 2330383 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640112792 |
Product | transcriptional regulator NifA |
Protein accession | YP_001044073 |
Protein GI | 126462959 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.920318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACGT CCGCGGCACG GTCCGGCGCA GTGGCCGAAC GGGGTGAGGA ATATCTGACC CTCGACGCGC TCTGCGAGAT CGCCAAGCTT CTGACCGGCG CGAGCGATCC GATCGCCTGC ATGCCCGCGG TGTTCGGGGT GCTGGGCGCC TTCATGGGTC TGCGGCACGG TGCGCTCGCC CTCCTGCAGG AGGGGGCGCA GGCCGAGACC CAGCGCAATG CCCGCCACGT CAATCCCTAT GTGATCGCGG CCACCGCCTC GGGCGTGCCG CCCGCCGGGG CCGAGGCCCG CGCGATCCCC GCGCAGGTGG CGCGGCATGT CTTCCGCAAC GGCGTGTCGC TCGTCTCCTG CGACATCCTC GAGGAGTTCG GCGCCGAGGC GCTGCCGCCG GGTCTCGGCG ACAGCCGGCA GGCGCTGGTG GCGGTGCCGA TCCGCGATCA GGCCAATTCG CCCTTCGTGC TGGGCGTGCT CTGCGCCTAC CGCAGCCTCA AGGACAATGG CGCGCGCTAC CTCGACACCG ACCTGCGCGT CCTGAACATG GTGGCGGCGG TGCTCGAACA GTCGATCCGC TTCCGCCGTC TCGTCGCCCG CGACCGCGAC CGGATCGTGC AGGAGGCGCG CGAGGCGATC CGGGTCGCGG CCGAGGCGAC CGCGGGCCCG CCCGTCGAGG CGCCCGCCGA AGCGCTCGAG GGGGTGATCG GCTCCTCGCC CGCCATCCAG CGCGTGATCG GCCAGATCCG CAAGGTGGCG GGTACCCACA CGCCCGTCCT GCTGCGCGGC GAAAGCGGCA CCGGCAAGGA GGTCTTCGCC CGCGCGCTCC ATGCGCTGTC CGAGCGGCGC GACAAGGCCT TCATCAAGGT CAACTGCGCG GCGCTGAGCC AGTCGCTGCT CGAATCCGAA CTGTTCGGAC ATGAGAAGGG CTCCTTCACC GGCGCCGTCC AGCAGAAGAA GGGCCGGTTC GAGATGGCCG AGGGCGGCAC GCTGTTTCTC GACGAGATCG GCGAGATCAG CCTCGAGTTT CAGGCCAAGC TCCTGCGCAT CCTGCAGGAG GGCGAGTTCG AGCGGGTGGG GGGCACGCGC ACGCTGCGCG TCGATGTCCG GCTGGTGACG GCGACGAACA AGGATCTCGA GCGGGCGGTG GCGAACGGCA CTTTCCGCGC CGACCTCTAT TTCCGCATCT GCGTGGTGCC CATCGTGCTG CCGCCGCTGC GCGATCGCAA GGAGGACATC GGCCTTCTGG CGCAGGGCCT GCTCGAGCGG TTCAACAAAC GCAACGGGAT GAAGAAGAAG CTGCATCCCT CGGCCGTGGC CGCGCTCGCC CAGTGCAACT TCCCCGGCAA CGTGCGCGAG CTCGAGAACT GCATCGCGCG TGTGGCGGCC CTCTCGCCCG AGACGGTGAT CCACGCCGAC GATCTGGCCT GCCACCACGA CCATTGCCTG TCGGCCGATC TCTGGCGGCT CCAGACCGGA TCGGCCTCGC CGGTGGGCGG GCTTGCGCAG GGGCCGCTGG AGCTGCCGGT TCTGGGCAGC CGCCCGCCCG CAGCCGCCCC CAGCGCGCCG CCGCCCCCGC CGCCGACCGT CCCCTCCGCG CCGCTCGACG GCGAGGCGGC CGAGCGGGAG GCCTTGATCG AGGCGATGGA GCGGGCCGGC TGGGTGCAGG CCAAGGCCGC GCGCCTGCGC GGCATGACCC CGCGCCAGAT CGGCTATGCG CTGAAGAAAT ACAACATCCG GGTCGAGAAG TTCTAG
|
Protein sequence | MDTSAARSGA VAERGEEYLT LDALCEIAKL LTGASDPIAC MPAVFGVLGA FMGLRHGALA LLQEGAQAET QRNARHVNPY VIAATASGVP PAGAEARAIP AQVARHVFRN GVSLVSCDIL EEFGAEALPP GLGDSRQALV AVPIRDQANS PFVLGVLCAY RSLKDNGARY LDTDLRVLNM VAAVLEQSIR FRRLVARDRD RIVQEAREAI RVAAEATAGP PVEAPAEALE GVIGSSPAIQ RVIGQIRKVA GTHTPVLLRG ESGTGKEVFA RALHALSERR DKAFIKVNCA ALSQSLLESE LFGHEKGSFT GAVQQKKGRF EMAEGGTLFL DEIGEISLEF QAKLLRILQE GEFERVGGTR TLRVDVRLVT ATNKDLERAV ANGTFRADLY FRICVVPIVL PPLRDRKEDI GLLAQGLLER FNKRNGMKKK LHPSAVAALA QCNFPGNVRE LENCIARVAA LSPETVIHAD DLACHHDHCL SADLWRLQTG SASPVGGLAQ GPLELPVLGS RPPAAAPSAP PPPPPTVPSA PLDGEAAERE ALIEAMERAG WVQAKAARLR GMTPRQIGYA LKKYNIRVEK F
|
| |