Gene Information |
Locus tag | Rsph17025_1240 |
Symbol | |
ID | 5084413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1280991 |
End bp | 1282739 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640482798 |
Product | transcriptional regulator NifA |
Protein accession | YP_001167446 |
Protein GI | 146277287 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0431383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTT CCGCGGAAGG TTCCGACGCA GCGGCCGAGC GCGGCGAGGA GTATCTCACC CTCGACGCCC TGTGCGAGAT CGCCAAGCTT CTGACCGGCG CCAGCGACCC GATCGCCCAC ATGCCCGCGG TCTTCGGCGT GCTCGCCTCG TTCATGGGGC TGAAGCACGG GGCGCTTGCG CTGCTGCAGG AAGGGGCGCC GGGCGAATCG TCCCGCAACG CGCGCCACAT CAACCCCTAT GTCGTCGCGG CGACCGCCAG CGGAGGGCCC GTGTCCGCGG ACTGCCGTGC CATCCCGGCG CAGGCGGCGC GGCATGTCTT TCGCAACGGC GTCTCGCTCG TGTCCTGCGA CATCGTCGAT GAGTTCGGGG CCGACGCCCT TCCGCCCGGC CTCGAGGACA AGTATCAGGC GCTGATCGCC GTGCCGATCC GGGACCAGGC CAATTCGCCC TTCGTGCTGG GTGTGCTCTG TGCCTATCGC GGCCTCCGGG ACAATGGCGC GCGGTTTCTC GACACCGATC TTCGGGTGCT GAACATGGTC GCGGCCGTGC TCGAGCAGTC GATCCGCTTC CGCAGGCTGG TGGCGCGCGA TCGCGACCGG ATCGTGCAGG AGGCGCGCGA GGCGATCCGC GCGGCGGCCG AAGCCACATC GGCCGCGCCC GCCGAACTGC CGGCGGCCGA ACTGGATGGG GTGATCGGCT CCTCGGCCGC GATCCAGCGG GTGATCGGCC AGATCCGCAA GGTCGCGGGC ACCCACACGC CCGTCCTGCT GCGGGGCGAG AGCGGCACCG GCAAGGAGGT CTTCGCCCGC GCCCTCCACG CCCTGTCGGA CCGGCGCGAC CGGCCCTTCA TCAAGGTCAA CTGCGCGGCG CTGAGCCAGT CGCTGCTGGA ATCCGAGCTG TTCGGCCACG AGAAGGGCTC GTTCACCGGG GCGGTGCAGC AGAAGAAAGG CCGGTTCGAG CTGGCCGACG GCGGCACGCT CTTTCTCGAC GAGATCGGCG AGATCAGCCT CGAATTCCAG GCCAAGCTCC TGCGCATCCT GCAGGAGGGC GAGTTCGAGC GGGTGGGCGG CTCGCGCACG CTGAAGGTGG ACGTGCGGCT TGTGACCGCC ACCAACAAGG ATCTGGAACG GGCGGTGGCC AACGGCACCT TCCGCGCCGA CCTCTATTTC CGCATCTGCG TCGTGCCGAT CGTCCTGCCG CCGCTGCGCG AGCGCAAGGA GGACATCGGG CCGCTGGCCC AAGGGCTGCT CGAGCGGTTC AACAAGCGCA ACGGCATGAA GAAGCGGCTG CACCCCTCGG CGATCTCGGC GCTGGCCGAG TGCAACTTTC CCGGCAACGT CCGCGAGCTG GAAAACTGCA TCGCCCGCGT GGCCGCCCTC TCGCCCGAGT CGGTGATCCA CGCCGACGAT CTGGCCTGCC ACCACGACCA TTGCCTCTCG GCCGATCTCT GGCGGCTGCA GACGGGCGGC GCCTCGCCGG TGGGCGGGCT CGCACAGGGG CCGCTGGAAT TGCCGGTGCT GGGCAGCCGG CCGCCTACGC CCGCGGCGCC GGAAGGGGCC GCCGCTCCGC CCGCGTCGCC CGCGCGGCCT GCACCGCTCG ACAGCGAGGC GGCCGAGCGC GAGGCGTTGA TCGAGGCGAT GGAGCGCGCG GGATGGGTGC AGGCCAAGGC CGCCCGGCTG CGCGGCATGA CCCCGCGCCA GATCGGCTAC GCGCTCAGGA AGCACAACAT CCGCGTCGAG AAGTTCTGA
|
Protein sequence | MDISAEGSDA AAERGEEYLT LDALCEIAKL LTGASDPIAH MPAVFGVLAS FMGLKHGALA LLQEGAPGES SRNARHINPY VVAATASGGP VSADCRAIPA QAARHVFRNG VSLVSCDIVD EFGADALPPG LEDKYQALIA VPIRDQANSP FVLGVLCAYR GLRDNGARFL DTDLRVLNMV AAVLEQSIRF RRLVARDRDR IVQEAREAIR AAAEATSAAP AELPAAELDG VIGSSAAIQR VIGQIRKVAG THTPVLLRGE SGTGKEVFAR ALHALSDRRD RPFIKVNCAA LSQSLLESEL FGHEKGSFTG AVQQKKGRFE LADGGTLFLD EIGEISLEFQ AKLLRILQEG EFERVGGSRT LKVDVRLVTA TNKDLERAVA NGTFRADLYF RICVVPIVLP PLRERKEDIG PLAQGLLERF NKRNGMKKRL HPSAISALAE CNFPGNVREL ENCIARVAAL SPESVIHADD LACHHDHCLS ADLWRLQTGG ASPVGGLAQG PLELPVLGSR PPTPAAPEGA AAPPASPARP APLDSEAAER EALIEAMERA GWVQAKAARL RGMTPRQIGY ALRKHNIRVE KF
|
| |
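The derived fields in this record (gene length, protein length, GC content, and the protein sequence) can be re-checked from the coordinates and the gene sequence. The following is a minimal sketch of such a check, not the database's own method. The function names (gene_length, gc_percent, translate) are hypothetical and chosen for illustration. It assumes inclusive 1-based coordinates, a sequence given on the coding strand, and the NCBI translation table 11 codon assignments. It does not handle the alternative start codons of table 11, which does not matter here because this gene starts with ATG.

```python
# Sketch only: helper names are hypothetical, not part of any database API.

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# of translation table 11 are identical to the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)
```

With the values in this record, gene_length(1280991, 1282739) gives 1749 bp, which corresponds to 583 codons including the stop codon and therefore 582 amino acids. Passing the full "Gene sequence" field to translate and gc_percent would allow the "Protein sequence" and "GC content" fields to be compared in the same way.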