Gene Rsph17025_1240 details


Gene Information

Locus tag: Rsph17025_1240
Symbol:
ID: 5084413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1280991
End bp: 1282739
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 70%
IMG OID: 640482798
Product: transcriptional regulator NifA
Protein accession: YP_001167446
Protein GI: 146277287
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
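
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (a common convention for such records, not stated on this page) and a CDS whose final codon is a stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1280991, 1282739)
print(length_bp, protein_length(length_bp))  # prints: 1749 582
```

Under these assumptions the result agrees with the listed Gene Length (1749 bp) and Protein Length (582 aa).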


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0431383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATTT CCGCGGAAGG TTCCGACGCA GCGGCCGAGC GCGGCGAGGA GTATCTCACC 
CTCGACGCCC TGTGCGAGAT CGCCAAGCTT CTGACCGGCG CCAGCGACCC GATCGCCCAC
ATGCCCGCGG TCTTCGGCGT GCTCGCCTCG TTCATGGGGC TGAAGCACGG GGCGCTTGCG
CTGCTGCAGG AAGGGGCGCC GGGCGAATCG TCCCGCAACG CGCGCCACAT CAACCCCTAT
GTCGTCGCGG CGACCGCCAG CGGAGGGCCC GTGTCCGCGG ACTGCCGTGC CATCCCGGCG
CAGGCGGCGC GGCATGTCTT TCGCAACGGC GTCTCGCTCG TGTCCTGCGA CATCGTCGAT
GAGTTCGGGG CCGACGCCCT TCCGCCCGGC CTCGAGGACA AGTATCAGGC GCTGATCGCC
GTGCCGATCC GGGACCAGGC CAATTCGCCC TTCGTGCTGG GTGTGCTCTG TGCCTATCGC
GGCCTCCGGG ACAATGGCGC GCGGTTTCTC GACACCGATC TTCGGGTGCT GAACATGGTC
GCGGCCGTGC TCGAGCAGTC GATCCGCTTC CGCAGGCTGG TGGCGCGCGA TCGCGACCGG
ATCGTGCAGG AGGCGCGCGA GGCGATCCGC GCGGCGGCCG AAGCCACATC GGCCGCGCCC
GCCGAACTGC CGGCGGCCGA ACTGGATGGG GTGATCGGCT CCTCGGCCGC GATCCAGCGG
GTGATCGGCC AGATCCGCAA GGTCGCGGGC ACCCACACGC CCGTCCTGCT GCGGGGCGAG
AGCGGCACCG GCAAGGAGGT CTTCGCCCGC GCCCTCCACG CCCTGTCGGA CCGGCGCGAC
CGGCCCTTCA TCAAGGTCAA CTGCGCGGCG CTGAGCCAGT CGCTGCTGGA ATCCGAGCTG
TTCGGCCACG AGAAGGGCTC GTTCACCGGG GCGGTGCAGC AGAAGAAAGG CCGGTTCGAG
CTGGCCGACG GCGGCACGCT CTTTCTCGAC GAGATCGGCG AGATCAGCCT CGAATTCCAG
GCCAAGCTCC TGCGCATCCT GCAGGAGGGC GAGTTCGAGC GGGTGGGCGG CTCGCGCACG
CTGAAGGTGG ACGTGCGGCT TGTGACCGCC ACCAACAAGG ATCTGGAACG GGCGGTGGCC
AACGGCACCT TCCGCGCCGA CCTCTATTTC CGCATCTGCG TCGTGCCGAT CGTCCTGCCG
CCGCTGCGCG AGCGCAAGGA GGACATCGGG CCGCTGGCCC AAGGGCTGCT CGAGCGGTTC
AACAAGCGCA ACGGCATGAA GAAGCGGCTG CACCCCTCGG CGATCTCGGC GCTGGCCGAG
TGCAACTTTC CCGGCAACGT CCGCGAGCTG GAAAACTGCA TCGCCCGCGT GGCCGCCCTC
TCGCCCGAGT CGGTGATCCA CGCCGACGAT CTGGCCTGCC ACCACGACCA TTGCCTCTCG
GCCGATCTCT GGCGGCTGCA GACGGGCGGC GCCTCGCCGG TGGGCGGGCT CGCACAGGGG
CCGCTGGAAT TGCCGGTGCT GGGCAGCCGG CCGCCTACGC CCGCGGCGCC GGAAGGGGCC
GCCGCTCCGC CCGCGTCGCC CGCGCGGCCT GCACCGCTCG ACAGCGAGGC GGCCGAGCGC
GAGGCGTTGA TCGAGGCGAT GGAGCGCGCG GGATGGGTGC AGGCCAAGGC CGCCCGGCTG
CGCGGCATGA CCCCGCGCCA GATCGGCTAC GCGCTCAGGA AGCACAACAT CCGCGTCGAG
AAGTTCTGA
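
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any length or GC calculation. The sketch below shows one possible way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed row is used as sample input. Pasting the full sequence would allow comparison with the listed Gene Length and GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGGACATTT CCGCGGAAGG TTCCGACGCA GCGGCCGAGC GCGGCGAGGA GTATCTCACC"
seq = clean_sequence(first_row)
print(len(seq), round(100 * gc_fraction(seq)))  # length in bp, GC content in percent
```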
 
Protein sequence
MDISAEGSDA AAERGEEYLT LDALCEIAKL LTGASDPIAH MPAVFGVLAS FMGLKHGALA 
LLQEGAPGES SRNARHINPY VVAATASGGP VSADCRAIPA QAARHVFRNG VSLVSCDIVD
EFGADALPPG LEDKYQALIA VPIRDQANSP FVLGVLCAYR GLRDNGARFL DTDLRVLNMV
AAVLEQSIRF RRLVARDRDR IVQEAREAIR AAAEATSAAP AELPAAELDG VIGSSAAIQR
VIGQIRKVAG THTPVLLRGE SGTGKEVFAR ALHALSDRRD RPFIKVNCAA LSQSLLESEL
FGHEKGSFTG AVQQKKGRFE LADGGTLFLD EIGEISLEFQ AKLLRILQEG EFERVGGSRT
LKVDVRLVTA TNKDLERAVA NGTFRADLYF RICVVPIVLP PLRERKEDIG PLAQGLLERF
NKRNGMKKRL HPSAISALAE CNFPGNVREL ENCIARVAAL SPESVIHADD LACHHDHCLS
ADLWRLQTGG ASPVGGLAQG PLELPVLGSR PPTPAAPEGA AAPPASPARP APLDSEAAER
EALIEAMERA GWVQAKAARL RGMTPRQIGY ALRKHNIRVE KF
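
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a minimal sketch, the codon table below uses the standard amino-acid assignments, which table 11 shares with the standard code (the two differ in the start codons they allow, which this sketch does not model). The name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGACATTT CCGCGGAAGG TTCCGACGCA"))  # first 30 bases of the gene
print(translate("AAGTTCTGA"))                          # last 9 bases of the gene
```

The two calls give `MDISAEGSDA` and `KF`, which match the first ten and last two residues of the protein sequence shown above.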