Gene Rsph17029_2198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2198 
Symbol 
ID4895760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2328638 
End bp2330383 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content71% 
IMG OID640112792 
Producttranscriptional regulator NifA 
Protein accessionYP_001044073 
Protein GI126462959 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.920318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACGT CCGCGGCACG GTCCGGCGCA GTGGCCGAAC GGGGTGAGGA ATATCTGACC 
CTCGACGCGC TCTGCGAGAT CGCCAAGCTT CTGACCGGCG CGAGCGATCC GATCGCCTGC
ATGCCCGCGG TGTTCGGGGT GCTGGGCGCC TTCATGGGTC TGCGGCACGG TGCGCTCGCC
CTCCTGCAGG AGGGGGCGCA GGCCGAGACC CAGCGCAATG CCCGCCACGT CAATCCCTAT
GTGATCGCGG CCACCGCCTC GGGCGTGCCG CCCGCCGGGG CCGAGGCCCG CGCGATCCCC
GCGCAGGTGG CGCGGCATGT CTTCCGCAAC GGCGTGTCGC TCGTCTCCTG CGACATCCTC
GAGGAGTTCG GCGCCGAGGC GCTGCCGCCG GGTCTCGGCG ACAGCCGGCA GGCGCTGGTG
GCGGTGCCGA TCCGCGATCA GGCCAATTCG CCCTTCGTGC TGGGCGTGCT CTGCGCCTAC
CGCAGCCTCA AGGACAATGG CGCGCGCTAC CTCGACACCG ACCTGCGCGT CCTGAACATG
GTGGCGGCGG TGCTCGAACA GTCGATCCGC TTCCGCCGTC TCGTCGCCCG CGACCGCGAC
CGGATCGTGC AGGAGGCGCG CGAGGCGATC CGGGTCGCGG CCGAGGCGAC CGCGGGCCCG
CCCGTCGAGG CGCCCGCCGA AGCGCTCGAG GGGGTGATCG GCTCCTCGCC CGCCATCCAG
CGCGTGATCG GCCAGATCCG CAAGGTGGCG GGTACCCACA CGCCCGTCCT GCTGCGCGGC
GAAAGCGGCA CCGGCAAGGA GGTCTTCGCC CGCGCGCTCC ATGCGCTGTC CGAGCGGCGC
GACAAGGCCT TCATCAAGGT CAACTGCGCG GCGCTGAGCC AGTCGCTGCT CGAATCCGAA
CTGTTCGGAC ATGAGAAGGG CTCCTTCACC GGCGCCGTCC AGCAGAAGAA GGGCCGGTTC
GAGATGGCCG AGGGCGGCAC GCTGTTTCTC GACGAGATCG GCGAGATCAG CCTCGAGTTT
CAGGCCAAGC TCCTGCGCAT CCTGCAGGAG GGCGAGTTCG AGCGGGTGGG GGGCACGCGC
ACGCTGCGCG TCGATGTCCG GCTGGTGACG GCGACGAACA AGGATCTCGA GCGGGCGGTG
GCGAACGGCA CTTTCCGCGC CGACCTCTAT TTCCGCATCT GCGTGGTGCC CATCGTGCTG
CCGCCGCTGC GCGATCGCAA GGAGGACATC GGCCTTCTGG CGCAGGGCCT GCTCGAGCGG
TTCAACAAAC GCAACGGGAT GAAGAAGAAG CTGCATCCCT CGGCCGTGGC CGCGCTCGCC
CAGTGCAACT TCCCCGGCAA CGTGCGCGAG CTCGAGAACT GCATCGCGCG TGTGGCGGCC
CTCTCGCCCG AGACGGTGAT CCACGCCGAC GATCTGGCCT GCCACCACGA CCATTGCCTG
TCGGCCGATC TCTGGCGGCT CCAGACCGGA TCGGCCTCGC CGGTGGGCGG GCTTGCGCAG
GGGCCGCTGG AGCTGCCGGT TCTGGGCAGC CGCCCGCCCG CAGCCGCCCC CAGCGCGCCG
CCGCCCCCGC CGCCGACCGT CCCCTCCGCG CCGCTCGACG GCGAGGCGGC CGAGCGGGAG
GCCTTGATCG AGGCGATGGA GCGGGCCGGC TGGGTGCAGG CCAAGGCCGC GCGCCTGCGC
GGCATGACCC CGCGCCAGAT CGGCTATGCG CTGAAGAAAT ACAACATCCG GGTCGAGAAG
TTCTAG
 
Protein sequence
MDTSAARSGA VAERGEEYLT LDALCEIAKL LTGASDPIAC MPAVFGVLGA FMGLRHGALA 
LLQEGAQAET QRNARHVNPY VIAATASGVP PAGAEARAIP AQVARHVFRN GVSLVSCDIL
EEFGAEALPP GLGDSRQALV AVPIRDQANS PFVLGVLCAY RSLKDNGARY LDTDLRVLNM
VAAVLEQSIR FRRLVARDRD RIVQEAREAI RVAAEATAGP PVEAPAEALE GVIGSSPAIQ
RVIGQIRKVA GTHTPVLLRG ESGTGKEVFA RALHALSERR DKAFIKVNCA ALSQSLLESE
LFGHEKGSFT GAVQQKKGRF EMAEGGTLFL DEIGEISLEF QAKLLRILQE GEFERVGGTR
TLRVDVRLVT ATNKDLERAV ANGTFRADLY FRICVVPIVL PPLRDRKEDI GLLAQGLLER
FNKRNGMKKK LHPSAVAALA QCNFPGNVRE LENCIARVAA LSPETVIHAD DLACHHDHCL
SADLWRLQTG SASPVGGLAQ GPLELPVLGS RPPAAAPSAP PPPPPTVPSA PLDGEAAERE
ALIEAMERAG WVQAKAARLR GMTPRQIGYA LKKYNIRVEK F