Gene RPB_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0957 
Symbol 
ID3909312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1106632 
End bp1108383 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content64% 
IMG OID637882850 
Producttranscriptional regulator NifA 
Protein accessionYP_484578 
Protein GI86748082 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID[TIGR01817] Nif-specific regulatory protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGC GCGAAGTACG TCTTGTCGAG AGCGAGCAAT CGCGGCAGCC GATGAACCAG 
AACCCGATAC CGCTGAGTGA GATTGCGCTC ACCGGCATTT TTGAAATCTC CAAGATCCTC
ACAGCGCCGG CGCGGCTCGA AGTCACGCTC GCGAATGTCG TCAATCTGCT GCAGTCCTTT
CTGCAGATGC GAAATGGCGT CGTGTCGCTG CTGGCCGACG ACAGTGTTCC CGACATCACG
GTCGGCGTGG GCTGGAACGA AGGCAGCGAC AATCGCTATC GCGCGCGACT CCCGCAGAAA
GCCATCGACC AGATCGTCGC CACCTCGGTG CCGCTGGTCG CCGACAACGT GGCCGCGCAT
CCGATGTTCT CCGCCGCCGA CGCGCTCGCG CTGGGCGCGA CCGACGAGAC CCGTGTGTCG
TTCATCGGCG TGCCGATCCG GATCGATTCG CGGGTCGTGG GCACCCTGAC GATCGACCGG
GTTCGCGACG GGCAGTCGAT CTTCCGGATG GACGCCGATG TCCGGTTCCT GACTATGGTC
GCCAATCTGA TCGGGCAGAC CGTGAAGCTG CATCGCGTGG TGGCGCGTGA TCGCGAGCGG
CTGATGGCGG AAAGCCATCG CCTGCAGAAA GAGCTGTACG AGTTGAAGCC GCAGCGCGAG
CGCAAGCGCG TCCGGGTCGA CGGCATCGTC GGCGAGAGCC CGGCGATCCG CACGTTGCTC
GCCAAGGTCA GCATCATCGC CAAATCGCAG TCGCCGGTGC TTTTGCGCGG CGAGTCCGGC
ACCGGCAAGG AACTGATCGC CAAGGCGATC CACGAATTGT CGGCGCGCGC CAACGGGCCC
TTCATCAAGA TCAACTGCGC CGCGCTGCCG GAGTCGGTGT TGGAATCCGA ACTGTTCGGC
CACGAGAAGG GGGCTTTCAC CGGCGCGATC GCTTCACGCA AGGGCCGGTT CGAACTCGCC
GACAAGGGCA CGCTGTTTCT CGACGAGATC GGCGAGATCT CGGCCTCGTT CCAGGCCAAG
CTGCTGCGCG TGCTGCAGGA GCAGGAGTTC GAGCGGGTCG GCGGCAACCA GACCATCAAG
GTCAACGTCC GAATCGTCGC GGCGACCAAC CGCAATCTCG AAGAGGCCGT CGCCCGCAAG
GAGTTCCGCG CCGATCTGTA CTACCGCATC AACGTCGTGC CGATGATCCT GCCGCCGCTG
CGCGATAGGC CGACCGATAT CCCATTGCTG GCGAGCGAGT TCCTGAAGAA CTTCAACAAG
GAGAACGATC GCGAACTGCA ATTCGAGCCG CATGCGCTGG AATTGCTGAA GGCGTGCTCG
TTCCCGGGCA ACGTTCGCGA ACTCGAGAAC TGCGTGCGGC GCACGGCGAC GTTGGCGATC
GGGCCGGAAA TTACCGACAG CGATTTCGCC TGCCATCAGG ACGAATGCCT GTCGGCGATC
TTGTGGAAGG GCCACGCCGA ACCGGCACCG GTACGGCCGC GGCCGCAGAT TCCGCTGCAG
GTGATGCCGC GCAAGGCGCC GCTCGAAGTC GTGGCGCCGC GCGAGGCCGT GAGTGTGTCG
CCCGATCCGG TGTCGACACC CATGTCCGCC GAATCGGCCA ACGGCGGGCC GATGTCGGAG
CGCGAGCGCT TGGTCAACGC GATGGAGCGA TCCGGCTGGG TCCAGGCAAA GGCCGCCCGG
CTGCTCGGCC TGACGCCGCG GCAGATCGGC TACGCGCTGA AGAAGTACGA TATCGAGCTC
AAACACTTCT GA
 
Protein sequence
MAQREVRLVE SEQSRQPMNQ NPIPLSEIAL TGIFEISKIL TAPARLEVTL ANVVNLLQSF 
LQMRNGVVSL LADDSVPDIT VGVGWNEGSD NRYRARLPQK AIDQIVATSV PLVADNVAAH
PMFSAADALA LGATDETRVS FIGVPIRIDS RVVGTLTIDR VRDGQSIFRM DADVRFLTMV
ANLIGQTVKL HRVVARDRER LMAESHRLQK ELYELKPQRE RKRVRVDGIV GESPAIRTLL
AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARANGP FIKINCAALP ESVLESELFG
HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISASFQAK LLRVLQEQEF ERVGGNQTIK
VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPTDIPLL ASEFLKNFNK
ENDRELQFEP HALELLKACS FPGNVRELEN CVRRTATLAI GPEITDSDFA CHQDECLSAI
LWKGHAEPAP VRPRPQIPLQ VMPRKAPLEV VAPREAVSVS PDPVSTPMSA ESANGGPMSE
RERLVNAMER SGWVQAKAAR LLGLTPRQIG YALKKYDIEL KHF