Gene RPC_4475 details


Gene Information

Locus tag: RPC_4475
Symbol:
ID: 3972487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4976682
End bp: 4978424
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 66%
IMG OID: 637927586
Product: transcriptional regulator NifA
Protein accession: YP_534317
Protein GI: 90425947
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
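
The coordinates and lengths in this record agree with each other, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the protein length does not count the stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 4976682
end_bp = 4978424

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1743 580
```

Under these assumptions the computed values match the reported gene length of 1743 bp and protein length of 580 aa.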


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.391339
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTATC GCGAAGCGCA CGTCGCCGAT GCCGAGGAAT CGCGCCCGAC CACTCTGATA 
CCTCTGAGCG AAATTGCTCT GACTGGTATC TTTGAGATCT CGAAGATCCT CACCGCGCCG
GCGCGTCTCG AAACCACGCT CGCCAATGTC GTCAATCTGC TGCAGTCGTT CATGCAGATG
CGGCATGGCA CGGTGTCGCT GCTGGCCGAC GATGGGGTGC CTGATATTAC CGTCGGCGCC
GGCTGGAACG AAGGCACCGA CGACCGCTAC CGCGCCCGCC TGCCGGCCAA GGCGATCGAC
CAGATCGTCG CGACCTCGGT GCCGCTGGTG GTCGAGAACG TTTCTTCGCA TCCGATGTTT
TCGCGCGCCG ATGCCGACGC GCTCGGGGCC TCGCCCGAGG TCCGCGTTTC GTTCATCGGC
GTGCCGATCC GGATCGATTC CCGGGTGGTC GGTACGCTGA CCATCGACCG CGTCCGCGAC
GGCCGCTCGA TCTTCCGGCT CGACGCCGAC GTCCGCTTCC TCACCATGAT CGCCAATCTG
ATCGGCCAGA CCGTCAAGCT GCACCGCGTG GTGGCGCGCG ACCGCGAGCG GCTGATGGCC
GAGAGCCACC GGCTGCAGAA GCAATTGTCC GAGCTGAAAC CGCCGCGCGA GCGCAAGAAG
GTCCGCGTCG ACGGCATCAT CGGCGAAAGC CAGGCGATCC GCGGGCTGCT CGCCAAGGTC
GGCATCATCG CCAAATCGCA TTCGCCGGTG CTGCTGCGCG GCGAGTCCGG CACCGGCAAG
GAGCTGATCG CCAAGGCGAT CCACGAATTG TCGTCGCGCG CCAACGGCCC GTTCATCAAG
ATCAACTGCG CGGCGTTGCC CGAATCGGTG CTGGAATCCG AATTGTTCGG CCACGAGAAG
GGCGCCTTCA CCGGCGCCAT CGCGTCGCGC AAGGGACGCT TCGAACTCGC CGACAAGGGC
ACGCTGTTCC TCGACGAGAT CGGCGAAATC TCGCCGTCGT TCCAGGCCAA GCTGTTGCGG
GTGTTGCAAG AGCAGGAGTT CGAGCGGGTC GGCGGCAACC ACACCATCAA GGTCAATGTC
CGCGTGGTGG CGGCCACCAA CCGCAATCTC GAGGAGGCGG TGGCGCGCAA CGAATTCCGC
GCCGACCTGT ACTATCGCAT CAATGTGGTG CCGATGATGC TGCCGCCGCT GCGTGATCGC
GCCAGCGACA TTCCGCTGTT GGCCAGCGAG TTCCTGAAGA ACTTCAACAG GGAAAACGAG
CGCGACCTGG AGTTCGATCC GGCCTCGATG GAGCTGCTGC AGGGGTGTTC GTTCCCCGGC
AACGTGCGCG AGCTGGAAAA CTGCGTGCGC CGCACCGCGA CGCTGGCGCC CGGTCCGGCG
ATTCACCAGG ACGACTTCGC CTGCCATCAT GACGAGTGCC TGTCGTCGAT TCTTTGGAAG
AGCCATTCGG AGCGCACCGC GCAGCGTCCG CCGCCGGAAA TTCCGCTTGC AGTCGCACCG
ATCGGCCGGG CCGACGGCCC CCGCGGCAAC GTTGCAGCAC CGGCGCCCAC CGTGCCGACG
CCGCAGCCGC CCGCCCGCGT CGAAGCGGCC TCCGACGCGC AGATGTCCGA GCGCGAGCGG
CTGGTCGACG CCATGGAACG CTCGGGCTGG GTGCAGGCCA AGGCGGCGCG CATCCTCGGG
CTGACGCCGC GGCAGATCGG CTACGCGCTG AAGAAGTACG ACATCGAGGT CAAGCACTTC
TGA
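
The record reports a GC content of 66% for this gene. As a rough illustration of how such a figure can be derived from the sequence text, here is a small Python sketch. The helper name `gc_percent` is hypothetical, and the database's own method and rounding are not documented here, so the result for the full sequence may differ slightly from the reported value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# Demonstration on the first three 10-base groups only, not the whole gene.
print(gc_percent("ATGGTCTATC GCGAAGCGCA CGTCGCCGAT"))
```

Passing the complete gene sequence, line breaks and spaces included, gives the figure for the whole gene.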
 
Protein sequence
MVYREAHVAD AEESRPTTLI PLSEIALTGI FEISKILTAP ARLETTLANV VNLLQSFMQM 
RHGTVSLLAD DGVPDITVGA GWNEGTDDRY RARLPAKAID QIVATSVPLV VENVSSHPMF
SRADADALGA SPEVRVSFIG VPIRIDSRVV GTLTIDRVRD GRSIFRLDAD VRFLTMIANL
IGQTVKLHRV VARDRERLMA ESHRLQKQLS ELKPPRERKK VRVDGIIGES QAIRGLLAKV
GIIAKSHSPV LLRGESGTGK ELIAKAIHEL SSRANGPFIK INCAALPESV LESELFGHEK
GAFTGAIASR KGRFELADKG TLFLDEIGEI SPSFQAKLLR VLQEQEFERV GGNHTIKVNV
RVVAATNRNL EEAVARNEFR ADLYYRINVV PMMLPPLRDR ASDIPLLASE FLKNFNRENE
RDLEFDPASM ELLQGCSFPG NVRELENCVR RTATLAPGPA IHQDDFACHH DECLSSILWK
SHSERTAQRP PPEIPLAVAP IGRADGPRGN VAAPAPTVPT PQPPARVEAA SDAQMSERER
LVDAMERSGW VQAKAARILG LTPRQIGYAL KKYDIEVKHF
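
The protein sequence is the translation of the gene sequence under translation table 11. The Python sketch below shows the correspondence on short fragments. It assumes that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base groups of the gene sequence.
print(translate("ATGGTCTATC GCGAAGCGCA CGTCGCCGAT"))  # MVYREAHVAD
```

The output matches the first ten residues of the protein sequence, and the final codons `AAG CAC TTC TGA` translate to `KHF` followed by a stop, matching its last three residues.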