Gene Rpal_5113 details


Gene Information

Locus tag: Rpal_5113
Symbol:
ID: 6412807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5494304
End bp: 5496058
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 65%
IMG OID: 642714998
Product: transcriptional regulator, NifA, Fis Family
Protein accession: YP_001994077
Protein GI: 192293472
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
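
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5494304
end_bp = 5496058

# Assumption: coordinates are 1-based and inclusive,
# so both end points count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is
# not counted as part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1755 bp) and protein length (584 aa).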


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAGC GCGAAATTCG CCTTGTCGAT AACGAGTACC CGTCGCCTTC GATGACCCAT 
CCTCCGATAC CGCTGAGTGA CATCGCGCTC ACCGGCATTT TCGAGATCTC GAAAATCCTC
ACCTCACCGG CGCGGTTGGA GATCACCCTC GCCAACGTCG TCAACCTGCT GCAGTCATTT
TTGCAGATGC GCAACGGCGT GGTGTCGCTG CTCGCCGATG ACGGCGTGCC CGATATTACC
GTCGGGGTCG GCTGGAATGA GGGGAGCGAT AACCGCTATC GCGCCCGGCT GCCGCAGAAG
GCGATCGACC AGATCGTCGC GACCGCGGTG CCGCTGGTCG CCGACAACGT CTCTGCCCAT
CCGATGTTCA CCGCCGCCGA TGCCATGGCG CTCGGCGCCA CCGACGAAAT CCGGGTGTCG
TTCATCGGCG TGCCGATCCG GATCGACTCA CGGGTGGTCG GCACGCTAAG CATCGACCGC
GTCCGCGATG GCCGTTCGCA CTTCCGGATG GACGCCGACG TGCGCTTCCT CACCATGGTG
GCCAATCTGA TCGGCCAAAC CGTGAAGCTG CACCGCGTCG TCGCGCGCGA CCGCGAGCGG
CTGATGGCAG AAAGCCACCG GTTGCAGAAG GAGCTGTCCG AGCTGAAGCC GGAGCGCGAG
CGCAAGCGGG TCAAGGTCGA CGGCATCGTC GGCGAGAGCC CGGCGATCCG CAAACTGCTG
GCCAAGGTCA GCATCATCGC CAAGTCGCAG TCGCCCGTGT TGCTGCGCGG CGAGTCGGGA
ACCGGCAAGG AGCTGATCGC AAAAGCGATC CACGAATTGT CGGCGCGCGC CAACGGCCCG
TTCATCAAGA TCAACTGCGC GGCGCTGCCG GAATCGGTGC TGGAGTCCGA GCTGTTCGGG
CACGAGAAGG GCGCGTTCAC CGGCGCGATC GCCTCGCGCA AGGGCCGGTT CGAGCTGGCC
GACAAGGGCA CGCTGTTCCT CGACGAGATC GGTGAGATCT CCGCGTCGTT CCAGGCCAAG
CTGCTGCGCG TCTTGCAGGA GCAGGAATTC GAACGGGTCG GCGGCAACCA GACCATCAAG
GTCAATGTCC GGATCGTCGC CGCGACCAAC CGCAATCTGG AAGAGGCAGT GGCGCGCAAG
GAATTCCGCG CCGATCTGTA TTACCGCATC AATGTAGTGC CGATGATCCT GCCGCCGCTG
CGCGACCGGC CCAGCGACAT CCCGCTGCTG GCGAGCGAAT TCCTGAAGAA CTTCAACAAG
GAGAACGGCC GCGAGCTGGC CTTCGAGTCG CACGCGCTGG ATCTGCTGAA GGCCTGCTCG
TTCCCCGGCA ACGTCCGCGA GCTGGAGAAC TGCGTGCGCC GCACCGCCAC CCTGGCGATG
GGGCCGGAAA TCCGCGACAG CGATTTCGCC TGTCACCAGG ACGAATGCCT GTCGGCGATC
CTGTGGAAGG GGCACGCCGA ACCTGCGCCC GAGCGCCCAC GCCCTGAGAT CCCGTTGCAG
GTCCTGCCGC GCAAGGCACC GGTGGAAATC GTCCATCCGC GCGAGCCGGT CGCATCCGCG
GATGATTTTG CGCCGGCGCC GGTTCGTTCC GAGATGCCAT CCGACGAATC GAACATGTCG
GAGCGCGAGC GGCTGATCAA CGCCATGGAG CGAGCCGGGT GGGTGCAGGC GAAGGCCGCA
CGCATTCTCG GCCTCACGCC GCGCCAGATC GGCTACGCGC TGAAGAAGCA CAACATCGAG
CTCAAGCACT TCTGA
 
Protein sequence
MAQREIRLVD NEYPSPSMTH PPIPLSDIAL TGIFEISKIL TSPARLEITL ANVVNLLQSF 
LQMRNGVVSL LADDGVPDIT VGVGWNEGSD NRYRARLPQK AIDQIVATAV PLVADNVSAH
PMFTAADAMA LGATDEIRVS FIGVPIRIDS RVVGTLSIDR VRDGRSHFRM DADVRFLTMV
ANLIGQTVKL HRVVARDRER LMAESHRLQK ELSELKPERE RKRVKVDGIV GESPAIRKLL
AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARANGP FIKINCAALP ESVLESELFG
HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISASFQAK LLRVLQEQEF ERVGGNQTIK
VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPSDIPLL ASEFLKNFNK
ENGRELAFES HALDLLKACS FPGNVRELEN CVRRTATLAM GPEIRDSDFA CHQDECLSAI
LWKGHAEPAP ERPRPEIPLQ VLPRKAPVEI VHPREPVASA DDFAPAPVRS EMPSDESNMS
ERERLINAME RAGWVQAKAA RILGLTPRQI GYALKKHNIE LKHF
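
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a hedged example, not the method used to produce the record. It builds the codon table from the standard NCBI layout (base order TCAG), on the understanding that translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here. The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
# Standard codon layout, bases ordered T, C, A, G.
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGGCTCAGC GCGAAATTCG CCTTGTCGAT"
last_block = "CTCAAGCACT TCTGA"

print(translate(first_block))  # MAQREIRLVD
print(translate(last_block))   # LKHF
```

The two outputs match the first ten and last four residues of the protein sequence listed above. The last block is in frame because the preceding 1740 bases are a multiple of three.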