Gene Information |
Locus tag | Rpal_5113 |
Symbol | |
ID | 6412807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5494304 |
End bp | 5496058 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714998 |
Product | transcriptional regulator, NifA, Fis Family |
Protein accession | YP_001994077 |
Protein GI | 192293472 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
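The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the stated gene length is consistent with that reading) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5494304
end_bp = 5496058
gene_length_bp = 1755
protein_length_aa = 584

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)  # 1755 585 0 584
```

Under these assumptions the 1755 bp span gives 585 codons, i.e. 584 residues plus one stop codon, which agrees with the Gene Length and Protein Length fields.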
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGC GCGAAATTCG CCTTGTCGAT AACGAGTACC CGTCGCCTTC GATGACCCAT CCTCCGATAC CGCTGAGTGA CATCGCGCTC ACCGGCATTT TCGAGATCTC GAAAATCCTC ACCTCACCGG CGCGGTTGGA GATCACCCTC GCCAACGTCG TCAACCTGCT GCAGTCATTT TTGCAGATGC GCAACGGCGT GGTGTCGCTG CTCGCCGATG ACGGCGTGCC CGATATTACC GTCGGGGTCG GCTGGAATGA GGGGAGCGAT AACCGCTATC GCGCCCGGCT GCCGCAGAAG GCGATCGACC AGATCGTCGC GACCGCGGTG CCGCTGGTCG CCGACAACGT CTCTGCCCAT CCGATGTTCA CCGCCGCCGA TGCCATGGCG CTCGGCGCCA CCGACGAAAT CCGGGTGTCG TTCATCGGCG TGCCGATCCG GATCGACTCA CGGGTGGTCG GCACGCTAAG CATCGACCGC GTCCGCGATG GCCGTTCGCA CTTCCGGATG GACGCCGACG TGCGCTTCCT CACCATGGTG GCCAATCTGA TCGGCCAAAC CGTGAAGCTG CACCGCGTCG TCGCGCGCGA CCGCGAGCGG CTGATGGCAG AAAGCCACCG GTTGCAGAAG GAGCTGTCCG AGCTGAAGCC GGAGCGCGAG CGCAAGCGGG TCAAGGTCGA CGGCATCGTC GGCGAGAGCC CGGCGATCCG CAAACTGCTG GCCAAGGTCA GCATCATCGC CAAGTCGCAG TCGCCCGTGT TGCTGCGCGG CGAGTCGGGA ACCGGCAAGG AGCTGATCGC AAAAGCGATC CACGAATTGT CGGCGCGCGC CAACGGCCCG TTCATCAAGA TCAACTGCGC GGCGCTGCCG GAATCGGTGC TGGAGTCCGA GCTGTTCGGG CACGAGAAGG GCGCGTTCAC CGGCGCGATC GCCTCGCGCA AGGGCCGGTT CGAGCTGGCC GACAAGGGCA CGCTGTTCCT CGACGAGATC GGTGAGATCT CCGCGTCGTT CCAGGCCAAG CTGCTGCGCG TCTTGCAGGA GCAGGAATTC GAACGGGTCG GCGGCAACCA GACCATCAAG GTCAATGTCC GGATCGTCGC CGCGACCAAC CGCAATCTGG AAGAGGCAGT GGCGCGCAAG GAATTCCGCG CCGATCTGTA TTACCGCATC AATGTAGTGC CGATGATCCT GCCGCCGCTG CGCGACCGGC CCAGCGACAT CCCGCTGCTG GCGAGCGAAT TCCTGAAGAA CTTCAACAAG GAGAACGGCC GCGAGCTGGC CTTCGAGTCG CACGCGCTGG ATCTGCTGAA GGCCTGCTCG TTCCCCGGCA ACGTCCGCGA GCTGGAGAAC TGCGTGCGCC GCACCGCCAC CCTGGCGATG GGGCCGGAAA TCCGCGACAG CGATTTCGCC TGTCACCAGG ACGAATGCCT GTCGGCGATC CTGTGGAAGG GGCACGCCGA ACCTGCGCCC GAGCGCCCAC GCCCTGAGAT CCCGTTGCAG GTCCTGCCGC GCAAGGCACC GGTGGAAATC GTCCATCCGC GCGAGCCGGT CGCATCCGCG GATGATTTTG CGCCGGCGCC GGTTCGTTCC GAGATGCCAT CCGACGAATC GAACATGTCG GAGCGCGAGC GGCTGATCAA CGCCATGGAG CGAGCCGGGT GGGTGCAGGC GAAGGCCGCA CGCATTCTCG GCCTCACGCC GCGCCAGATC GGCTACGCGC TGAAGAAGCA CAACATCGAG CTCAAGCACT TCTGA
|
Protein sequence | MAQREIRLVD NEYPSPSMTH PPIPLSDIAL TGIFEISKIL TSPARLEITL ANVVNLLQSF LQMRNGVVSL LADDGVPDIT VGVGWNEGSD NRYRARLPQK AIDQIVATAV PLVADNVSAH PMFTAADAMA LGATDEIRVS FIGVPIRIDS RVVGTLSIDR VRDGRSHFRM DADVRFLTMV ANLIGQTVKL HRVVARDRER LMAESHRLQK ELSELKPERE RKRVKVDGIV GESPAIRKLL AKVSIIAKSQ SPVLLRGESG TGKELIAKAI HELSARANGP FIKINCAALP ESVLESELFG HEKGAFTGAI ASRKGRFELA DKGTLFLDEI GEISASFQAK LLRVLQEQEF ERVGGNQTIK VNVRIVAATN RNLEEAVARK EFRADLYYRI NVVPMILPPL RDRPSDIPLL ASEFLKNFNK ENGRELAFES HALDLLKACS FPGNVRELEN CVRRTATLAM GPEIRDSDFA CHQDECLSAI LWKGHAEPAP ERPRPEIPLQ VLPRKAPVEI VHPREPVASA DDFAPAPVRS EMPSDESNMS ERERLINAME RAGWVQAKAA RILGLTPRQI GYALKKHNIE LKHF
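The gene sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a rough consistency check between the two sequence fields, the sketch below translates the first and last few codons and compares them with the ends of the protein sequence. It is a minimal illustration, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ only in which codons may serve as start codons).

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, third fastest).
# These assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Hypothetical helper for illustration; spaces (as used in the record's
    10-base blocks) are ignored, and a trailing partial codon is dropped.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases, copied from the gene sequence above.
gene_start = "ATGGCTCAGC GCGAAATTCG CCTTGTCGAT"
gene_end = "CTCAAGCACT TCTGA"

print(translate(gene_start))  # MAQREIRLVD
print(translate(gene_end))    # LKHF
```

Both fragments match the corresponding ends of the protein sequence (MAQREIRLVD ... LKHF). The last 15 bases are in frame because the full gene length, 1755 bp, is a multiple of three.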