Gene Information |
Locus tag | RPB_3986 |
Symbol | |
ID | 3911793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4553594 |
End bp | 4554955 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637885890 |
Product | Fis family transcriptional regulator |
Protein accession | YP_487590 |
Protein GI | 86751094 |
COG category | [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR02040] transcriptional regulator PpsR |
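
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the record uses 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values taken from the record for RPB_3986.
length = gene_length(4553594, 4554955)
print(length)                  # 1362
print(protein_length(length))  # 453
```

Strand does not affect this calculation: the start and end positions are given on the replicon, so the same formula applies to a gene on the minus strand.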
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.994596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.222002 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAA AACCCGCCCA ACCCGACATC ACCCTCCTTC TGGATATGGA TGGCGTGATC CGCGACGCTT CCCTTTCACC GGCTCTGGCC GGGGAAAGTG TTCAGGGTTG GCTCGGCCAA GCCTGGACCG ATGTTGCCGG CGATGATGGT GGCGACAAGG TCCGGCGCAT GGTCGAGGAT GCCCGGACCA GCGGAATCTC TGCATTTCGG CAGGTAAATC AACGATTTCC GTCCGGCGCC GAGCTCCCGA TCGAATTCAC CACCATGCTA CTCGGTGATC GCACCGGACT GCTCGCGGTG GGCAAGAACC TGCAGGCGGT GACCGAACTG CATGCGCGAC TGATTGCGGC GCAACAAACG ATCGAGCGCG ACTACTGGCG GCTGCGTGAA ATGGAAACGC GTTATCGGCT GGTATTCGAC GCTGCAACCG AAGCGGTGAT GATCGTCTCG GCGCACGATC TCCGGATCGT CGAGGCCAAT CGGGCCGCGG TGCAGGCCCT CAGCGGGGTC GACCGTGACA ACGAGGATGT CACCGGCCGC GAGATCTTGA ACGAGGTTGC GCAGCTCGAC CGCGATGCGG TTCGCGAAAT GCTGGCGCGG GTCCGTGATC GCGGCAAAGC GCTGAGCATC CTCGTGCATA TCGGCCGCGA TGCGCGGCCA TGGATGTTGC GTGGCTCGCT GGTGTCGTCG GAACAAGGGC AGGTGTTCCT GCTGCAATTC TCACCGGTCG CCACGACTTC GCGGGACGAC GAACGCGCCG AGCCGACCGT GCTCAAGACG CTGATCGACC GGGTGCCGGA CGGCTTCGTC GCGCTCGACG CCGGCGGCGT CATCCGCCAC GCCAATCAGG CCTTTCTGGA CCTCGTCCAG ATCGGCTCGA AGGGATCCGC GATCGGCGAA TCGCTCGGTC GTTGGCTCAA TCAGCCCGGT GCAGACCTCG CGGCGCTGAT GTCGCATCTG CAGCGCTACA AGACCGTGCG GCTGTTTCAG ACCTCCATTC GCGGCGAACT CGGGAGCGAG ACGGATGTCG AGATCTCGGC GGTCGACGGC GACGACCACA ACTACCTCGG TGTACTGATC AGGAATGTGT CGCGGCGTCT CGGCGGCGGC GAAAGCGATG CACTACGCTC GGCGCTCGGC CCGATCAGCA AGCAGCTCGG CCGCTCTTCG CTGAGAAAGC TGGTCAAGAA TACCGTAGGC ATCGTCGAAC GCCACTATGT CAAGGAAGCG CTGGAACTCA CCAAGGGCAA CCGCACGGCC ACTGCCGAAC TGCTGGGGTT GAGCCGGCAA AGTCTCTATG CCAAGCTCGC GCGCTATGGG CTCGACGACA AGGGTGCCGT CTCCCAAAAC GCCGAGGACT GA
|
Protein sequence | MSAKPAQPDI TLLLDMDGVI RDASLSPALA GESVQGWLGQ AWTDVAGDDG GDKVRRMVED ARTSGISAFR QVNQRFPSGA ELPIEFTTML LGDRTGLLAV GKNLQAVTEL HARLIAAQQT IERDYWRLRE METRYRLVFD AATEAVMIVS AHDLRIVEAN RAAVQALSGV DRDNEDVTGR EILNEVAQLD RDAVREMLAR VRDRGKALSI LVHIGRDARP WMLRGSLVSS EQGQVFLLQF SPVATTSRDD ERAEPTVLKT LIDRVPDGFV ALDAGGVIRH ANQAFLDLVQ IGSKGSAIGE SLGRWLNQPG ADLAALMSHL QRYKTVRLFQ TSIRGELGSE TDVEISAVDG DDHNYLGVLI RNVSRRLGGG ESDALRSALG PISKQLGRSS LRKLVKNTVG IVERHYVKEA LELTKGNRTA TAELLGLSRQ SLYAKLARYG LDDKGAVSQN AED
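
The gene sequence above is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly without reverse-complementing, even though the gene lies on the minus strand. The following is a minimal sketch of that translation plus a GC-fraction helper. It assumes the space-separated 10-base blocks are formatting only, and it uses the standard codon assignments, which translation table 11 shares for sense codons; alternative start codons permitted by table 11 are not handled. All names here are illustrative.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence from the record.
print(translate("ATGTCTGCAA AACCCGCCCA ACCCGACATC"))  # MSAKPAQPDI
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields of the record.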