Gene Information |
Locus tag | RPD_3102 |
Symbol | |
ID | 4023607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3446710 |
End bp | 3447702 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963303 |
Product | helix-turn-helix, AraC type |
Protein accession | YP_570229 |
Protein GI | 91977570 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
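The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon (the gene sequence in this record ends in TGA); the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for RPD_3102.
length_bp = gene_length(3446710, 3447702)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the check gives 993 bp and 330 aa, matching the Gene Length and Protein Length fields.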
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0110134 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGGC GGACGATTTC ACCTCTGTTC GTAGAAGAAG TTTCGGATTG CCTGCGCCGC GCCGGCATTT CGCCTGGGCC GGTGCTGGCC TCCGCGGGCC TGCCCCAGAT CGTTCGCGAG CGGGTGTCGG CCGCGCGATT TGGCGCGCTG TGGCTAGCGG TGGCGGCGGC GATGAACGAC GAGTTCTTCG GCCTCGGCGG GCGACCGATG CGACCGGGCT CGTTCACGCT GCTCTGTCAT GCCGTGCTCA ATGCCCCGAC GCTGGAGCAG GCGCTCAACC GGGCACTCCG ATTCCTGAAG GTGGCGCTGG ACGATCCTTG CGGCGTGCTG CAGGTCGAAG GCGACATCGC CCGCATCGTG TTGAAAGACA AGGGCGCGCC GCGCTCGGCG TTTGCCTATC GCACTTTCTG GATCGTGGTG CATGGCCTCG CCTGCTGGCT GGTGGGCCGG CGCCTGCCGC TGCGGCGGGT CGATTTCGCC TGCCGGCCGC CGGAGTTCGC GGCGGACTAT CGGCTGTTTT TCGGGGCGTC GGTCAGGTTC GGCCAGCCAG AGAGCGCGTT GGCGTTCGAC GCCGCCTATC TCAAACTCCG CCCCAACCGC ACAGAACGGG CGCTGAAGGA ATTTTTGCGG CGGGCACCGG CCAACATCCT GGTGCGCTAC CGCCACGACG CCACCCTCAC TGCGGCAATC CGAAATACGC TGCGGGCGCG ACCGCCGACA GCGTGGCCGA ACTTCGAGAC GCTGGCGAAA CAAATGAAGA TACCGGCCTC CAGTTTGCGC CGCCGGCTGC GCGCAGAGGG ACAAACCTAC CAGACCATCA AGGATGAGAT ACGCCGGGTT CTCGCCATTC GATGGCTGGC GGAGAACGAA AAGCCGGTGG GCGACATCGC CGCTGACCTC GGCTTTGCCG AGCCCAGCGC TTTCCACCGT GCCTTCCGAA AGTGGATGGC GAAAAGCCCA GGCGCCTTCC GCCGCGAGGC GCTGGTTGGG TGA
|
Protein sequence | MERRTISPLF VEEVSDCLRR AGISPGPVLA SAGLPQIVRE RVSAARFGAL WLAVAAAMND EFFGLGGRPM RPGSFTLLCH AVLNAPTLEQ ALNRALRFLK VALDDPCGVL QVEGDIARIV LKDKGAPRSA FAYRTFWIVV HGLACWLVGR RLPLRRVDFA CRPPEFAADY RLFFGASVRF GQPESALAFD AAYLKLRPNR TERALKEFLR RAPANILVRY RHDATLTAAI RNTLRARPPT AWPNFETLAK QMKIPASSLR RRLRAEGQTY QTIKDEIRRV LAIRWLAENE KPVGDIAADL GFAEPSAFHR AFRKWMAKSP GAFRREALVG
|