Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0437 |
Symbol | |
ID | 3909993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 480510 |
End bp | 481406 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882323 |
Product | AraC family transcriptional regulator |
Protein accession | YP_484059 |
Protein GI | 86747563 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0350] Methylated DNA-protein cysteine methyltransferase [COG2169] Adenosine deaminase |
TIGRFAM ID | [TIGR00589] O-6-methylguanine DNA methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.988834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATC TCGCCCTCGC CGATCACCGC CTCACCAAAC CCGGCCCGCG CGACACCGCG CTCGCCGATT ACGACGCGGT GCGCCGCGCT ATCGCGTTCA TCTCGCAGAA GTGGAAAACG CAGCCGACCG TCGAGGCGAT CGCCGACGCG GCCGGGCTGA CGCCCGACGA ACTGCACCAT CTGTTCCGGC GCTGGGCCGG GCTGACGCCG AAGGCGTTCA TGCAGGCGCT GACGCTCGAC CATGCCAAAT CGCTGCTGCG CGGCTCCGCC AGCGTGCTCG ACGCGGCATT GGACTCCGGC CTGTCCGGCC CCGGCCGGCT GCACGATCTG TTCGTCACCC ATGAGGCGAT GTCGCCGGGC GAGTGGAAGA GTGGCGGCGC GGGCCTGACG CTGCGCTACG GCTATCACCC CTCGCCGTTC GGCACCGCCG TGGTGATCGC CTCCGAGCGC GGCCTCGCCG GCCTTGCTTT CGCCGATCCG GGCGGCGAGG AGGCGGCTTT CATGGACCTG CAGCAGCGCT GGCCGCGCGC CACCTGCATC GCCGATCAGG CGTTCACCGC GCCCTTCGCG CAGCGCGTGT TCGACCCGGT GCAATGGCGG CCCGAACAGC CGCTGCGGGT GGTGCTGATC GGCACCGATT TCGAGGTCCG CGTCTGGGAA ACGCTGCTGA AGATCCCGAT GGGACGGGCG CTGTGCTACT CCGACATCGC CCACCGGATC GCCTCGCCGA AGGCCTCGCG CGCGGTCGGC GCCGCGATCG GCAAGAACCC GATCTCGTTT GTGGTGCCTT GCCACCGCGC GCTCGGTAAA ACCGGCGCCC TCACCGGCTA TCATTGGGGC CTGACCCGCA AACAGGCGAT GATCGGCTGG GAGGCGGGGC GGCTCGGGGC GGGCTGA
|
Protein sequence | MMNLALADHR LTKPGPRDTA LADYDAVRRA IAFISQKWKT QPTVEAIADA AGLTPDELHH LFRRWAGLTP KAFMQALTLD HAKSLLRGSA SVLDAALDSG LSGPGRLHDL FVTHEAMSPG EWKSGGAGLT LRYGYHPSPF GTAVVIASER GLAGLAFADP GGEEAAFMDL QQRWPRATCI ADQAFTAPFA QRVFDPVQWR PEQPLRVVLI GTDFEVRVWE TLLKIPMGRA LCYSDIAHRI ASPKASRAVG AAIGKNPISF VVPCHRALGK TGALTGYHWG LTRKQAMIGW EAGRLGAG
|
| |