Gene Rpal_0344 details


Gene Information

Locus tag: Rpal_0344
Symbol:
ID: 6407990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 362809
End bp: 363741
Gene Length: 933 bp
Protein Length: 310 aa
Translation table: 11
GC content: 69%
IMG OID: 642710254
Product: transcriptional regulator, AraC family
Protein accession: YP_001989380
Protein GI: 192288775
COG category: [F] Nucleotide transport and metabolism; [L] Replication, recombination and repair
COG ID: [COG0350] Methylated DNA-protein cysteine methyltransferase; [COG2169] Adenosine deaminase
TIGRFAM ID: [TIGR00589] O-6-methylguanine DNA methyltransferase
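
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(362809, 363741)
print(length)                     # 933
print(protein_length_aa(length))  # 310
```

Both values match the Gene Length and Protein Length fields of this record.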


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.251255
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGGCGGCGC GCGCCGCGGC GGATAACATC AGCGGTATGA TGAACCTTGC CATTGCCGAC 
CACCAGCTCA TCAAACCCGG CGCCCGCGAT TCCGCGTTGG CCGACTACGA CTGCGTGCGG
CGCGCCATCG CCTTCATCTC GCAGAAATGG AAGGCGCAGC CGACCATCGA GGCGATCGCC
GACGCCGCCG GCCTGACGCC CGACGAGCTG CACCATCTGT TCCGGCGCTG GGCGGGGCTG
ACGCCGAAGG CGTTCATGCA GGCGCTGACG CTCGACCACG CCAAATCGCT GCTGCGGGAT
TCCGCCAGCG TGCTCGATGC CGCGCTGGCC TCCGGGCTGT CCGGCCCCGG CCGACTGCAC
GATCTGTTCG TCACCCACGA GGCGATGTCG CCGGGCGAAT GGAAGAGCGG CGGCGCCGGG
CTCAGCCTGC GCTACGGCTT TCATCCGTCG CCGTTCGGCA CCGCGGTGAT CATCGCCTCC
GATCGCGGCC TTGCCGGTCT CGCCTTCGCC GACCCGGACG AGGAGCAGGC GGCGCTGGTC
GATCTGCAAC AGCGCTGGCC GCGCGCGGTG TGCACGCAGG ATCAGGACGC GACTGCTCCG
TTGGCGCGGC GGATCTTCGA TCCGGCACAA TGGCGTGCCG AGCAGCCGCT GCGGGTGGTG
CTGATCGGCA CCGATTTCGA AGTGCGGGTG TGGGAGACGC TGCTGAAGAT CCCGCTCGGC
AAGGCGGTTT GCTACTCGGA CATCGCCGCT AAGATCAGCC TACCGAAAGC CTCGCGCGCG
GTCGGCGCCG CGGTCGGCAA GAACCCGATC TCGTTCGTGG TGCCGTGCCA TCGCGCGCTT
GGCAAAGGCG GCGCACTCAC CGGCTATCAC TGGGGCCTGA CCCGCAAGCA GGCGATGATC
GGCTGGGAAG CCGGGCAACT CAGGGCAGAG TGA
 
Protein sequence
MAARAAADNI SGMMNLAIAD HQLIKPGARD SALADYDCVR RAIAFISQKW KAQPTIEAIA 
DAAGLTPDEL HHLFRRWAGL TPKAFMQALT LDHAKSLLRD SASVLDAALA SGLSGPGRLH
DLFVTHEAMS PGEWKSGGAG LSLRYGFHPS PFGTAVIIAS DRGLAGLAFA DPDEEQAALV
DLQQRWPRAV CTQDQDATAP LARRIFDPAQ WRAEQPLRVV LIGTDFEVRV WETLLKIPLG
KAVCYSDIAA KISLPKASRA VGAAVGKNPI SFVVPCHRAL GKGGALTGYH WGLTRKQAMI
GWEAGQLRAE
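
The gene sequence begins with GTG while the protein begins with M, because an alternative start codon is read as methionine under translation table 11. The sketch below illustrates this on the first 20 bases of the gene sequence. It is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, the start-codon set is a simplification limited to ATG, GTG and TTG, and the codon assignments are those of the standard code, which table 11 shares for elongation.

```python
# Standard codon assignments, ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)

# Simplified set of bacterial start codons (assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # a start codon is always read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_bases = "GTGGCGGCGC GCGCCGCGGC"  # first 20 bases of the gene sequence
print(translate(first_bases))   # MAARAA
print(gc_percent(first_bases))  # 95.0
```

Pasting the full gene sequence into `translate` and `gc_percent` should allow the Protein sequence and GC content fields to be checked in the same way.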