Gene Rpal_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2201 
Symbol 
ID6409861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2384437 
End bp2385444 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content62% 
IMG OID642712085 
Producttranscriptional regulator, AraC family 
Protein accessionYP_001991197 
Protein GI192290592 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGCCC GTGGCGCCGA CGTTCATGCC AACCCGCTCA GTTCTTTTCG CCATGTGAAG 
ACGGGAAGCA TCGAACTTCT GGAGCACGAG CTTGGTCGTT TTTATCCGGG CATCCGGTTT
GAATTGGACA ATCCCGACGG CGCACTCAAT GCGGAGGCCA GTCGATGTGA ATTGAGCGAT
ATTGCTCTGA CCTATGGACG GCACGGCACC GGCATCACCA TCGACGTCCC GTACAACAAC
ACGCATTCGC TGGTGTTCGC CTACGCCGGC AGCGCTGAAG CCCGCACGGG CCGACTCCGC
TCAGACATCG CGGGGCACCG CGCCTTCGTT GCCTCCGCAA CTCGGCCGGT GACGCTCAAA
TACGCGCCGG ATTTCGAGCA GCTCATTCTG AACGTTTCAC AGCGGTCGGT CACAACCAAC
CTCGAGGCGC TGATCGGCGC CCCGCTCAGT CAACCGATCA TCTTCAAGCC CACTTCGAAT
CTTCGGCGCA CGTCGGCGCG CAGGCTGTGG GAGCAGTTGA TGTCTCTCGT CGAGCGGCTG
GGCCGACACG ACGGTGGGTA TCATCAGCAA ATCACGACCG AGCTCGAGCA GGCGATCATC
CTGTCTTTCC TGACCGCCAA TGAGAGCAAC TACACTCCAT TGCTGATGAG CGAGGCCGCC
GCTGCCGGCA GGCGGCCGGT CCATAAGGTG GCGGACTATC TCGAGGCATA CTGGGACCAG
CCGCTGACAG TGGAGATGCT GGCGCGGGTG AGCGGAGTGA GCGTACGGAC TCTCTTCCAC
AGTTTCCGCG GGCAGTTCGG CTATTCCCCG ATGGAGTTCG TCAGGCGCAT TCGTCTTGAA
CGGGCCCGCC AGATGCTGGC CGGCGCCGAT CCGGCGCTTT CGGTGACATC GGTCGCGCTC
TCCTGCGGCT TTGGCAATCT TGGTCACTTC GCGGGCTATT ATAAGAAGGC GTTCGGCGAA
GCGCCGTCGG CCACGCTGTC GCGCGCCAGG AGCCCTGCAT CGTCCTGA
 
Protein sequence
MHARGADVHA NPLSSFRHVK TGSIELLEHE LGRFYPGIRF ELDNPDGALN AEASRCELSD 
IALTYGRHGT GITIDVPYNN THSLVFAYAG SAEARTGRLR SDIAGHRAFV ASATRPVTLK
YAPDFEQLIL NVSQRSVTTN LEALIGAPLS QPIIFKPTSN LRRTSARRLW EQLMSLVERL
GRHDGGYHQQ ITTELEQAII LSFLTANESN YTPLLMSEAA AAGRRPVHKV ADYLEAYWDQ
PLTVEMLARV SGVSVRTLFH SFRGQFGYSP MEFVRRIRLE RARQMLAGAD PALSVTSVAL
SCGFGNLGHF AGYYKKAFGE APSATLSRAR SPASS