Gene RPB_4200 details

Gene Information

Locus tag: RPB_4200
Symbol:
ID: 3912008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4772482
End bp: 4773927
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 67%
IMG OID: 637886104
Product: XRE family transcriptional regulator
Protein accession: YP_487803
Protein GI: 86751307
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
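
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4772482
end_bp = 4773927
gene_length_bp = 1446
protein_length_aa = 481


def span_length(start, end):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len_bp):
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1446
print(expected_protein_length(gene_length_bp))  # 481
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.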


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.306631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGTG AATCCGGGAA AAAACTGTTC GTCGGCCCCC GGTTCCGGCG AATCCGGCAG 
CAACTCGGCC TGTCGCAGAC CCAGATTGCC GAAGGGCTCG GGATTTCGCC GAGCTATATC
AATCTGATCG AACGGAACCA GCGCCCCGTG ACCGCCCAGA TTCTGCTCAG GCTGGCGGAG
ACCTACGATC TCGATCTGCG TGACCTCGCG ACCGCCGACG AGGACCGTTT TTTCGCCGAA
CTCAACGAGA TCTTCTCCGA CCCGCTGTTC CGCCAGATCG ACCTGCCGAA GCAGGAACTG
CGCGACCTCG CCGAGCTGTG CCCCGGCGTC ACCCACTCGC TGCAGCGATT GTACGCCGCC
TACACCGAAG CGCGGCGCGG CGAGACGCTG GTCGCGGCGC AGATGGCCGA TCGCGACGAG
GGCACCAGGT TCGAGGCCAA CCCGATCGAG CGCGTCCGCG ACCTGATCGA GGCCAACCGC
AACTATTTCC CGGAGCTCGA GCAGGCCGCC GAGGCGGTGC GCGACGAACT GAATGTCGGC
TCGCAGGAGG TCTATGGCGC GCTCGCCGAC CGGCTGCGCG AGCGGCATTC GATCACCACC
CGGATCATGC CGGTCGACGT GATGCGCGAG ACGCTGCGCC GGTTCGACCG CCACCGCCGG
CAATTGCTGA TCTCCGAACT GGTCGACTCG CAGGGCCGCG CCTTCCAGGC CGCGTTCCAG
ACCGGCCTCA CCGAATATGG CAGCGTGATC GACGGCATCG TCAACCGCGC CGGCGCCCTC
GACGAGCCGG CGCGGCGACT CTACCGGATC ACGCTCGGCA ATTACTTCGC CGCCGCGCTG
ATGATGCCCT ACGCCGCTTT CCATGCCGCC GCCGAACAGC TCAGCTACGA TGTCAACGTG
CTGGCGCAGC GCTTCAACGC CGGCTTCGAG CAGGTCTGCC ATCGCCTCAC CACGCTGCAA
CGGCCGACCG CGCGCGGCGT GCCGTTCTTC CTGCTGCGGG TCGACAACGC CGGCAACGTC
TCCAAGCGGT TCTCCTCCGG CACCTTCCCG TTCTCGAAAT TCGGCGGAAC CTGCCCGTTG
TGGAACGTGC ACTCGACCTT CGATACGCCA GACCGGCTGC TGAAACAGGT GATCGAACTG
CCCGACGGCA GCCGCTATTT CTCGATCGCC CAGATGGTGC GCCGGCCGGT GGCGCCGCAC
CCGCAGCCGC AGCCGCGCTT CGCCATCGGG CTCGGCTGCG AAATCCGCCA CGCGTCGAAG
CTGACCTACG CCGCCGGCAT GGACCTGGAG AAAGCCGAAG GCACGCCGAT CGGCGTCAAC
TGCCGCCTCT GCGAACGCGA AAACTGCAGC CAGCGCGCCG AGCCGCCGAT CACCCGGACG
CTGATCCTGG ACGAGAACAC GCGGCGGGCG TCGAGCTTTG CGTTCAGCAA TGCAAGGGAG
TTGTGA
 
Protein sequence
MAGESGKKLF VGPRFRRIRQ QLGLSQTQIA EGLGISPSYI NLIERNQRPV TAQILLRLAE 
TYDLDLRDLA TADEDRFFAE LNEIFSDPLF RQIDLPKQEL RDLAELCPGV THSLQRLYAA
YTEARRGETL VAAQMADRDE GTRFEANPIE RVRDLIEANR NYFPELEQAA EAVRDELNVG
SQEVYGALAD RLRERHSITT RIMPVDVMRE TLRRFDRHRR QLLISELVDS QGRAFQAAFQ
TGLTEYGSVI DGIVNRAGAL DEPARRLYRI TLGNYFAAAL MMPYAAFHAA AEQLSYDVNV
LAQRFNAGFE QVCHRLTTLQ RPTARGVPFF LLRVDNAGNV SKRFSSGTFP FSKFGGTCPL
WNVHSTFDTP DRLLKQVIEL PDGSRYFSIA QMVRRPVAPH PQPQPRFAIG LGCEIRHASK
LTYAAGMDLE KAEGTPIGVN CRLCERENCS QRAEPPITRT LILDENTRRA SSFAFSNARE
L
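
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the database's own method: it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code, strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon ('*')."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as displayed above.
first_row = "ATGGCGGGTG AATCCGGGAA AAAACTGTTC GTCGGCCCCC GGTTCCGGCG AATCCGGCAG"
print(translate(first_row))  # MAGESGKKLFVGPRFRRIRQ
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` should allow the Protein Length and GC content fields to be checked in the same way.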