Gene RPB_3660 details

Gene Information

Locus tag: RPB_3660
Symbol:
ID: 3911462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4201767
End bp: 4202765
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 64%
IMG OID: 637885562
Product: AraC family transcriptional regulator
Protein accession: YP_487266
Protein GI: 86750770
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
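
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record itself. Variable names are illustrative.

```python
# Values copied from the Gene Information record.
start_bp = 4201767
end_bp = 4202765

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp. Assumption: the last codon is a stop codon
# and is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 999 0 332
```

Under these assumptions the result agrees with the reported gene length of 999 bp and protein length of 332 aa.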


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.954099
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGAGCGA GGTCTGCTTT GCAGAGAGTT GTGTACAGCG CCCATGATTT GTCGCCCGGG 
CTCGACGACC AGGCGCGGTT CTCGCGGTGG CGCGATATCT ACACGGCCAG TTTCCTGCAG
AGCGGCAACG TGGTCCGATT ATCCGACAGG CCGTTCGCTG CGACCTGGGA GCACGCCCAG
ATCGGCGATA GTCTGGTGGC GCGTTTCGAA GGCACGTTGC AGCGCGTATC ACGCGATGCC
CAGCAGGTTG CGGCGCATCC GGTGGACAGA TTCTGCATCA GCTACAATCG GGCGTCGTCG
CGGCAGGCAA TGGTCCAGCG CGGCAGGGAA CTGACGCTGG AGCCGGGGAC GCCGGCCTTT
TTCAATCTGT CGGAGATGCT CGACTGCCGC TCCGAGCACG GCGAGGCCCG GATCGGATTC
ACCCTGCCGC GCAAGACATT GCTGGATAGC ATTCCGCATG CGGAAGATCT GGTGTTGCGA
CCGCTCGACC CGGGCGACGA CGCGCTGCTG CACCTGCGCT GGTATCTCGA CTTCCTGCTC
GAACGAGACG GTGCCGCGCT CGATCCGGCG ATGGTCGCGC ATGTGCAATC GGTCTTAATC
GATCTGCTCG GTCTCGCGCT CGGCGTCGGT CGCGATCTTG CAGAGGCCTC GAAACTGCGC
GGGCTGCGCG CTGTGCGTTT TATGACCATC GTCGCGGAGA TCGGGGCCGG CTTCGCCGAT
CCAGGATTTT CGGCGGCGCG GCTTGCCGCG AAGCTCAACC TGTCGTCGCG TTATATCCAG
GACATCCTGC ACGAAAGCGG GGTGACCCTG ACCGAGCGGG TGCTCGAGCT GCGGCTGCAG
AAGGCCCGCA GATTGCTGGC ATCCGGCCTG TCGCCCGCCT TGAAAGTCAC CGATATCGCG
CTGAGTTGCG GCTTCAGCGA CGTCTCCCAC TTCAACCACA GCTTCCGCCG CCGGTTCGGC
GCGTCGCCGA CCCAATTCCG GCCGCCGCGC ATCAACTAG
 
Protein sequence
MRARSALQRV VYSAHDLSPG LDDQARFSRW RDIYTASFLQ SGNVVRLSDR PFAATWEHAQ 
IGDSLVARFE GTLQRVSRDA QQVAAHPVDR FCISYNRASS RQAMVQRGRE LTLEPGTPAF
FNLSEMLDCR SEHGEARIGF TLPRKTLLDS IPHAEDLVLR PLDPGDDALL HLRWYLDFLL
ERDGAALDPA MVAHVQSVLI DLLGLALGVG RDLAEASKLR GLRAVRFMTI VAEIGAGFAD
PGFSAARLAA KLNLSSRYIQ DILHESGVTL TERVLELRLQ KARRLLASGL SPALKVTDIA
LSCGFSDVSH FNHSFRRRFG ASPTQFRPPR IN
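
Both sequences are printed in space-separated groups of ten residues, so the grouping has to be removed before the sequence can be compared with the reported length or GC content. The following is a minimal sketch of that step; `ungroup` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here for brevity. Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG can act as a start codon and is read as methionine in that position.

```python
def ungroup(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence; paste the full block here to check
# the whole gene against the reported length and GC content.
first_line = "GTGAGAGCGA GGTCTGCTTT GCAGAGAGTT GTGTACAGCG CCCATGATTT GTCGCCCGGG"

dna = ungroup(first_line)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Running the same two helpers on the complete gene sequence block should give a length and a GC percentage that can be compared with the Gene Length and GC content fields.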