Gene RPB_1591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1591 
Symbol 
ID3910062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1794590 
End bp1795879 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID637883487 
Producthypothetical protein 
Protein accessionYP_485212 
Protein GI86748716 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGTC GAACCATCCT TGAGTGGGAG ACGATACGTT ACGGTGATGC GGCTGATGAA 
ATCCCTGCCG ATGCCGCTGA TCGGATTGCA GCGGTGGCGT CGGCATCCCC GCTTGCTGGT
CGCGGTGGCC TCGGTGTCTT AGAACATGGC CGTAAGGGAC TACGCGCGAG AGGGGTCGTC
GGCGTAGTTG CTGCAGAGTC GGGTGCGCTG GAGATCCTGC CTAAGATTGA CTTCCCAGGT
GTAGATCGCG AGGAGGAGGC TGGTCGCATT CGCCGCCGCC TCATTCACAT GCTCGCTGTT
GTGCTGGACC TCAAGATCGA TGCGGGGAAA ATCACTGCGC TCGATTGGCA GCGTGACACT
TTGCTTGAAA TTCTGATTCG CTTGTTTTCG GACAAGTTAG CTGATTGTGT CCGGCAGGGT
ATGCCCCGAA GATACGTCGA GCATGATGAA GATCTATTTG TATTGCGCGG CCGAATCGAC
GTGAAGCGAC AGTTCACGAG TCTTGCCGCA GATGGCTCGC GTCTTGCCTG CCGCTATGAT
GTGCTGACTC CGGACATTGC ATTGAATCAA ATCATGAAGG CGGCTGTCGC CCGGCTCCTT
CGCGTCACGC GAGCGAACTT CAACCAGCGT AGGTTGCGCG AATTGGCTTT CGCATATGCT
GATATAGCCG ATGTGCCGGT CTCGATGCTT CGCTGGGATC AAGTCATGCT TGATCGCACG
AACAGCCGTT GGCGTGAATT GCTCAACTTG GCGCGCCTAT TACTGGGTGA TCGGTTCCAA
GCAACTTCCG CCGGTAGCAG CAACGGGTTT TCTCTCTTGT TCGAGATGAG CGTATTGTTT
GAGGAGTACA TCGCGCGGGT TTTGAAGGCG TCGCTAGTTG ATACGGACCT TCATATCATC
AGCCAAGGAG GGAGGATCTA TTGTCTTCAA ACCGAAGATC GTCGTGGTCT TTTCCAAACC
CGTCCAGACA TCTTGGTAAA GCGCGGCGGA GATGTTGTGA AGATCATTGA TACTAAGTGG
AAGCGGATCT CTGCGCGAAT AGACGATCCC AAGCAGGGTG TGTCGCAAGG GGATATTTAT
CAGATGATGG CGTACGGTCA GCTTTACGGA TGCGACAGGC TCACACTTCT CTATCCGCAC
CACGCATCCA TGAATAGCAG CGAGGGTGTG CACTCAGCGC ATCGAATCGC CGGTTGTGAC
CGTCGGCTAG AGATGGCAAC AATTGATATT GGTCGCAGCG AGTTCTTTCG TGAGCGTTTG
AGGGCGATCA CCGTTCAATC GGAATGTTGA
 
Protein sequence
MIRRTILEWE TIRYGDAADE IPADAADRIA AVASASPLAG RGGLGVLEHG RKGLRARGVV 
GVVAAESGAL EILPKIDFPG VDREEEAGRI RRRLIHMLAV VLDLKIDAGK ITALDWQRDT
LLEILIRLFS DKLADCVRQG MPRRYVEHDE DLFVLRGRID VKRQFTSLAA DGSRLACRYD
VLTPDIALNQ IMKAAVARLL RVTRANFNQR RLRELAFAYA DIADVPVSML RWDQVMLDRT
NSRWRELLNL ARLLLGDRFQ ATSAGSSNGF SLLFEMSVLF EEYIARVLKA SLVDTDLHII
SQGGRIYCLQ TEDRRGLFQT RPDILVKRGG DVVKIIDTKW KRISARIDDP KQGVSQGDIY
QMMAYGQLYG CDRLTLLYPH HASMNSSEGV HSAHRIAGCD RRLEMATIDI GRSEFFRERL
RAITVQSEC