Gene RPD_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4236 
Symbol 
ID4024757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4703609 
End bp4704874 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID637964442 
Producthypothetical protein 
Protein accessionYP_571354 
Protein GI91978695 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.046998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAT TGATCGACGC GCTGCGCAGC GGGGACTGGG TAACTCGCCC GCGGATCCGG 
CTGTGGGCGC TCGCGGTGCT CGCCGCCTCG CTGGGCGGGT TGCTGTATCT GGTCGCGACC
GCGAACGGGC TGAACGACTT CAAGGGCCGG CCGCTCGGCA CCGACTTCTC CGACATCTAT
GCCGGCGGCA CCTATGCGCT CGAGGGGCAG GCGGCGCTGG CGTTCGACCC CGAAACCCAA
CATGCCCGCG AGCAGACGAT CTTCGGCGCC GATACGCCGT TCTACGGCTG GCACTATCCG
CCGTTCCTGA TGTTCGTCGC CGCGCCGCTG GCGATGCTCC CTTATCCGTC GGCGCTTGCG
ATTTGGCAGA TCGCGACGCT GCTGATGTAT CTCGGAATGC TGGCGCTGGT GCTGCGCCTG
GCGTCGCGCG GACAGGGCGT CGACCCTCCA CAGCAGAAAT TGTGGCTGTT GCTCGCACTC
GCTTCGCCGG CTGCCTTCGT CAACATCTCC CATGGTCACA ATGGCTTTCT GACCGCGGCG
TTGATCGGCA CGGCGCTCGC GCTGCTCGAT CGACGGCCGA TGGTCGCGGG CGTCCTGATC
GGGCTGCTAT CCTACAAACC GCAATTCGGC GTGATGATTC CGCTGGTGCT GATCGCGACC
TGGCGCTGGC GCGCCTTCGT CTCTGCTGCG CTGACCGTGC TCGCGCTGGC GCTCGCCACC
ACGTTTGCTT TCGGCTTCGA GGTTTGGCGC GCCTTCTTCG AAGCAATGCC GTTCACCCAG
AAAGTGGTGC TGGAGCAGGG CGGCACCGGT TGGCACAAGA TCCAGTCGGT TTTCGCCTGG
GTGCGGATGT GGGGCGGCGG CGTGCAGCTC GCTTATGCGA TCCAGGGCGC TGTCATGGTG
ACGCTGGCGG CCGCGTTGGT CTGGCTGTGG CGCAGCCGTG CGGCGTATTC CCTGAAGGCC
GCGGCGCTGA TCGTCGCGTC GATTTTGGCG ACGCCCTACA GCCTCGACTA CGACTTCGTC
GCGCTCGCGC CGGCGATCGC CTTCCTGGCC GCCCACGGGC TCGCCCGCGG TTTCGCGCCG
TGGGAGAAGA CCGCGCTGGC GCTGCTGTGG CTGATGCCGC TGGTGGCGCG CGGCCTGGCC
GAGCAGACGC TGATCCCGCT CGGCGTGCCG TCGATGCTCC TGGTGTTCGT GCTGATCATC
AAGCGCGCTG CGCAGGAATC CGGCGCGCGC TCCGCGTCGT CGTCCACGCC GCAGCCGATC
GTCTGA
 
Protein sequence
MSRLIDALRS GDWVTRPRIR LWALAVLAAS LGGLLYLVAT ANGLNDFKGR PLGTDFSDIY 
AGGTYALEGQ AALAFDPETQ HAREQTIFGA DTPFYGWHYP PFLMFVAAPL AMLPYPSALA
IWQIATLLMY LGMLALVLRL ASRGQGVDPP QQKLWLLLAL ASPAAFVNIS HGHNGFLTAA
LIGTALALLD RRPMVAGVLI GLLSYKPQFG VMIPLVLIAT WRWRAFVSAA LTVLALALAT
TFAFGFEVWR AFFEAMPFTQ KVVLEQGGTG WHKIQSVFAW VRMWGGGVQL AYAIQGAVMV
TLAAALVWLW RSRAAYSLKA AALIVASILA TPYSLDYDFV ALAPAIAFLA AHGLARGFAP
WEKTALALLW LMPLVARGLA EQTLIPLGVP SMLLVFVLII KRAAQESGAR SASSSTPQPI
V