Gene RPD_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3956 
Symbol 
ID4024472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4404206 
End bp4405396 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content66% 
IMG OID637964158 
Producthypothetical protein 
Protein accessionYP_571076 
Protein GI91978417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGATCT GCGCCATTTG TGGCCAGGAC ATAGCGCCCG CCGATGACTC CGAGGAGCAC 
ATTCTGCCCG GCGCGATCGG CGGGCGCCGA ACCGTGGGCG GATTCCTGCA CGACGGCTGC
AATCACCGCT CGGGCCATAC CTGGGACGCG GCGCTTGAAA AGCAGCTGCG ACCGCTGGCT
CTACATTTCG GCGTGAAGCG CCAGCGTGGG CGTACCTTAC GCATGGCGGT CACCACGACC
GCTGGGGAGA ATCTCCTGCT AAATGCCGGC GGCCAGCTGG AAATGGCCCG GCCGGAGATC
AAGCGCACGC CGATCCCGGA TGGCGAGACC ATCGCGGTCA AGGCCGGGTC GATCGCCCAG
GCGCGCGACG TGCTGGAAGG GGTGAAGCGT AAATATCCGA AGGTGGACGT GGAGGCGGCC
TTGGCCGGCG CCGAAATCCA ACGGTCCTAC GCGAAGGGTG TCGTGTGCAT TGACGTGAAC
TTCGGGGGCC CATTGTCGGG CCGGTCCCTC GTCAAGAGCG CCTTGGCGCT GGCACACGAG
ACCGGCCTCC CGATAGGCCA ATGCCGCGAC GCGTCGGCCT ACCTGCGCGA AGCCGACGCA
GAACCCTGCT TCGGCTATTA CTACGTCGAT GACCTCGTCG ACGGCCGGCC GCCGGCGATG
CCGCTGCACT GTGTCGCCAT CGATGCCAAC CCTGAGACAG GATTGATCTT GGGCTACGTA
GAATACTTCG GCATCCACCG GGCCGTGGTC TGCCTCGGCC GTGACTATGT CGGCGACCGC
CTGAAGGCGG TCTACGCCCT CGATCCGCGC ACCGGCGAGA CCGTGGAGGT AGCGGTGCGC
CTCGATTTCG ACGTCGCCGA TATGCGGGCG ATCTACGATT ATGGGCGCGA CGATGCCGAA
AAGCGGCAAG AGGCCTTCGG CGCCGTGTTC GGACCAGTCT TGGGCTCGCA TCAGGCCGCT
GAACGTGACC GGGTGGTCCA CGACAGCCTG AACTTCGCCT GGGCCAACTG CGGCGGCGTG
CCGGATCAGC CACTGACCGC TGAGCATCTA GCAAAGCTGA TGGAGCTGTT CGCCGACCGT
GCCACGCCGT GGTGGAAGCA CGTCACGGGC CTGAGCGATG CGGCGGCTCG CCAACTCGCC
TTGGCCTATA TCAGCCAGGT GCTGGCCGTA ACGCAATCGA CGCCGGTTTA G
 
Protein sequence
MPICAICGQD IAPADDSEEH ILPGAIGGRR TVGGFLHDGC NHRSGHTWDA ALEKQLRPLA 
LHFGVKRQRG RTLRMAVTTT AGENLLLNAG GQLEMARPEI KRTPIPDGET IAVKAGSIAQ
ARDVLEGVKR KYPKVDVEAA LAGAEIQRSY AKGVVCIDVN FGGPLSGRSL VKSALALAHE
TGLPIGQCRD ASAYLREADA EPCFGYYYVD DLVDGRPPAM PLHCVAIDAN PETGLILGYV
EYFGIHRAVV CLGRDYVGDR LKAVYALDPR TGETVEVAVR LDFDVADMRA IYDYGRDDAE
KRQEAFGAVF GPVLGSHQAA ERDRVVHDSL NFAWANCGGV PDQPLTAEHL AKLMELFADR
ATPWWKHVTG LSDAAARQLA LAYISQVLAV TQSTPV