Gene RPD_3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3765 
Symbol 
ID4024281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4201498 
End bp4202547 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content66% 
IMG OID637963969 
Productsqualene/phytoene synthase 
Protein accessionYP_570887 
Protein GI91978228 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.142197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00585073 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGTAC ATTCCGATAT GCTGGCCTGC CGCGTGATGA TCAAGGAAGG TTCGCGCACG 
TTTCACGCCG CGTCGAAGGT GCTGCCGCGC CGGATCAGCG ATCCGGCGAT TGCGCTGTAC
GCGTTTTGTC GCGTCGCCGA CGACGCCGTC GATCTCGGCC TCGATCGCAG CAATGCGGTC
GAAGTGCTGA AGGACCGACT CGACCGCGCC TGTCGCGGTC TGCCCCGGCC ATATCCGTCG
GATCGCGCCT TCGCCGACGT GATCGCGCGC TTCTCGATTC CGCCGGCGAT TCCCGAGGCG
CTGATCGAAG GTCTCGAATG GGACTCGCAG GGCCGCCGCT TCGAGACGTT GTCTGATCTC
TACGGCTATG CCGCGCGCGT CGCCGGCACC GTCGGCGTGA TGATGACGCT GGTGATGGGG
CAGCGCAGAC CGGACATTGT CGCGCGTGCC TGCGATCTCG GCTGCGCGAT GCAACTCACC
AACATCGCCC GCGACATCGG CGAGGACGCG CGCAACGGCC GCATTTACAT GCCGCTGTCG
TGGATGCGCG AAGCCGGGCT CGATCCGGAA AAGTGGCTCG CCGACCCGAA ATTCACGCCG
GAGATCGCCG GCATCGTCAA GCGGCTGATC GACACCGCGG ATGCGCTGTA CGATCGCGCC
ACGCTCGGCA TCGCGAACCT GCCGCGCTCG TGCCGTCCCG GTATCTTCGC CGCGCGTGCG
CTATATGCCG AGATCGGCCG CGAGGTCGAA CGCTCCGCGC TCGATTCGGT GTCGGCTCGT
GCGGTGGTCT CGACCGGGCG CAAGCTCGCT GTGCTGTCGC GGATGCTGGC GTTCCAGGAA
ACCCAATGGG CGCCCGCGAA GAATCTGCCG GCCAAGTTGG GCGACATGGA AGAAACCCGC
TTCCTGATCG ATGCGGTGAT CGCCCATCCG GTCCGCGACT TGCAGCCGAT GCCGCAGGTC
AAGCCGATCG AGCAGAAGGT CGCCTGGCTG GTCGACCTGT TCACGCGGCT CGAACGCCGC
GACCAGATGC TGCAACGCAG CCGGGTGTAG
 
Protein sequence
MTVHSDMLAC RVMIKEGSRT FHAASKVLPR RISDPAIALY AFCRVADDAV DLGLDRSNAV 
EVLKDRLDRA CRGLPRPYPS DRAFADVIAR FSIPPAIPEA LIEGLEWDSQ GRRFETLSDL
YGYAARVAGT VGVMMTLVMG QRRPDIVARA CDLGCAMQLT NIARDIGEDA RNGRIYMPLS
WMREAGLDPE KWLADPKFTP EIAGIVKRLI DTADALYDRA TLGIANLPRS CRPGIFAARA
LYAEIGREVE RSALDSVSAR AVVSTGRKLA VLSRMLAFQE TQWAPAKNLP AKLGDMEETR
FLIDAVIAHP VRDLQPMPQV KPIEQKVAWL VDLFTRLERR DQMLQRSRV