Gene Rpal_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1700 
Symbol 
ID6409357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1824252 
End bp1825319 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content66% 
IMG OID642711588 
ProductSqualene/phytoene synthase 
Protein accessionYP_001990703 
Protein GI192290098 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.898001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGC AATCCGATAT GTTGGCCCAA TCCGACATGC TGGCCTGCCG TGAGATGATC 
AAGGAAGGCT CGCACACCTT TCACGCGGCC TCCAAGGTGC TGCCGCGGCG GATCAGTGAT
CCGGCGATCG CGCTGTACGC GTTCTGCCGC GTCGCCGACG ACGCCGTGGA TCTCGGTCTC
GACCGCGCCG CCGCGGTCGA AGTGCTGAAG GACCGGCTCG ATCGCGCCTG CCGCGGCGTG
CCGCGTGCCT ATCCGTCCGA CCGCGCCTTC GCTGATGTGG TGGCGCGGTT TTCGATTCCG
CCGGCAATTC CCGAGGCGCT GATCGAGGGC CTGGAATGGG ATGCGCAGGG CCGTCGCTTC
GAGACGCTGT CGGATCTGTA TTCGTATTGC GCCCGCGTCG CCGGCACCGT TGGCGTGATG
ATGACGCTGG TGATGGGCCA GCGCAAACCC GACATCGTGG CGCGTGCCTG CGATCTCGGC
TGCGCGATGC AACTCACCAA TATCGCCCGC GACATCGGCG AGGATGCCCG TAACGGGCGC
ATCTATATGC CGCTGTCGTG GATGCGCGAA GCTGGCCTCG ATCCGGAGAC CTGGCTCGCC
AATCCGAAGT TCACGCCGGA GATCGCCAGC ATCGTCAAGC GGCTGATCGA CACTGCGGAT
GCGCTGTACG ATCGCGCGAC GCTCGGCATC GCCAACCTGC CGCGCTCCTG CCGTCCCGGC
ATTTTCGCAG CGCGCGCGCT GTACGCCGAG ATCGGCCGCG AGGTCGAGCG CTCCGGCCTC
GACTCGGTGT CGAGCCGTGC AGTGGTCTCA ACCGGCCGCA AGCTCGCCGT GCTGGCGCGG
CTGCTGGCGT TCCAGGAAAC CGAATGGGCG CCGGCGAAGT ATCTGCCGGC CAAGTTCGGC
GACATGGAAG AGACCAAGTT TCTGGTCGAC GCGGTGATCG CGCATCCGGT GCGCGAACTG
CCGGCGCGCC AGAAGGTCAA GCCGATTGAG CAGAAGGTCG CCTGGCTGGT CGACCTGTTC
ACCCGCCTCG AACGCCGCGA CCAGATGCTG CAACGCAGCC GGGTGTAG
 
Protein sequence
MSLQSDMLAQ SDMLACREMI KEGSHTFHAA SKVLPRRISD PAIALYAFCR VADDAVDLGL 
DRAAAVEVLK DRLDRACRGV PRAYPSDRAF ADVVARFSIP PAIPEALIEG LEWDAQGRRF
ETLSDLYSYC ARVAGTVGVM MTLVMGQRKP DIVARACDLG CAMQLTNIAR DIGEDARNGR
IYMPLSWMRE AGLDPETWLA NPKFTPEIAS IVKRLIDTAD ALYDRATLGI ANLPRSCRPG
IFAARALYAE IGREVERSGL DSVSSRAVVS TGRKLAVLAR LLAFQETEWA PAKYLPAKFG
DMEETKFLVD AVIAHPVREL PARQKVKPIE QKVAWLVDLF TRLERRDQML QRSRV