Gene Information |
Locus tag | RPD_3765 |
Symbol | |
ID | 4024281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4201498 |
End bp | 4202547 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963969 |
Product | squalene/phytoene synthase |
Protein accession | YP_570887 |
Protein GI | 91978228 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
|
|
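The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4201498, 4202547)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Strand does not enter this calculation: a gene on the minus strand spans the same number of bases as one on the plus strand.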
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.142197 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00585073 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGTAC ATTCCGATAT GCTGGCCTGC CGCGTGATGA TCAAGGAAGG TTCGCGCACG TTTCACGCCG CGTCGAAGGT GCTGCCGCGC CGGATCAGCG ATCCGGCGAT TGCGCTGTAC GCGTTTTGTC GCGTCGCCGA CGACGCCGTC GATCTCGGCC TCGATCGCAG CAATGCGGTC GAAGTGCTGA AGGACCGACT CGACCGCGCC TGTCGCGGTC TGCCCCGGCC ATATCCGTCG GATCGCGCCT TCGCCGACGT GATCGCGCGC TTCTCGATTC CGCCGGCGAT TCCCGAGGCG CTGATCGAAG GTCTCGAATG GGACTCGCAG GGCCGCCGCT TCGAGACGTT GTCTGATCTC TACGGCTATG CCGCGCGCGT CGCCGGCACC GTCGGCGTGA TGATGACGCT GGTGATGGGG CAGCGCAGAC CGGACATTGT CGCGCGTGCC TGCGATCTCG GCTGCGCGAT GCAACTCACC AACATCGCCC GCGACATCGG CGAGGACGCG CGCAACGGCC GCATTTACAT GCCGCTGTCG TGGATGCGCG AAGCCGGGCT CGATCCGGAA AAGTGGCTCG CCGACCCGAA ATTCACGCCG GAGATCGCCG GCATCGTCAA GCGGCTGATC GACACCGCGG ATGCGCTGTA CGATCGCGCC ACGCTCGGCA TCGCGAACCT GCCGCGCTCG TGCCGTCCCG GTATCTTCGC CGCGCGTGCG CTATATGCCG AGATCGGCCG CGAGGTCGAA CGCTCCGCGC TCGATTCGGT GTCGGCTCGT GCGGTGGTCT CGACCGGGCG CAAGCTCGCT GTGCTGTCGC GGATGCTGGC GTTCCAGGAA ACCCAATGGG CGCCCGCGAA GAATCTGCCG GCCAAGTTGG GCGACATGGA AGAAACCCGC TTCCTGATCG ATGCGGTGAT CGCCCATCCG GTCCGCGACT TGCAGCCGAT GCCGCAGGTC AAGCCGATCG AGCAGAAGGT CGCCTGGCTG GTCGACCTGT TCACGCGGCT CGAACGCCGC GACCAGATGC TGCAACGCAG CCGGGTGTAG
|
Protein sequence | MTVHSDMLAC RVMIKEGSRT FHAASKVLPR RISDPAIALY AFCRVADDAV DLGLDRSNAV EVLKDRLDRA CRGLPRPYPS DRAFADVIAR FSIPPAIPEA LIEGLEWDSQ GRRFETLSDL YGYAARVAGT VGVMMTLVMG QRRPDIVARA CDLGCAMQLT NIARDIGEDA RNGRIYMPLS WMREAGLDPE KWLADPKFTP EIAGIVKRLI DTADALYDRA TLGIANLPRS CRPGIFAARA LYAEIGREVE RSALDSVSAR AVVSTGRKLA VLSRMLAFQE TQWAPAKNLP AKLGDMEETR FLIDAVIAHP VRDLQPMPQV KPIEQKVAWL VDLFTRLERR DQMLQRSRV
|
| |
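The sequence fields above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch for stripping that layout, computing GC content, and translating the coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; the alternative start codons that table 11 permits are not handled, which does not matter for a gene that starts with ATG.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGACCGTAC ATTCCGATAT GCTGGCCTGC"
print(translate(prefix))  # MTVHSDMLAC
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields. The gene sequence as listed already reads in the coding direction (it begins with ATG), so no reverse complement is needed even though the gene lies on the minus strand.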