Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4010 |
Symbol | |
ID | 3911817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4576142 |
End bp | 4577191 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885914 |
Product | squalene/phytoene synthase |
Protein accession | YP_487614 |
Protein GI | 86751118 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.450209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.005496 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGTAC ATTCCGATAT GCTGGCCTGC CGCGAGATGA TCAAGGAAGG CTCTCGCACC TTCCACGCCG CCTCGATGGT GCTGCCGCGC CGGATCAGCG ATCCGGCGAT CGCGCTGTAC GCGTTCTGCC GCGTCGCCGA CGATGCCGTC GATCTCGGCC TCGATCGCAC CGCTGCGGTC GAAGTGCTGA AGGACCGGCT CGATCGCGCC TGCCGCGGCC TGCCGCGGCC GTATCCGTCG GACCGCGCTT TCGCCGACGT GATCGCGCGG TTCTCGATTC CGCCGGCGAT TCCCGAGGCG CTGATCGAAG GCCTCGAATG GGACTCGCAG GGCCGCCGCT TCGAGACGCT GTCGGATCTC TACGGCTATG CGGCGCGCGT CGCCGGCACC GTGGGGGTGA TGATGACCCT GGTGATGGGG CAGCGCCGGC CGGACATCGT CGCGCGCGCC TGCGATCTCG GCTGCGCGAT GCAGCTCACC AACATCGCCC GCGATATCGG CGAGGACGCC CGCAACGGCC GCATCTACAT GCCGCTGTCG TGGATGCGCG AGGCCGGGCT CGATCCCGAG ACCTGGCTCA AGGACCCGAA ATTCACGCCG GAGATCGCCG GCATCGTCAA GCGGCTGATC GACACCGCGG ATGCGCTGTA CGATCGCGCC ACGCTCGGCA TCGCCAATCT GCCGCGCTCC TGCCGCCCCG GCATCTTCGC GGCGCGCGCG CTCTACGCCG AGATCGGCCG CGAGGTCGAA CGCTCCGGGC TCGATTCGGT GTCGTCGCGC GCCGTGGTCT CGACCGGGCG CAAGCTCGCG GTGCTGGCGC GGATGCTGGC GTTCCAGGAA ACCGAATGGG CGCCGGCGAA ATATCTGCCG ACCCGGTTCG GCGACATGGA AGAGACTCGC TTCCTGATCG ACGCCGTGAT CGCCCATCCG GTCCGCGACG TGCCGCCGGC CCCGCGCGTC AAGCCGATCG AGCAGAAAGT CGCCTGGCTG GTCGACCTGT TCACGAGGCT GGAACGCCGT GACCAGATGC TGCAGCGCAG CCGGGTGTAG
|
Protein sequence | MTVHSDMLAC REMIKEGSRT FHAASMVLPR RISDPAIALY AFCRVADDAV DLGLDRTAAV EVLKDRLDRA CRGLPRPYPS DRAFADVIAR FSIPPAIPEA LIEGLEWDSQ GRRFETLSDL YGYAARVAGT VGVMMTLVMG QRRPDIVARA CDLGCAMQLT NIARDIGEDA RNGRIYMPLS WMREAGLDPE TWLKDPKFTP EIAGIVKRLI DTADALYDRA TLGIANLPRS CRPGIFAARA LYAEIGREVE RSGLDSVSSR AVVSTGRKLA VLARMLAFQE TEWAPAKYLP TRFGDMEETR FLIDAVIAHP VRDVPPAPRV KPIEQKVAWL VDLFTRLERR DQMLQRSRV
|
| |