Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2703 |
Symbol | |
ID | 5734584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3456600 |
End bp | 3457574 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279846 |
Product | squalene/phytoene synthase |
Protein accession | YP_001545469 |
Protein GI | 159899222 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1562] Phytoene/squalene synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.80466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTGGC CTTCGCTCGA TCCTCAAATG TGTGCTGGGT TGACCTTACC AGCTCAATTT ATTCCAACGG TTGCCGAGTG CGTGTCACAT CAGCCCAGCA ACCAATGGCT TAATCATGCT TATGCCCATT GTACCGAACT AACGAACGTT CACTCAAAGA GCTTCTATTT TAGTACTCAA TTGTTGCCTG CTCGGAAGCG TGATGCAATT CGGGTGTTGT ATGCTTTCTG TCGAACGAGC GACGACTTGG TAGACATGCA CCCTGAAACA GCCCCAGAAC GTTTGCAACA ATGGCTCAAA ACCTTGCGTT CCACCCCGCG CCACGACGAC CCCGTGCCAT TAGCTTGGCA TCATGTGCGC AACCATTTTC GCATTTCTAG CAAACTTGAA TCAGAATTAT TGGCAGGAGT CGAGATGGAT CTCTCGATTA ATCGCTATGC GACCTTTGCC GATTTATGGC TCTATTGCTA TCGTGTGGCT TCGGTGGTCG GGTTGCTCAG CATGCAGGTA ATTGGCTACG CTGCAGGAGC CGTGCCCTAC GCAATCAAAT TAGGCGTGGC CTTGCAATTA ACCAACATTC TGCGCGATAT TGGCGAAGAT GCGCGGCGCA ACCGAATTTA CCTACCAAAA GAAGATCTAG AGCGCTTCGG GGTGAGCGAA CAAGATATTT TGAATGGGGT GCGCAGCCCA CAGATGCAGC AATTATTACA ATTTGAAATG CAGCGGGCGC ATCAACTCTA CGATGAAAGT TGGCAGGGCA TCGGGTTATT GCACCCCGAT TCACGCTTTG CCGTGGCAAC CGCCGCAACT GTCTATCGTG GCATTCTTGA TCAGATTGTG CGCAATAACT ATGATGTGTT TAATCGACGC GCCAGCCTCT CGTTAGGGGC AAAATTGGCT TATTTACCCA AAATTCTCTG GCGCTTGCGC AAGCTCAACC GTGAGTTTCC GCTTGATCAA TGGGGGCAAG TATGA
|
Protein sequence | MSWPSLDPQM CAGLTLPAQF IPTVAECVSH QPSNQWLNHA YAHCTELTNV HSKSFYFSTQ LLPARKRDAI RVLYAFCRTS DDLVDMHPET APERLQQWLK TLRSTPRHDD PVPLAWHHVR NHFRISSKLE SELLAGVEMD LSINRYATFA DLWLYCYRVA SVVGLLSMQV IGYAAGAVPY AIKLGVALQL TNILRDIGED ARRNRIYLPK EDLERFGVSE QDILNGVRSP QMQQLLQFEM QRAHQLYDES WQGIGLLHPD SRFAVATAAT VYRGILDQIV RNNYDVFNRR ASLSLGAKLA YLPKILWRLR KLNREFPLDQ WGQV
|
| |