Gene Information |
Locus tag | Haur_4493 |
Symbol | |
ID | 5736344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5754252 |
End bp | 5755271 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281656 |
Product | polyprenyl synthetase |
Protein accession | YP_001547253 |
Protein GI | 159901006 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0142] Geranylgeranyl pyrophosphate synthase |
TIGRFAM ID | |
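The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon; both assumptions are consistent with the values reported in this record, but they are not stated by the source. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5754252
end_bp = 5755271

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which is
# not counted as an amino acid in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1020, matching "Gene Length | 1020 bp"
print(protein_length)   # 339, matching "Protein Length | 339 aa"
```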
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000291603 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACA TTTCTGACGA ATCGATTCGC CAAGCAATGC AGAGTGCCTT CCCACCTGCC GATGCTCGTG TAACCATGTT TTATGAAATG CAGGAATATC ATTTGGGCTG GCGTAATGCT CAGCTTGAGC CAACCCAAGC CGATAGTGGC AAGTTACTGC GTCCGCGCTT TTGTTTGCTC GCCTGCGCTG CGGTTGGCGG CGATCCCCAA CAGGCTGAGC CTTTAGCCGC CGCAATTCAG CTGCTCCATG ATTTTTCGCT TATCCACGAT GATATTGAAG ATCACAGCCC AACTCGCCGT GGCCGCGAAA CCGTGTGGAA ATTGTGGGAA GTTCCACAAG CGATCAATGT TGGCGATGGT ATGTTTACGC TTGCCCAGCT TTCATTATTT CGTTTGGCAG AAGTTGGGGT CGAATCATCG GTCGTGGTCG AAATTGCACG GCGCTTTAAT CAAACCATTA TTCGTTTATG CGAAGGCCAG TATCTCGATA TGTCGTTTGA GCAGCGCCTC GACATCAGCG AAGGCGATTA TTTGGCGATG ATCAGCCGCA AAACGGCAGC ATTAATTGCC GCTGCTGCTG GTTTGGGGGC AATTTTGGGC AATGCCAACC GCGAACAAGC CGCCGCCCTG TATAATTGGG GTGAAGCCTT GGGCTTGGCC TTCCAAATTG AAGATGATAT GCTCGGCATT TGGGGCGCAG AAGCCGTCAC TGGCAAGCCC GATGCTCACG ATATTTGGGG CCGCAAAAAA AGCCTGCCAA TTATCCATGC CCTAGCTCAC GCCGATGCTG AGGATGGCGG CAAGCTGGCG GCGATTTATC AAAAAGAGCC GCTTGAGGCC AGCGATATTC AAATTGTGCT GACAATTTTA GAGCGCACTG GCTCACAAGG CTATACTGCG GGCGTGGCCA AGTTCTATCA CGAGCAAGCC TTAGCCGCCC TAGCCGATTT ACAAGGCGAG GCCGAGCCAA TTGCCGAGTT GCATGCATTA ACCAAACAAT TACTGGGACG AGTGAAATAG
|
Protein sequence | MKHISDESIR QAMQSAFPPA DARVTMFYEM QEYHLGWRNA QLEPTQADSG KLLRPRFCLL ACAAVGGDPQ QAEPLAAAIQ LLHDFSLIHD DIEDHSPTRR GRETVWKLWE VPQAINVGDG MFTLAQLSLF RLAEVGVESS VVVEIARRFN QTIIRLCEGQ YLDMSFEQRL DISEGDYLAM ISRKTAALIA AAAGLGAILG NANREQAAAL YNWGEALGLA FQIEDDMLGI WGAEAVTGKP DAHDIWGRKK SLPIIHALAH ADAEDGGKLA AIYQKEPLEA SDIQIVLTIL ERTGSQGYTA GVAKFYHEQA LAALADLQGE AEPIAELHAL TKQLLGRVK
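The relationship between the gene and protein sequences can be illustrated with a small translation routine. This is a minimal sketch, not the method used by the database: it hard-codes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch does not model). The function names `clean`, `translate` and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Assumption: table 11 uses the same assignments as the standard
# code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence in this record.
gene_start = "ATGAAACACA TTTCTGACGA ATCGATTCGC"
print(translate(gene_start))  # MKHISDESIR
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the complete protein sequence and the reported GC content to be compared against the record.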
|
| |