Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0520 |
Symbol | |
ID | 5732436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 604739 |
End bp | 606220 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277647 |
Product | UbiD family decarboxylase |
Protein accession | YP_001543297 |
Protein GI | 159897050 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAACAA AAATACCTAC TACTTTTGGT GATTTACGCG AATGGATCGC CTTTTTGGAA AAACGCGGCG AATTGAAACG GATCAAAACG CCGGTTTCGG CTGATCTTGA AATCACTGAA ATTACTGATC GCGTTTCTAA AATGAAGCAA GGTCAAGGAA ATGTCGCTTT ACTCTTTGAA AATGTGATTG GCTCAGATCT GCCAGTTTTG ATCAATGGAG TGGGCACTGA GCAACGCATG GCTTGGGCCT TGGGCTTAGA AAAACTCGAC GATCTGCGGG CACGCTTGGC GAGTGTGGTT AAGCCTGAAG TTCCCGAGGG GGTTTTCGAC AAGCTTAAAA AAGTCAGCGA GCTTTCAGAA ATTGTGCGCT ATCGGCCTAA AACGGTTGCC AGCGCTCCCT GCCAAGATAT TGTCTGGACG GGCGACCAGA TTGATTTGAA TAAATTGCCA ATTTTAAAAT GCTGGCCCGA TGATGGTGGC CGCTACGTCA CCCTAACCAC AGTCATTTCA CGCGATCCCT ACAAAGGCAT TCGCAATGTG GGGATGTATC GGGTGCAGGT TTACGATGAA AAAACCGTCG GCATGCACTG GCAAATTCAC AAAGGTGGCA CTGAACATCA ACGCGAGGCG CTGCGCAAAG GTGGGGTGAA ATTGCCGGTT GCCGTAGCAA TTGGCGGCGA TTATGCCACG ATCTACTCGG GCTCGGCTCC ACTGCCACCA GGCATCGATG AAATTATGTT GGCTGGTTGG TTGCGGCGCG AACGGGTTGA GATGGTCAAA TGCAAAACGA TCGATCTCGA AGTGCCCGCC AATGCCGAAA TTATTCTCGA AGGTTATGTT GACCCCAGCG AAAGTCGGCT TGAAGGGCCA TTCGGCGATC ATACTGGCTA CTATTCGCTG GCCGATCAAT ATCCCGTGAT GCACCTGACC GCCATCACCA TGCGCAAGGA TGCGATTTAT CCGACGACGA TTGTGGGTTA TCCACCGCAA GAAGATTATT GGCTTGGCAA AGCGACTGAG CGCTTGTTCT TGCCATTGAT GCAGTTGGTC GTGCCTGAGG TGATCGATGT CAATATGCCT GCCGAAGGGA CGTTCCATAA TCTATTGGTG GTCAGCATCA AGAAAAAATA CCCCGGCCAA GTGCGCAAAG TGATGTATGG CCTGTGGGGC TTGATGCTCA TGTCGTTGAC CAAATTTATT ATTGTGGTCG ATGAAGATAT TGATGTACAG GATATGAACC AAGTCTTGTT TCATGTCACA TCGAATGTTG ATCCGCAGCG CGATACGGTG ATTGTGGAAG GGCCACTTGA TGCGCTCGAC CACTCGGCAG ATCATTTTGC TTATGGCCAC AAAATGGGGA TTGATGCTAC CCGCAAGCGC CAAGATATCG ATCGGTTTCC GCGTGAATGG CCTCAAGACA TTCGCATGAC CCAATCGATT GTGGATCGGG TGACCAAACG CTGGCGTGAA TATGGCTTCT AG
|
Protein sequence | MSTKIPTTFG DLREWIAFLE KRGELKRIKT PVSADLEITE ITDRVSKMKQ GQGNVALLFE NVIGSDLPVL INGVGTEQRM AWALGLEKLD DLRARLASVV KPEVPEGVFD KLKKVSELSE IVRYRPKTVA SAPCQDIVWT GDQIDLNKLP ILKCWPDDGG RYVTLTTVIS RDPYKGIRNV GMYRVQVYDE KTVGMHWQIH KGGTEHQREA LRKGGVKLPV AVAIGGDYAT IYSGSAPLPP GIDEIMLAGW LRRERVEMVK CKTIDLEVPA NAEIILEGYV DPSESRLEGP FGDHTGYYSL ADQYPVMHLT AITMRKDAIY PTTIVGYPPQ EDYWLGKATE RLFLPLMQLV VPEVIDVNMP AEGTFHNLLV VSIKKKYPGQ VRKVMYGLWG LMLMSLTKFI IVVDEDIDVQ DMNQVLFHVT SNVDPQRDTV IVEGPLDALD HSADHFAYGH KMGIDATRKR QDIDRFPREW PQDIRMTQSI VDRVTKRWRE YGF
|
| |