Gene Haur_0520 details

Gene Information

Locus tag: Haur_0520
Symbol:
ID: 5732436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 604739
End bp: 606220
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 50%
IMG OID: 641277647
Product: UbiD family decarboxylase
Protein accession: YP_001543297
Protein GI: 159897050
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
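
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions, although the values in this record are consistent with them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 604739
END_BP = 606220
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```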


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGTCAACAA AAATACCTAC TACTTTTGGT GATTTACGCG AATGGATCGC CTTTTTGGAA 
AAACGCGGCG AATTGAAACG GATCAAAACG CCGGTTTCGG CTGATCTTGA AATCACTGAA
ATTACTGATC GCGTTTCTAA AATGAAGCAA GGTCAAGGAA ATGTCGCTTT ACTCTTTGAA
AATGTGATTG GCTCAGATCT GCCAGTTTTG ATCAATGGAG TGGGCACTGA GCAACGCATG
GCTTGGGCCT TGGGCTTAGA AAAACTCGAC GATCTGCGGG CACGCTTGGC GAGTGTGGTT
AAGCCTGAAG TTCCCGAGGG GGTTTTCGAC AAGCTTAAAA AAGTCAGCGA GCTTTCAGAA
ATTGTGCGCT ATCGGCCTAA AACGGTTGCC AGCGCTCCCT GCCAAGATAT TGTCTGGACG
GGCGACCAGA TTGATTTGAA TAAATTGCCA ATTTTAAAAT GCTGGCCCGA TGATGGTGGC
CGCTACGTCA CCCTAACCAC AGTCATTTCA CGCGATCCCT ACAAAGGCAT TCGCAATGTG
GGGATGTATC GGGTGCAGGT TTACGATGAA AAAACCGTCG GCATGCACTG GCAAATTCAC
AAAGGTGGCA CTGAACATCA ACGCGAGGCG CTGCGCAAAG GTGGGGTGAA ATTGCCGGTT
GCCGTAGCAA TTGGCGGCGA TTATGCCACG ATCTACTCGG GCTCGGCTCC ACTGCCACCA
GGCATCGATG AAATTATGTT GGCTGGTTGG TTGCGGCGCG AACGGGTTGA GATGGTCAAA
TGCAAAACGA TCGATCTCGA AGTGCCCGCC AATGCCGAAA TTATTCTCGA AGGTTATGTT
GACCCCAGCG AAAGTCGGCT TGAAGGGCCA TTCGGCGATC ATACTGGCTA CTATTCGCTG
GCCGATCAAT ATCCCGTGAT GCACCTGACC GCCATCACCA TGCGCAAGGA TGCGATTTAT
CCGACGACGA TTGTGGGTTA TCCACCGCAA GAAGATTATT GGCTTGGCAA AGCGACTGAG
CGCTTGTTCT TGCCATTGAT GCAGTTGGTC GTGCCTGAGG TGATCGATGT CAATATGCCT
GCCGAAGGGA CGTTCCATAA TCTATTGGTG GTCAGCATCA AGAAAAAATA CCCCGGCCAA
GTGCGCAAAG TGATGTATGG CCTGTGGGGC TTGATGCTCA TGTCGTTGAC CAAATTTATT
ATTGTGGTCG ATGAAGATAT TGATGTACAG GATATGAACC AAGTCTTGTT TCATGTCACA
TCGAATGTTG ATCCGCAGCG CGATACGGTG ATTGTGGAAG GGCCACTTGA TGCGCTCGAC
CACTCGGCAG ATCATTTTGC TTATGGCCAC AAAATGGGGA TTGATGCTAC CCGCAAGCGC
CAAGATATCG ATCGGTTTCC GCGTGAATGG CCTCAAGACA TTCGCATGAC CCAATCGATT
GTGGATCGGG TGACCAAACG CTGGCGTGAA TATGGCTTCT AG
 
Protein sequence
MSTKIPTTFG DLREWIAFLE KRGELKRIKT PVSADLEITE ITDRVSKMKQ GQGNVALLFE 
NVIGSDLPVL INGVGTEQRM AWALGLEKLD DLRARLASVV KPEVPEGVFD KLKKVSELSE
IVRYRPKTVA SAPCQDIVWT GDQIDLNKLP ILKCWPDDGG RYVTLTTVIS RDPYKGIRNV
GMYRVQVYDE KTVGMHWQIH KGGTEHQREA LRKGGVKLPV AVAIGGDYAT IYSGSAPLPP
GIDEIMLAGW LRRERVEMVK CKTIDLEVPA NAEIILEGYV DPSESRLEGP FGDHTGYYSL
ADQYPVMHLT AITMRKDAIY PTTIVGYPPQ EDYWLGKATE RLFLPLMQLV VPEVIDVNMP
AEGTFHNLLV VSIKKKYPGQ VRKVMYGLWG LMLMSLTKFI IVVDEDIDVQ DMNQVLFHVT
SNVDPQRDTV IVEGPLDALD HSADHFAYGH KMGIDATRKR QDIDRFPREW PQDIRMTQSI
VDRVTKRWRE YGF
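
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on two facts about NCBI translation table 11: its codon-to-amino-acid assignments match the standard code, and GTG is one of its permitted start codons, which is why the protein begins with M even though the gene begins with GTG. The helper name `translate_cds` is hypothetical, and the code assumes an unambiguous A/C/G/T sequence in frame from its first base.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate_cds("GTGTCAACAA AAATACCTAC TACTTTTGGT"))  # MSTKIPTTFG
```

Passing the full gene sequence, spaces and line breaks included, to the same function should give a string that can be compared directly with the protein sequence once its spaces are removed.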