Gene Information |
Locus tag | Haur_0490 |
Symbol | |
ID | 5732404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 571828 |
End bp | 572697 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277616 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001543269 |
Protein GI | 159897022 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
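The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a stop codon; the record does not state either convention explicitly. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 571828
END_BP = 572697
GENE_LENGTH_BP = 870
PROTEIN_LENGTH_AA = 289


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 572697 - 571828 + 1 gives 870 bp, and 870 / 3 - 1 gives 289 aa, which agrees with the Gene Length and Protein Length rows.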
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000477991 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTTG TAACCCTCAA AACTGCGGTT GGGCCACGTC CAGGCATTGT GCTTGGCGAT CGGGTGATGT TGCTCGATGG CTATCAATCG CTCCAAGCCT TAATCGAAGC AGGCGATGCT GGCTTGGACA CAATCAAGGC GGCCTTGCCC GACTATCAAA GCCGTGCAGG CATCCAATTA AATATGGATC AATTGTTGGC TCCCTTGCCC CGCCCATTGA AAAATGTGTT TTGTGTGGGC CTTAATTATG CAGCCCACGC TCGTGAATCG TTGCAAGCCA AAGGGCTAGA AGTCAAAATG CCCGAGCATC CGGTATTTTT CACCAAACCA CCAACCGCCA TCAACAGCCC AACTGGCGAA ATTGTGATTG ATCCAGCAGT TTCCGAACGG ATTGATTGGG AAGTTGAATT GGGTGTGGTG ATCGGCAAAG CTGGCAAAAA CATCAGCCAC GAGCAAGCCA TGGAGCATGT TTGGGGCTAT ACGGTGATTA ACGATGTTTC GGCGCGAGAT TTACAAATGC GCCATCAGCA ATTTTTCAAA GGCAAAGCGC TCGATGGCTC ATGCCCAATG GGGCCGTGGA TCATCACCAG CGATGAGTTG ACCGACCCGC ATAATTTGGT GGTTCGGCTA CGCGTCAACG GCGAGATCAA ACAGGAGTCT AATACCAACG ATCTTATTTT TAATATTCCT ACATTAATTC ATGTGCTTTC CCAAGGCATG ACCCTTGAGC CAGGCGATAT TATTGCAACT GGCACACCGG CTGGCGTAGG TTTTGCGCGT ACTCCCCAAG AATTTTTACG CCCAGGCGAT TTGCTCGAAA CCGAAGTTGA GGGCATTGGT ATTCTTCGTA ACCCTGTCGT GGCAGGCTAA
|
Protein sequence | MRFVTLKTAV GPRPGIVLGD RVMLLDGYQS LQALIEAGDA GLDTIKAALP DYQSRAGIQL NMDQLLAPLP RPLKNVFCVG LNYAAHARES LQAKGLEVKM PEHPVFFTKP PTAINSPTGE IVIDPAVSER IDWEVELGVV IGKAGKNISH EQAMEHVWGY TVINDVSARD LQMRHQQFFK GKALDGSCPM GPWIITSDEL TDPHNLVVRL RVNGEIKQES NTNDLIFNIP TLIHVLSQGM TLEPGDIIAT GTPAGVGFAR TPQEFLRPGD LLETEVEGIG ILRNPVVAG
|
| |
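As a rough way to check the Sequence fields against each other, the following sketch translates a gene sequence and computes its GC fraction. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments that translation table 11 shares with the standard genetic code. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, chosen for this illustration only.

```python
from itertools import product

# Codon order used by the NCBI genetic code listings: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record above.
    print(translate("ATGCGATTTG TAACCCTCAA AACTGCGGTT"))
```

Passing the full Gene sequence value to `translate` and comparing the result with `clean` applied to the Protein sequence value would test the whole record; `gc_fraction` on the same input can be compared with the GC content row.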