Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1208 |
Symbol | |
ID | 5733101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1391392 |
End bp | 1392420 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278348 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_001543984 |
Protein GI | 159897737 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0490287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTG TTTCATTTCG GCGCTATGGC GAAGGCTCCG AGGCACGAGC CGGGGCTTGG TTGCCCATGG GAATCATCGA TCTACAAGCC GCAGCAGGCT TAGTTTTTGA AGATTTGCCC CACGATTGGT CGCTGATGAG CATGCTCAAA CACGAAGCCG ATGGCTATGG CATCGATGCC GCAATCCAGG TCGTTTCAGC AGTCGTCGAT TTGCTCGGCG GTGGCGGCGA TGGCATCGAA TGGGATGATC CCGATGCGAT CAACAGTATG CTTTCGCTAG GCGGCGAAAC CGTGATTTAC CCGCCTGATA GCGTGCGTTT GTTAGCGCCG ATTCCTCAAC CACCAACGAT TCGCGATTTT TATGCCTTCG AGCAGCATGT GCGTGAAATT CGTGCTCAGC ATGGCCGCTC CGTGCCTAGC ACGTGGTACG ATATGCCAGT CTTTTACTTT GGTAACCCTA CCACCGTGCT TGGGCCAGAT AGCGATCTGG TAATGCCGCG CACCAGTCAA CTTGATTATG AACTGGAAAT TGCAGCAGTT ATTGGCCGGC CATGTCGCGA TATCGAGCCA GATGAAGCTG AATATTATAT TGCTGGCTTG ATGGTCATGA ATGATTGGTC GGCCCGCGAT ATTCAGGCCC GCGAGATGAG CGTTGGCTTG GGGCCAGCCA AGGGCAAAGA TTTTGCCACC TCATTCGGGC CAGCCCTGAT CACGCTTGAC GAAATTGAGG ATAAAGCGCT GGGCGATGGG CGTTACGATT TGGCGATGGT GGTGCGGGTC AATGGTGAAG AGCGTGGTCG TGCGTCGTTC GCCGATATTT ACTATACGCT CGGCGAATTG ATTGCCCACG CTTCACGCGA TGTCACCCTG CTGCCTGGTG AAATTATTGG CTCGGGCACA GTTGGCACTG GCTGTTTGCT CGAAACCACC CACGGCGAAG GGCCATGGCT TGAGGTTGGC GATGTGGTCG AACTCGAAAT CGAACGCATC GGCATCTTAC GCAACACAAT TGTTGATCGC GATAGCTAA
|
Protein sequence | MKFVSFRRYG EGSEARAGAW LPMGIIDLQA AAGLVFEDLP HDWSLMSMLK HEADGYGIDA AIQVVSAVVD LLGGGGDGIE WDDPDAINSM LSLGGETVIY PPDSVRLLAP IPQPPTIRDF YAFEQHVREI RAQHGRSVPS TWYDMPVFYF GNPTTVLGPD SDLVMPRTSQ LDYELEIAAV IGRPCRDIEP DEAEYYIAGL MVMNDWSARD IQAREMSVGL GPAKGKDFAT SFGPALITLD EIEDKALGDG RYDLAMVVRV NGEERGRASF ADIYYTLGEL IAHASRDVTL LPGEIIGSGT VGTGCLLETT HGEGPWLEVG DVVELEIERI GILRNTIVDR DS
|
| |