Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0629 |
Symbol | |
ID | 5732527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 725085 |
End bp | 726743 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641277756 |
Product | urocanate hydratase |
Protein accession | YP_001543405 |
Protein GI | 159897158 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.317757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTCAT CGCGCATTGT GCGTGCACCA CGCGGCTCAG AATTATCGTG CAAAGGGTGG GCGCAAGAAG CAGCGTTGCG AATGCTGATG AATAATCTTG ATCCTGATGT GGCCGAAGAT CCGCAAAATT TGATTGTCTA CGGCGGCACG GGGAAGGCTG CCCGCAATTG GCAATGCTTC GATGCGATTG TGCGTTCGTT GCAAGAACTC AACGATGATG AAACCTTGTT GGTGCAATCG GGCAAACCTG TTGCTGTATT TCGTAGCCAT CGCGATGCTC CACGGGTGCT GATTGCCAAT TCGATGCTCG TGCCACATTG GGCCACATGG GAGAACTTCC GCGAATTGGA GCAGGCTGGC TTGACGATGT ATGGCCAAAT GACCGCTGGC TCGTGGATTT ATATTGGTAC ACAAGGAATT TTGCAAGGCA CTTATGAGAC ACTGGCCGCC ATTGCCCGCC AACATTTTGG TGGCTCGTTG CGCGGTCGTT GGACGCTCAC CGCTGGCCTT GGCGGTATGG GCGGCGCACA ACCTTTGGCC GTCACCATGA ACGATGGCGT GGCCTTGGTG GTTGAAGTTG ATCGCCAGCG CATGCAGCGC CGCTTGGATA CGCGCTATCT CGACGTGGCG GTTGATACGC TTGAAGAAGC CATGACCTTG GTTGATGAAG CGGTGCGCGA CGGCAAAGCA CTTTCGGTTG GTTTATTGGG CAACGCCGCC GAAGTCTTTG GCGAATTGTA TAAGCGTGGT GTGCGCCCCG ATATTGTGAC CGACCAAACC AGTGCCCACG ACCCGCTTGA GGGCTATGTG CCAGCTGGCA TGAGCCTTGA GCAGGCACTC GAATTGCGTC AACGCGACCC CGAAGAATAT GTCAAGCATT CAACTGCTTC AATGGTTGAG CATGTTAAAG CTATGGTTGC CTTCGCTGAT GCTGGCTCGA TCGTGTTTGA TTATGGCAAT AATTTGCGTG GTGTAGCTAA GGCTGCTGGT TATGATCGAG CATTTGCCTA TCCTGGCTTT GTGCCTGCCT ATATTCGCCC ATTGTTCTGC GAAGGCAAAG GGCCATTCCG TTGGGCAGCG CTTTCGGGCG ACCCCGCTGA TATTGCCAAA ACCGATGAAG CCTTGCTCGA ATTGTTCCCA GAGGATCAAG CATTGCATCG CTGGATTCGC GCCGCTCAAG AGCGGGTTCA ATTCCAAGGT TTGCCCGCCC GCATTTGCTG GCTCGGCTAT GGCGAACGGG CCAAGGCTGG CGCGTTATTC AACAAATTGG TGCGTGATGG CGTTGTGAGT GCGCCAATCG TGATTGGACG CGACCACCTC GATTGTGGTT CAGTCGCTTC GCCCAACCGC GAAACCGAAG CTATGCGCGA TGGCTCCGAT GCAATTGGCG ATTGGCCCAT TTTGAATGCG ATGATCAATG CGGTCAATGG TGCAACCTGG GTCAGCGTGC ATCATGGCGG CGGCGTTGGC ATCGGCTATT CGCTGCATGC TGGCATGGTG ATTGTGGCTG ATGGCACTGC TGAAGCCGAT CACCGCCTAG AGCGGGTGCT CACCAGCGAT CCGGGCATGG GCGTGGTGCG TCACGTTGAT GCAGGCTACG ATGAAGCAAT TGCCGTAGCC CAAGAGCGCA ACGTGCATAT TCCAATGCTG AAACAATAG
|
Protein sequence | MTSSRIVRAP RGSELSCKGW AQEAALRMLM NNLDPDVAED PQNLIVYGGT GKAARNWQCF DAIVRSLQEL NDDETLLVQS GKPVAVFRSH RDAPRVLIAN SMLVPHWATW ENFRELEQAG LTMYGQMTAG SWIYIGTQGI LQGTYETLAA IARQHFGGSL RGRWTLTAGL GGMGGAQPLA VTMNDGVALV VEVDRQRMQR RLDTRYLDVA VDTLEEAMTL VDEAVRDGKA LSVGLLGNAA EVFGELYKRG VRPDIVTDQT SAHDPLEGYV PAGMSLEQAL ELRQRDPEEY VKHSTASMVE HVKAMVAFAD AGSIVFDYGN NLRGVAKAAG YDRAFAYPGF VPAYIRPLFC EGKGPFRWAA LSGDPADIAK TDEALLELFP EDQALHRWIR AAQERVQFQG LPARICWLGY GERAKAGALF NKLVRDGVVS APIVIGRDHL DCGSVASPNR ETEAMRDGSD AIGDWPILNA MINAVNGATW VSVHHGGGVG IGYSLHAGMV IVADGTAEAD HRLERVLTSD PGMGVVRHVD AGYDEAIAVA QERNVHIPML KQ
|
| |