Gene Haur_0629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0629 
Symbol 
ID5732527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp725085 
End bp726743 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content55% 
IMG OID641277756 
Producturocanate hydratase 
Protein accessionYP_001543405 
Protein GI159897158 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.317757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCAT CGCGCATTGT GCGTGCACCA CGCGGCTCAG AATTATCGTG CAAAGGGTGG 
GCGCAAGAAG CAGCGTTGCG AATGCTGATG AATAATCTTG ATCCTGATGT GGCCGAAGAT
CCGCAAAATT TGATTGTCTA CGGCGGCACG GGGAAGGCTG CCCGCAATTG GCAATGCTTC
GATGCGATTG TGCGTTCGTT GCAAGAACTC AACGATGATG AAACCTTGTT GGTGCAATCG
GGCAAACCTG TTGCTGTATT TCGTAGCCAT CGCGATGCTC CACGGGTGCT GATTGCCAAT
TCGATGCTCG TGCCACATTG GGCCACATGG GAGAACTTCC GCGAATTGGA GCAGGCTGGC
TTGACGATGT ATGGCCAAAT GACCGCTGGC TCGTGGATTT ATATTGGTAC ACAAGGAATT
TTGCAAGGCA CTTATGAGAC ACTGGCCGCC ATTGCCCGCC AACATTTTGG TGGCTCGTTG
CGCGGTCGTT GGACGCTCAC CGCTGGCCTT GGCGGTATGG GCGGCGCACA ACCTTTGGCC
GTCACCATGA ACGATGGCGT GGCCTTGGTG GTTGAAGTTG ATCGCCAGCG CATGCAGCGC
CGCTTGGATA CGCGCTATCT CGACGTGGCG GTTGATACGC TTGAAGAAGC CATGACCTTG
GTTGATGAAG CGGTGCGCGA CGGCAAAGCA CTTTCGGTTG GTTTATTGGG CAACGCCGCC
GAAGTCTTTG GCGAATTGTA TAAGCGTGGT GTGCGCCCCG ATATTGTGAC CGACCAAACC
AGTGCCCACG ACCCGCTTGA GGGCTATGTG CCAGCTGGCA TGAGCCTTGA GCAGGCACTC
GAATTGCGTC AACGCGACCC CGAAGAATAT GTCAAGCATT CAACTGCTTC AATGGTTGAG
CATGTTAAAG CTATGGTTGC CTTCGCTGAT GCTGGCTCGA TCGTGTTTGA TTATGGCAAT
AATTTGCGTG GTGTAGCTAA GGCTGCTGGT TATGATCGAG CATTTGCCTA TCCTGGCTTT
GTGCCTGCCT ATATTCGCCC ATTGTTCTGC GAAGGCAAAG GGCCATTCCG TTGGGCAGCG
CTTTCGGGCG ACCCCGCTGA TATTGCCAAA ACCGATGAAG CCTTGCTCGA ATTGTTCCCA
GAGGATCAAG CATTGCATCG CTGGATTCGC GCCGCTCAAG AGCGGGTTCA ATTCCAAGGT
TTGCCCGCCC GCATTTGCTG GCTCGGCTAT GGCGAACGGG CCAAGGCTGG CGCGTTATTC
AACAAATTGG TGCGTGATGG CGTTGTGAGT GCGCCAATCG TGATTGGACG CGACCACCTC
GATTGTGGTT CAGTCGCTTC GCCCAACCGC GAAACCGAAG CTATGCGCGA TGGCTCCGAT
GCAATTGGCG ATTGGCCCAT TTTGAATGCG ATGATCAATG CGGTCAATGG TGCAACCTGG
GTCAGCGTGC ATCATGGCGG CGGCGTTGGC ATCGGCTATT CGCTGCATGC TGGCATGGTG
ATTGTGGCTG ATGGCACTGC TGAAGCCGAT CACCGCCTAG AGCGGGTGCT CACCAGCGAT
CCGGGCATGG GCGTGGTGCG TCACGTTGAT GCAGGCTACG ATGAAGCAAT TGCCGTAGCC
CAAGAGCGCA ACGTGCATAT TCCAATGCTG AAACAATAG
 
Protein sequence
MTSSRIVRAP RGSELSCKGW AQEAALRMLM NNLDPDVAED PQNLIVYGGT GKAARNWQCF 
DAIVRSLQEL NDDETLLVQS GKPVAVFRSH RDAPRVLIAN SMLVPHWATW ENFRELEQAG
LTMYGQMTAG SWIYIGTQGI LQGTYETLAA IARQHFGGSL RGRWTLTAGL GGMGGAQPLA
VTMNDGVALV VEVDRQRMQR RLDTRYLDVA VDTLEEAMTL VDEAVRDGKA LSVGLLGNAA
EVFGELYKRG VRPDIVTDQT SAHDPLEGYV PAGMSLEQAL ELRQRDPEEY VKHSTASMVE
HVKAMVAFAD AGSIVFDYGN NLRGVAKAAG YDRAFAYPGF VPAYIRPLFC EGKGPFRWAA
LSGDPADIAK TDEALLELFP EDQALHRWIR AAQERVQFQG LPARICWLGY GERAKAGALF
NKLVRDGVVS APIVIGRDHL DCGSVASPNR ETEAMRDGSD AIGDWPILNA MINAVNGATW
VSVHHGGGVG IGYSLHAGMV IVADGTAEAD HRLERVLTSD PGMGVVRHVD AGYDEAIAVA
QERNVHIPML KQ