Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1970 |
Symbol | |
ID | 4710337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2171291 |
End bp | 2172850 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639856443 |
Product | Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase |
Protein accession | YP_001003536 |
Protein GI | 121998749 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAG AGTCGGAACC GGAACGGACA GAACAGGACG AGCACCACCT GCGCCTGCGC AACATGGTCC TTGAGGACTA CGACGATGTC GCGCACATCA TGGACCAAGT GTATCCCGAT ATGGACGGGG CGTGGTCGCG GGAACAGTTC GCGGCGCAGA TCAATCGCTT CCCGGAGGGG CAGATCTGCA TCGAGGACAA CGGCCGCGTG GTTGCCGGCG CCATCACGCT CATCGTGGAC TACGGTCGCT TCGGCGACAA GCACACCTAC GAGGAGATCA CCGGCGGCGG CTACCTGACC ACCCACGACC CCAACGGCGA CGTCCTTTAC GGCGTGGATA TCTTCGTCGA CCCGGAGTAC CGCTCGATGC GCCTGGGGAG GCGACTCTAC GACGCCCGCA AGGAGCTGTG CCGGCGTCTG AACCTGCGCG GCATCGTCGC CGGCGGGCGC ATCCCCGGCT ACACCCGGCA CGCGGACGAG ATGTCGCCGG AGGCCTACAT CGAGCTGGTC AAGCGCCGCG AGATCCATGA CCCGATCCTC TCGTTCCAGC TCGCCAACGA CTTCCACGTG CGGCGGATCA TCACCGATTA CCTGCCCTCG GACCGGGACT CGTACGCCTA CGCGACCCTG GTGCAGTGGA ACAACATCTT CTACGAACCC AAGGAACAGC CGCTGATCAC CCGGCGCAAG GCCGTGGTCC GGGTCGGCAC TGTGCAGTGG CAGATGCGCC CGATGAGCTC GGTGGACGAG CTGATCGATC AGGTGGAGTT CTTCGTCGAC GCCCTGGCCG GCTACAACGC CGACTTTGCA CTGTTCCCGG AGTTCTTCAA CGGACCACTG CTTTCGCTGT TCAACCAGGA GAACCCGGCC GAGGCCATCC GCGGCCTGGC GCAGTACACC GAGGAGATCG TCGACGAGAT GTCGCGCCTG GCGGTCTCCT ACAACATCAA TATCATCGCC GGCTCGATGC CCGTCTACCA GGAGCAGTCG CTCTACAACG TCTCCTACCT CTGCCGGCGC GACGGCACGA CGGATGAGCA GCACAAGCTG CACATCACCC CCGACGAGCG CTCCTACTGG GGCGTGCGGG GCGGTGACGA ACTCAAGGTC TTCGAGACCG ACATCGGCAA GATCGGCATC CTGGTCTGCT ACGACTCCGA GTTCCCAGAG CTGCCGCGGA TCCTCAATCA GAAGGGGGCG CGCATCCTCT TCGTGCCCTA CTGGGTCGAC ACCCAGACCG GCTATCTGCG CGTCCGCCGC TGCGCCCAGG CACGGGCCAT CGAGAACGAG TGCTATGTGG CGATCACCGG CTCCGTGGGC AACCTGCCCA AGGTAGAGAA CATCGACATC CAGTACTCGC AGTCGGCGGT GTTCTCGCCG GCGGACTTCG CCTTCCCCCA CGACGCCATC GTCGCGGAGA CCACCCCGAA CACCGAGATG ACGCTGATCG TCGACCTGGA TCTGGACAAG CTCACCGAGA TCCGCAACGA GGGCTCGGTC CGCAATTACA AGGACCGTCG CCTGGATCTC TACCGCATCC AGTGGCTGGG CCGGGACTGA
|
Protein sequence | MQEESEPERT EQDEHHLRLR NMVLEDYDDV AHIMDQVYPD MDGAWSREQF AAQINRFPEG QICIEDNGRV VAGAITLIVD YGRFGDKHTY EEITGGGYLT THDPNGDVLY GVDIFVDPEY RSMRLGRRLY DARKELCRRL NLRGIVAGGR IPGYTRHADE MSPEAYIELV KRREIHDPIL SFQLANDFHV RRIITDYLPS DRDSYAYATL VQWNNIFYEP KEQPLITRRK AVVRVGTVQW QMRPMSSVDE LIDQVEFFVD ALAGYNADFA LFPEFFNGPL LSLFNQENPA EAIRGLAQYT EEIVDEMSRL AVSYNINIIA GSMPVYQEQS LYNVSYLCRR DGTTDEQHKL HITPDERSYW GVRGGDELKV FETDIGKIGI LVCYDSEFPE LPRILNQKGA RILFVPYWVD TQTGYLRVRR CAQARAIENE CYVAITGSVG NLPKVENIDI QYSQSAVFSP ADFAFPHDAI VAETTPNTEM TLIVDLDLDK LTEIRNEGSV RNYKDRRLDL YRIQWLGRD
|
| |