Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2092 |
Symbol | |
ID | 5704671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2407658 |
End bp | 2409265 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271577 |
Product | histidine ammonia-lyase |
Protein accession | YP_001536948 |
Protein GI | 159037695 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00926159 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCGG TCGAAACCAG CGCGTCCGTC GACTTCGACG GCGAGAACTT GGACGTGCCA GCTCTCCGGC GGGTCGCCGA GCAACGGGTG CCGTGCCGGG TGCCGGCGAG CTCGTTGACC AAGGCGGCCA TGAGCCGGAA GCTGTTCGAG GACACGATCC GTCAGGACGT CCCGGTCTAC GGCGTCACCA CCGGGTACGG CGAAATGATC TACATGCTGG TCGCGCCGGA ACACGAGGTC GAGTTGCAGA CCAACCTGGT TCGCAGCCAC AGTGCCGGCG TCGGACCCGC GTTCTCGGAG AACGAGGCCC GGGCGATCGT CGGGGCGCGC CTGAACGCCC TGGCAAAGGG GTACTCGGCG GTGCGACCCG AGATTTTGGA GCGGCTGGCG CTGTATCTCA ACCTCGGTAT CACGCCGGCC ATCCCGGAGA TCGGTTCACT CGGGGCCAGC GGTGACCTCG CTCCACTCGC GCACATCGCC AGCACCGTCA TCGGCGAGGG GTACGTGCTA CGTGACGGCC GGCGGGTACG CACCGGCGAC GTGCTACGCG AGTTCGGGAT CGAGCCGCTG CAGCTCCGGT TCAAGGAGGG CCTTGCCCTG ATCAACGGCA CATCGGCAAT GACCGGCCTG GGAGCCCTGG TGGTGGACCA GGCGATGATC CAGGTACGCC AGGCCGAGAT CGTCGCGGCG CTGGTGATCG AGGGTCTGCG CGGGTCGACC GGACCGTTCC TACCGGAGGG ACACGACGTG GCCCGGCCGC ATGCCGGCCA GATCGACAGC GCGGCGAACA TGCGGACGCT GATGCAGGGC AGCAGGCTGA CGGTGGAGCA CGCCGAGTTG CGCCGAATGG TGCAGGAGAG CCGGTCGGCC GAGGACAGCG TGCAACGTAC CAACCTGTAC ATGCAGAAGG CCTACTCGCT GCGTGCCGTC CCGCAGGTGC TCGGAGGGGT ACGCGACACA CTCACCCATG CCCGGACCAA GCTCGACATC GAACTCAACT CCGCCAACGA CAACCCGCTG TTCTTCGAGG GGCGGGAGGT GTTCCACGGG GCGAACTTCC ACGGTCAGCC GGTCGCGTTC GCGATGGACT TCGTCACGAT CGCGTTGACC CAGCTCGGGG TGCTGTCTGA GCGCCGGACG AACCGGCTGC TCAACCGGCA CCTCAGTTAC GGGCTGCCGG AGTTCCTGGT GGCCGGCGAT CCGGGCCTGC ACAGCGGATT CGCCGGGGCG CAGTACCCTG CGACCGCGCT GGTCGCGGAG AATCGAACGA TCGGTCCGGC CAGTGCCCAG AGCATCCCGT CCAACGGCGA CAACCAGGAC ATCGTCAGCA TGGGCCTCAT CGCCGCCCGT AACGCGCGCC GGGTGCTGAC CAACAACGAC CAGATCCTCG CGGTGGAACT GCTCGCCGCC GCCCAGGCGG TCGACCTCGC CGACCGTAGC GCCGGGTTGA GCCGTGCGGC CCGAGCGGTG TACGACACGG TGCGGCGGGT GGTTCCGGTG CTGGACCAGG ACCGCTACAT GGCCGACGAC ATCGAACTGG TCGCCGACAT GCTCACCCAC GGCGAGTTGG TCGACGCGGT CGAGGCGGTC AACGTGACGT TGCACTGA
|
Protein sequence | MTAVETSASV DFDGENLDVP ALRRVAEQRV PCRVPASSLT KAAMSRKLFE DTIRQDVPVY GVTTGYGEMI YMLVAPEHEV ELQTNLVRSH SAGVGPAFSE NEARAIVGAR LNALAKGYSA VRPEILERLA LYLNLGITPA IPEIGSLGAS GDLAPLAHIA STVIGEGYVL RDGRRVRTGD VLREFGIEPL QLRFKEGLAL INGTSAMTGL GALVVDQAMI QVRQAEIVAA LVIEGLRGST GPFLPEGHDV ARPHAGQIDS AANMRTLMQG SRLTVEHAEL RRMVQESRSA EDSVQRTNLY MQKAYSLRAV PQVLGGVRDT LTHARTKLDI ELNSANDNPL FFEGREVFHG ANFHGQPVAF AMDFVTIALT QLGVLSERRT NRLLNRHLSY GLPEFLVAGD PGLHSGFAGA QYPATALVAE NRTIGPASAQ SIPSNGDNQD IVSMGLIAAR NARRVLTNND QILAVELLAA AQAVDLADRS AGLSRAARAV YDTVRRVVPV LDQDRYMADD IELVADMLTH GELVDAVEAV NVTLH
|
| |