Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0996 |
Symbol | |
ID | 5704678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1119101 |
End bp | 1120639 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270511 |
Product | histidine ammonia-lyase |
Protein accession | YP_001535898 |
Protein GI | 159036645 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.168418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACCG TAGTCATCCA ACCAACCGGG GTCACCCCCG CCGACGTGCT CGCCGTCGCC CGCGGCACCG CCAAGGTCGT ACTCGACCCG GCGGCGATCG ACGCGATGGT CGCCAGCCGG TCCGTCGTGG ACGGCATCGA GGCCTCCGGC CAGCCGGTGT ACGGCGTCAG CACCGGTTTC GGGGCCCTCG CCAACACGTT CGTCGCCCCG CAGCGGCGGG CGGAGCTACA GCACGCGCTG ATCCGTTCAC ACGCCGCCGG GGTGGGCTCC GCCATGCCCC GCGAGGTGGT CCGGGCGATG ATGCTGCTGC GCGTACGGTC CCTCGCGCTC GGCCGCTCCG GCGTCCGGCC GATCGTCGCC ACGGCACTGG TGGACCTGCT CAACAACGAC GTCACCCCGT GGGTACCCGA ACACGGGTCG CTGGGAGCCT CCGGGGACCT GGCGCCGCTG GCGCACTGCG CGCTGGCGCT GCTCGGCGAG GGCTGGGTGC TGGGCGCGGC CGGTGACCGG ATCCCGGCCG GCGAGGCGCT GCGCCGGGCC GGTCTCACCC CGATCGAGCT GGCGGCCAAG GAGGGGCTGG CCCTGATCAA CGGCACCGAC GGGATGCTCG GCATGCTGCT GCTGGCCAAC CACGACGCCA CGCACCTGTT CACCCTGGCC GACGTCACGG CCGCCCTGGC CATCGAGGCG ATGCTCGGGT CGGAACGACC TTTCCGACCC GAGTTGCACA CGATCCGCCC ACACCCCGGT CAGGCCGCCT CGGCGGCGAA CATCCACCGC CTGCTCCAGG ACTCGGCGGT GATGGAATCG CACCGCGACG ACGTGACGCA CGCGGTGCAG GACGCATATT CGATGCGATG CGCGCCGCAG GTCGCCGGGG CCGCCCGCGA CACCCTGGAC TTCGCCCGGC AGGTGGCGGG CCGGGAACTG ATCTCGGTGG TGGACAACCC GGTGGTGCTA CCGGACGGCC GGGTCGAGTC GACCGGGAAC TTCCACGGCG CACCACTCGG TTTCGCCGCC GACTTCCTCG CCGTCGCCGC CGCCGAGGTC GGCGCGATCG CCGAGCGACG GGTGGACCGG CTGCTCGACG TGACCCGCTC CCGCGACCTA CCGGCGTTCC TCTCCCCCGA CGCCGGCGTC AACTCAGGGC TGATGATCGC CCAGTACACG GCGGCGGGCA TCGTCGCGGA GAACCGCCGG CTCGCCGCAC CCGCCTCGGT GGACTCGCTG CCCACCAGCG GAATGCAGGA GGACCACGTG TCGATGGGCT GGGCGGCGAC ACGGAAACTG CGGACCGTCC TGGACAACCT GACCAGTCTG CTCGCGGTCG AGCTGCTCGC CGCGGTCCGC GGGCTCCAAC TGCGGGCCCC GCTGCGACCG TCCCCGGCCG GGCGGGCCGC CATCGCCGCG TTGGCCGGGG CCGCCGGGGA TCCCGGCCCG GACATCTTCC TCGCTCCGGT GCTGGAGACC GCCCGTACGG TGGTGGCCGG CCCGGAGTTG CGCGCCGCGA TCGAACGTGA GGTCGGCGCG CTGGCCTGA
|
Protein sequence | MSTVVIQPTG VTPADVLAVA RGTAKVVLDP AAIDAMVASR SVVDGIEASG QPVYGVSTGF GALANTFVAP QRRAELQHAL IRSHAAGVGS AMPREVVRAM MLLRVRSLAL GRSGVRPIVA TALVDLLNND VTPWVPEHGS LGASGDLAPL AHCALALLGE GWVLGAAGDR IPAGEALRRA GLTPIELAAK EGLALINGTD GMLGMLLLAN HDATHLFTLA DVTAALAIEA MLGSERPFRP ELHTIRPHPG QAASAANIHR LLQDSAVMES HRDDVTHAVQ DAYSMRCAPQ VAGAARDTLD FARQVAGREL ISVVDNPVVL PDGRVESTGN FHGAPLGFAA DFLAVAAAEV GAIAERRVDR LLDVTRSRDL PAFLSPDAGV NSGLMIAQYT AAGIVAENRR LAAPASVDSL PTSGMQEDHV SMGWAATRKL RTVLDNLTSL LAVELLAAVR GLQLRAPLRP SPAGRAAIAA LAGAAGDPGP DIFLAPVLET ARTVVAGPEL RAAIEREVGA LA
|
| |