Gene Sare_2092 details


Gene Information

Locus tag: Sare_2092
Symbol:
ID: 5704671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2407658
End bp: 2409265
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 68%
IMG OID: 641271577
Product: histidine ammonia-lyase
Protein accession: YP_001536948
Protein GI: 159037695
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
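
The length fields above are consistent with the coordinates. The following minimal sketch shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, not stated in the record); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2407658
end_bp = 2409265

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```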


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00926159
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCGG TCGAAACCAG CGCGTCCGTC GACTTCGACG GCGAGAACTT GGACGTGCCA 
GCTCTCCGGC GGGTCGCCGA GCAACGGGTG CCGTGCCGGG TGCCGGCGAG CTCGTTGACC
AAGGCGGCCA TGAGCCGGAA GCTGTTCGAG GACACGATCC GTCAGGACGT CCCGGTCTAC
GGCGTCACCA CCGGGTACGG CGAAATGATC TACATGCTGG TCGCGCCGGA ACACGAGGTC
GAGTTGCAGA CCAACCTGGT TCGCAGCCAC AGTGCCGGCG TCGGACCCGC GTTCTCGGAG
AACGAGGCCC GGGCGATCGT CGGGGCGCGC CTGAACGCCC TGGCAAAGGG GTACTCGGCG
GTGCGACCCG AGATTTTGGA GCGGCTGGCG CTGTATCTCA ACCTCGGTAT CACGCCGGCC
ATCCCGGAGA TCGGTTCACT CGGGGCCAGC GGTGACCTCG CTCCACTCGC GCACATCGCC
AGCACCGTCA TCGGCGAGGG GTACGTGCTA CGTGACGGCC GGCGGGTACG CACCGGCGAC
GTGCTACGCG AGTTCGGGAT CGAGCCGCTG CAGCTCCGGT TCAAGGAGGG CCTTGCCCTG
ATCAACGGCA CATCGGCAAT GACCGGCCTG GGAGCCCTGG TGGTGGACCA GGCGATGATC
CAGGTACGCC AGGCCGAGAT CGTCGCGGCG CTGGTGATCG AGGGTCTGCG CGGGTCGACC
GGACCGTTCC TACCGGAGGG ACACGACGTG GCCCGGCCGC ATGCCGGCCA GATCGACAGC
GCGGCGAACA TGCGGACGCT GATGCAGGGC AGCAGGCTGA CGGTGGAGCA CGCCGAGTTG
CGCCGAATGG TGCAGGAGAG CCGGTCGGCC GAGGACAGCG TGCAACGTAC CAACCTGTAC
ATGCAGAAGG CCTACTCGCT GCGTGCCGTC CCGCAGGTGC TCGGAGGGGT ACGCGACACA
CTCACCCATG CCCGGACCAA GCTCGACATC GAACTCAACT CCGCCAACGA CAACCCGCTG
TTCTTCGAGG GGCGGGAGGT GTTCCACGGG GCGAACTTCC ACGGTCAGCC GGTCGCGTTC
GCGATGGACT TCGTCACGAT CGCGTTGACC CAGCTCGGGG TGCTGTCTGA GCGCCGGACG
AACCGGCTGC TCAACCGGCA CCTCAGTTAC GGGCTGCCGG AGTTCCTGGT GGCCGGCGAT
CCGGGCCTGC ACAGCGGATT CGCCGGGGCG CAGTACCCTG CGACCGCGCT GGTCGCGGAG
AATCGAACGA TCGGTCCGGC CAGTGCCCAG AGCATCCCGT CCAACGGCGA CAACCAGGAC
ATCGTCAGCA TGGGCCTCAT CGCCGCCCGT AACGCGCGCC GGGTGCTGAC CAACAACGAC
CAGATCCTCG CGGTGGAACT GCTCGCCGCC GCCCAGGCGG TCGACCTCGC CGACCGTAGC
GCCGGGTTGA GCCGTGCGGC CCGAGCGGTG TACGACACGG TGCGGCGGGT GGTTCCGGTG
CTGGACCAGG ACCGCTACAT GGCCGACGAC ATCGAACTGG TCGCCGACAT GCTCACCCAC
GGCGAGTTGG TCGACGCGGT CGAGGCGGTC AACGTGACGT TGCACTGA
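
The GC content field in the Gene Information section can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the share of G and C bases over the full gene sequence; the function name `gc_percent` is hypothetical, and the example runs on the first sequence line only, so paste the full sequence to compare against the reported value.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; illustrative input, not the whole gene.
first_line = "ATGACCGCGG TCGAAACCAG CGCGTCCGTC GACTTCGACG GCGAGAACTT GGACGTGCCA"
print(round(gc_percent(first_line), 1))
```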
 
Protein sequence
MTAVETSASV DFDGENLDVP ALRRVAEQRV PCRVPASSLT KAAMSRKLFE DTIRQDVPVY 
GVTTGYGEMI YMLVAPEHEV ELQTNLVRSH SAGVGPAFSE NEARAIVGAR LNALAKGYSA
VRPEILERLA LYLNLGITPA IPEIGSLGAS GDLAPLAHIA STVIGEGYVL RDGRRVRTGD
VLREFGIEPL QLRFKEGLAL INGTSAMTGL GALVVDQAMI QVRQAEIVAA LVIEGLRGST
GPFLPEGHDV ARPHAGQIDS AANMRTLMQG SRLTVEHAEL RRMVQESRSA EDSVQRTNLY
MQKAYSLRAV PQVLGGVRDT LTHARTKLDI ELNSANDNPL FFEGREVFHG ANFHGQPVAF
AMDFVTIALT QLGVLSERRT NRLLNRHLSY GLPEFLVAGD PGLHSGFAGA QYPATALVAE
NRTIGPASAQ SIPSNGDNQD IVSMGLIAAR NARRVLTNND QILAVELLAA AQAVDLADRS
AGLSRAARAV YDTVRRVVPV LDQDRYMADD IELVADMLTH GELVDAVEAV NVTLH
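
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in the usual TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (10-base grouping, line breaks) is stripped first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; compare with the start of the protein.
print(translate("ATGACCGCGG TCGAAACCAG CGCGTCCGTC"))
```

Applied to the full gene sequence, the result can be compared with the protein sequence after removing its spaces and line breaks.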