Gene Sare_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0996 
Symbol 
ID5704678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1119101 
End bp1120639 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content73% 
IMG OID641270511 
Producthistidine ammonia-lyase 
Protein accessionYP_001535898 
Protein GI159036645 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.168418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACCG TAGTCATCCA ACCAACCGGG GTCACCCCCG CCGACGTGCT CGCCGTCGCC 
CGCGGCACCG CCAAGGTCGT ACTCGACCCG GCGGCGATCG ACGCGATGGT CGCCAGCCGG
TCCGTCGTGG ACGGCATCGA GGCCTCCGGC CAGCCGGTGT ACGGCGTCAG CACCGGTTTC
GGGGCCCTCG CCAACACGTT CGTCGCCCCG CAGCGGCGGG CGGAGCTACA GCACGCGCTG
ATCCGTTCAC ACGCCGCCGG GGTGGGCTCC GCCATGCCCC GCGAGGTGGT CCGGGCGATG
ATGCTGCTGC GCGTACGGTC CCTCGCGCTC GGCCGCTCCG GCGTCCGGCC GATCGTCGCC
ACGGCACTGG TGGACCTGCT CAACAACGAC GTCACCCCGT GGGTACCCGA ACACGGGTCG
CTGGGAGCCT CCGGGGACCT GGCGCCGCTG GCGCACTGCG CGCTGGCGCT GCTCGGCGAG
GGCTGGGTGC TGGGCGCGGC CGGTGACCGG ATCCCGGCCG GCGAGGCGCT GCGCCGGGCC
GGTCTCACCC CGATCGAGCT GGCGGCCAAG GAGGGGCTGG CCCTGATCAA CGGCACCGAC
GGGATGCTCG GCATGCTGCT GCTGGCCAAC CACGACGCCA CGCACCTGTT CACCCTGGCC
GACGTCACGG CCGCCCTGGC CATCGAGGCG ATGCTCGGGT CGGAACGACC TTTCCGACCC
GAGTTGCACA CGATCCGCCC ACACCCCGGT CAGGCCGCCT CGGCGGCGAA CATCCACCGC
CTGCTCCAGG ACTCGGCGGT GATGGAATCG CACCGCGACG ACGTGACGCA CGCGGTGCAG
GACGCATATT CGATGCGATG CGCGCCGCAG GTCGCCGGGG CCGCCCGCGA CACCCTGGAC
TTCGCCCGGC AGGTGGCGGG CCGGGAACTG ATCTCGGTGG TGGACAACCC GGTGGTGCTA
CCGGACGGCC GGGTCGAGTC GACCGGGAAC TTCCACGGCG CACCACTCGG TTTCGCCGCC
GACTTCCTCG CCGTCGCCGC CGCCGAGGTC GGCGCGATCG CCGAGCGACG GGTGGACCGG
CTGCTCGACG TGACCCGCTC CCGCGACCTA CCGGCGTTCC TCTCCCCCGA CGCCGGCGTC
AACTCAGGGC TGATGATCGC CCAGTACACG GCGGCGGGCA TCGTCGCGGA GAACCGCCGG
CTCGCCGCAC CCGCCTCGGT GGACTCGCTG CCCACCAGCG GAATGCAGGA GGACCACGTG
TCGATGGGCT GGGCGGCGAC ACGGAAACTG CGGACCGTCC TGGACAACCT GACCAGTCTG
CTCGCGGTCG AGCTGCTCGC CGCGGTCCGC GGGCTCCAAC TGCGGGCCCC GCTGCGACCG
TCCCCGGCCG GGCGGGCCGC CATCGCCGCG TTGGCCGGGG CCGCCGGGGA TCCCGGCCCG
GACATCTTCC TCGCTCCGGT GCTGGAGACC GCCCGTACGG TGGTGGCCGG CCCGGAGTTG
CGCGCCGCGA TCGAACGTGA GGTCGGCGCG CTGGCCTGA
 
Protein sequence
MSTVVIQPTG VTPADVLAVA RGTAKVVLDP AAIDAMVASR SVVDGIEASG QPVYGVSTGF 
GALANTFVAP QRRAELQHAL IRSHAAGVGS AMPREVVRAM MLLRVRSLAL GRSGVRPIVA
TALVDLLNND VTPWVPEHGS LGASGDLAPL AHCALALLGE GWVLGAAGDR IPAGEALRRA
GLTPIELAAK EGLALINGTD GMLGMLLLAN HDATHLFTLA DVTAALAIEA MLGSERPFRP
ELHTIRPHPG QAASAANIHR LLQDSAVMES HRDDVTHAVQ DAYSMRCAPQ VAGAARDTLD
FARQVAGREL ISVVDNPVVL PDGRVESTGN FHGAPLGFAA DFLAVAAAEV GAIAERRVDR
LLDVTRSRDL PAFLSPDAGV NSGLMIAQYT AAGIVAENRR LAAPASVDSL PTSGMQEDHV
SMGWAATRKL RTVLDNLTSL LAVELLAAVR GLQLRAPLRP SPAGRAAIAA LAGAAGDPGP
DIFLAPVLET ARTVVAGPEL RAAIEREVGA LA