Gene Hhal_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1820 
Symbol 
ID4711043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1993704 
End bp1995296 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content71% 
IMG OID639856290 
Producthistidine ammonia-lyase 
Protein accessionYP_001003386 
Protein GI121998599 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.175616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAG TGGACCTGGC CGGCAGCCTG AGCGCCGCAG ACATCGAGGC CATCGGTTAC 
GGCCACAGGA CCGCAACGGT CTCGCCGACC GGCTGGAAGC GGCTGCGCTC GGCCGAGGCA
TACCTCCAGC GCCTGGTGGA TGAGCGCCGC CAGGTTTACG GCGTCACCAC CGGCTACGGC
CCCCTGGCCA CCAGCCGGAT CGACCCCTCG GCCTCACGCA CCCTGCAGCG CAACCTGGTC
TACCACCTGT GCAGCGGCGT CGGCGAGCCG CTCTCCCGCT GTCACACCCG GGCGACGCTT
GGCGCGCGGA TCGCCAGCGT CACCCGGGGC CACTCCGGGG TGACGCCAGC GGTGGTGGAG
CGGCTGCTGG CGTGGCTGGA ACACGACGTG GTGCCAGAGG TGCCGGCCAT CGGCACCGTC
GGCGCCAGCG GCGACCTGAC CCCGCTGGCC CATGTGGCCC GGGCCCTCAT GGGCGAAGGC
CGGGTGTGCA TCAACGGCGG GGAATGGGAG CCCGCCGACG CCGCCCAACG CCGCCTCGGC
TGGGAACCGT GGACCCTGGA CGGAAAGGAC GCCATCGCCC TGGTCAATGG CACCTCCACC
ACCGCGGGCA TATGCGCCGT GAACGGTGCA GGTGCTGAAC GTGCCGCCGG GGTCTGTGCG
GTGCTGGGGA TGGTTTACGC TGAGCTTCTC GGTGGCCATG CCGAGGCCTT CCAGCCGGCC
ATCGGAGCCG TCCGGCCCCA CCCCGGGCAG ATGCGCGCCC ACGCCTGGCT CACCGCTCTT
GCCGAGGACA GCCAGCGCCT CCAACCGTGG ACCGGCACAC CGCCCCGGCT GACCGAGGGC
CAGGAGGCCG TGCTTCCTGA TCAGCCCCTC CCCCAAGACC CCTACTCGAT TCGCTGTCTG
CCCCAGGCGC TGGGCGCGGT GCTGGACAGC ATCACCTTCC ACAACCAGAC CGTAGCCAGC
GAGCTAGACG CCGCCAGCGA CAACCCCCTG CTCTTCCCGG ACGAGGGGCG CGTGCTGCAC
GGCGGCAACT TCTTTGGCCA GCACCTTGCC TTCGCCGCCG ACGCCCTGAA CAATGCCGTG
GTGCAGCTGG CGTTACACAG CGAACGGCGC ATCAGCCGCA TCACCGACTC AACCCGCAGC
GGCTTTCCGG CCTTCATGCA GCCGCGCCAG ACCGGTTTGC ACAGCGGCTT CATGGGGGCC
CAGGTCACGG CCTCGGCCCT GGTGGCCGAG ATGCGGACCG GGGCCCACCC AGCCTCCATC
CAATCGATAC CGACCAACGC CGACAACCAG GACATCGTCC CCATGAGCAC CCGCGCAGCG
CGGCAGGCAG CCACCAACCT GGACCATCTG CAGCGGATCT TGGCCATCGA GGCGCTGGTG
CTGGCGCAAG GCCTCGAGCT GGCCGATGGT GTCGGGTTTA GCAGCAGCGC GCGGCGTACC
CTGGGATGGG TACGCGAACT GGCCCCACCG CTGGAGGACG ATCGCCCGCT GGCCGAGGAG
ATCGCCCGCG TTGCTGCTGC GCTGGCCACG CCGTACCAAG CCCACCGACT GGTTGCCGGG
CTTCCGGGCG CGCCCCCGGG GCCAGCCTCC TGA
 
Protein sequence
MAEVDLAGSL SAADIEAIGY GHRTATVSPT GWKRLRSAEA YLQRLVDERR QVYGVTTGYG 
PLATSRIDPS ASRTLQRNLV YHLCSGVGEP LSRCHTRATL GARIASVTRG HSGVTPAVVE
RLLAWLEHDV VPEVPAIGTV GASGDLTPLA HVARALMGEG RVCINGGEWE PADAAQRRLG
WEPWTLDGKD AIALVNGTST TAGICAVNGA GAERAAGVCA VLGMVYAELL GGHAEAFQPA
IGAVRPHPGQ MRAHAWLTAL AEDSQRLQPW TGTPPRLTEG QEAVLPDQPL PQDPYSIRCL
PQALGAVLDS ITFHNQTVAS ELDAASDNPL LFPDEGRVLH GGNFFGQHLA FAADALNNAV
VQLALHSERR ISRITDSTRS GFPAFMQPRQ TGLHSGFMGA QVTASALVAE MRTGAHPASI
QSIPTNADNQ DIVPMSTRAA RQAATNLDHL QRILAIEALV LAQGLELADG VGFSSSARRT
LGWVRELAPP LEDDRPLAEE IARVAAALAT PYQAHRLVAG LPGAPPGPAS