Gene AnaeK_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_2046 
Symbol 
ID6787897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp2299841 
End bp2301367 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content77% 
IMG OID642763505 
Producthistidine ammonia-lyase 
Protein accessionYP_002134403 
Protein GI197122452 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.034077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCC TCCTCCTCGA CGGCGAGACC CTCACGCTGG AGCAGGTCCG CGCGGTCGCG 
ACCGGGGCCG CCCGCGCCGC GCTCGCCCCC GCGGCCCGCG AGCGCGTGCG GCGTTCCCGC
GCGCTGGTGG ACGCCCGGCT CGAGGACGGC GAGGCGCACT ACGGCATCAA CACCGGCTTC
GGGACGCTCG CCGAGGTCCG CATCCCGCGG GCCGACCTCG AGCGGCTGCA GCGCAACCTG
GTGCTCTCGC ACGCCGCCGG CGTGGGCGCG CCGCTGCCCC TCCCGGAGGC GCGCGCGCTG
GTGCTGCTGC GCGCCAACGT GCTCGCGAAG GGCGTCTCCG GGATCCGCGA GCGCACGCTG
GACCTGCTGC TCGCGATGCT CGAGCGCGGG GTGGTGCCGG TGGTGCCGGA GCGCGGGTCG
GTGGGCGCGT CGGGCGACCT CGCCCCGCTC GCGCACCTCG CGCTGGTGCT GATCGGCGAC
GGCGAGGCGT TCCTCGCGCC GCCCGGCGCG GCGGGCCGGC CCGAGCGGCT CCCCGGCGGC
GAGGCGCTGC GGCGGGCCGG GCTCGAGCCG GTGGTGCTGC AGCCGAAGGA GGGGCTGGCG
CTCGTGAACG GCACCCAGGC CATGGCCGCG GTCGGCACGC TCGCGCTGCT CCGCGCCGAG
CGGCTGGCGG CGCTCGCCGA TCTCGCGGGC GCCATGACGC TGGAGGGGCT GCTCGGCTCG
CACCGGCCGT TCGCGCCGGA GATCCAGGCC GCCCGCGGGC AGCCCGGCCA GATCGCCGCG
GCGGCGCACC TGCGCGCGCT GCTGGCCGGC TCCGAGCTGA ACGCCTCGCA CCAGGGCCCG
GGCTGCCACA AGGTGCAGGA CCCCTACTCG CTCCGCTGCA TGCCGCAGGT GCACGGCGCC
GCGCGCGACG GCATCGGCTT CTGCCGCGGG GTGCTGGCGC GCGAGGTGAA CGCCGCCACC
GACAACCCGC TGGTCTTCCC GGACACCGGG GAGATCGTCT CGGGCGGCAA CTTCCACGGC
CAGCCGGTGG CGCTCGCGCT CGACGTGCTC GCGGTGGCCG CCTCGCACCT CGCCGCCATC
TCGGAGCGCC GCGTGGAGCA GCTCGTGAAC CCGTCGCTCT CCGGGCTGCC GCCGTTCCTG
GCGCCCCAGC ACGGGCTCAA CTCGGGGTTC ATGATCGCGC AGGTGACCAG CGCGGCGCTC
GTCTCGGAGA ACAAGGTGCT CTGCCACCCG GCCTCGGTGG ACTCGATCCC GTCCTCCGCC
GGCCGCGAGG ACCACGTGTC GATGGGCATG ACCGCCGCGC TGAAGGCGCG CCAGGTGGTG
GAGAACGTCC GCACCTGCCT CGCCATCGAG CTGCTGGTCG CGGCGCAGGC GCTCGATCTC
CGGGCCCCGC TCCGCCCCGC CCAGCGCGTG GCCGAGGCGC ACGCCCGCCT GCGCGAGCGC
GTCCCGCACC TGTCGGAGGA TCGGGCGCTG CACCGCGACA TCGAGGCGGT GTCGAGCCTG
GTGGACGAGG GCGGGCTGGA GCTGTGA
 
Protein sequence
METLLLDGET LTLEQVRAVA TGAARAALAP AARERVRRSR ALVDARLEDG EAHYGINTGF 
GTLAEVRIPR ADLERLQRNL VLSHAAGVGA PLPLPEARAL VLLRANVLAK GVSGIRERTL
DLLLAMLERG VVPVVPERGS VGASGDLAPL AHLALVLIGD GEAFLAPPGA AGRPERLPGG
EALRRAGLEP VVLQPKEGLA LVNGTQAMAA VGTLALLRAE RLAALADLAG AMTLEGLLGS
HRPFAPEIQA ARGQPGQIAA AAHLRALLAG SELNASHQGP GCHKVQDPYS LRCMPQVHGA
ARDGIGFCRG VLAREVNAAT DNPLVFPDTG EIVSGGNFHG QPVALALDVL AVAASHLAAI
SERRVEQLVN PSLSGLPPFL APQHGLNSGF MIAQVTSAAL VSENKVLCHP ASVDSIPSSA
GREDHVSMGM TAALKARQVV ENVRTCLAIE LLVAAQALDL RAPLRPAQRV AEAHARLRER
VPHLSEDRAL HRDIEAVSSL VDEGGLEL