Gene Acid345_1410 details


Gene Information

Locus tag: Acid345_1410
Symbol:
ID: 4068751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1707483
End bp: 1709033
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 62%
IMG OID: 637983419
Product: histidine ammonia-lyase
Protein accession: YP_590486
Protein GI: 94968438
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
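
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table.
length_bp = gene_length(1707483, 1709033)
print(length_bp, protein_length(length_bp))  # prints "1551 516"
```

Both results agree with the Gene Length and Protein Length fields.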


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0341381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTC TTCATCTTAC TGGCAATACT CTTACCCTCG ACGAAGTGCG CGAGGTCGTT 
TACGAACAAC GTCCTGTGTT GCTGGATTCC GATGCGCGCG CCGCGGTAGA TCGCGCTCGC
GCTGTGATCG AAGATGTCGT CGCCAACGAT CGCCTCGCGT ATGCAGTGAC GACGGGTGTC
GGCAAGTTGA GCGATGTCCG CATTCCTCCC GCAGAAAATC GCACCCTGCA ACTCAACTTG
ATGCGCTCCC ATGCCGTGGG TGTGGGCGAT CCACTCAGCG AGCAGGTCAG CCGCGCCATG
ATGCTGCTGC GCGCCAACTC GCTTTGCAAA GGATGGTCAG GCGTACGTGG CCTGGTAATT
GACACGCTCT GCGAGATGCT CAACCGCGGG GTGCATCCTG TGATTCCATC GCAGGGAAGC
GTCGGCGCCA GCGGCGATCT CGCTCCTCTC GCGCACCAGG GGCTGGTGCT AATCGGCGAA
GGCGAAGCCT TCTATCAAGG CAAACGTGTC AGCGGCGCAG AAGCGCTGCG CGCAGCAGGG
ATTAAGCCGA TCACCCTCGA AGCCAAGGAA ACGATCTCGC TGATCAACGG CACCCAGGCG
ATGCTTGCAG TCGGCCTGCT AGCAGTGCTC GACGCCGAAA TTCTTGCCGA GACCGCCGAT
GCAGTCGGCG CGCTTGCCCT CGATGTACTG CAGGGAACTG ACGCTGCGTT CGACGAGCGC
ATCCATAAAG CTCGCCCGCA CTCCGGACAG ATCCAAGTCG CGGCGAACCT GCGCCGCCTG
CTCGCCGGCA GCCAGATTCA CGAATCGCAC AAAGACTGTG CCCGCGTGCA GGATGCCTAC
TCGCTGCGCT GCATGCCGCA AGTGCACGGC GCCGTGCGCG ACACCATCCA CTATTGCCGC
TCCGTCTTCG AAGTCGAGAT GAACTCCGCG GTGGACAATC CTCTGGTATT TCCAGAGCCG
AAGAAGGTCG GCGAGCGCTC CGACGCGCCC GTCCATGGCG ACATCATTTC CGGCGGCAAC
TTCCACGGTG AGCCGGTAGC GTTCGCGCTC GATTTCCTCG CGATCGCCTT GAGCGCGCTT
GCCGGAATCT CCGAGCGCCG CATCGAGCGC CTGGTGAACC CGGCGCTGAG TGAAGGGCTG
CCCGCCTTCC TCGCTCCCGG CGCAGGACTC AATTCCGGCT TCATGATGCC GCAGGTCACG
GCCGCCGCTC TGGTCAGCGA GAACAAGGTG CTCTCACATC CGGCGTCGGT GGACTCGATC
ACCACTTCGG GCAATAAAGA AGATTTCGTC TCGATGGGAA TGACGGCTGC GCTGAAACTG
CAGCGCATCG TCCAGAACAC GCGCAATGTT ATGGCGATCG AAGCGCTAGC GGCCGCGCAG
GCGCTCGACT TCAAAGCCCC GCTGAAAACA ACGAAGCTCC TGCAGAAGGT TCATGCTGCG
GTTCGCGCGG TTTCACCGCA GATCACCGAA GACCGCATTC TCACGGCGGA TTTCGCAGCG
GCGGAAGCGC TGATCCGAAG TGGAAAGCTC GCAGCGGCGG CGCGCAATTA G
 
Protein sequence
MKALHLTGNT LTLDEVREVV YEQRPVLLDS DARAAVDRAR AVIEDVVAND RLAYAVTTGV 
GKLSDVRIPP AENRTLQLNL MRSHAVGVGD PLSEQVSRAM MLLRANSLCK GWSGVRGLVI
DTLCEMLNRG VHPVIPSQGS VGASGDLAPL AHQGLVLIGE GEAFYQGKRV SGAEALRAAG
IKPITLEAKE TISLINGTQA MLAVGLLAVL DAEILAETAD AVGALALDVL QGTDAAFDER
IHKARPHSGQ IQVAANLRRL LAGSQIHESH KDCARVQDAY SLRCMPQVHG AVRDTIHYCR
SVFEVEMNSA VDNPLVFPEP KKVGERSDAP VHGDIISGGN FHGEPVAFAL DFLAIALSAL
AGISERRIER LVNPALSEGL PAFLAPGAGL NSGFMMPQVT AAALVSENKV LSHPASVDSI
TTSGNKEDFV SMGMTAALKL QRIVQNTRNV MAIEALAAAQ ALDFKAPLKT TKLLQKVHAA
VRAVSPQITE DRILTADFAA AEALIRSGKL AAAARN
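
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11. Only the first and last lines of the gene sequence are used as input here; the same calls apply to the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from this record.
first_line = ("ATGAAAGCTC TTCATCTTAC TGGCAATACT CTTACCCTCG "
              "ACGAAGTGCG CGAGGTCGTT")
last_line = ("GCGGAAGCGC TGATCCGAAG TGGAAAGCTC GCAGCGGCGG "
             "CGCGCAATTA G")

head = clean_sequence(first_line)
tail = clean_sequence(last_line)
print(len(head), round(gc_fraction(head), 3), ends_with_stop(tail))
```

Running `clean_sequence` on the complete gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information table.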