Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1410 |
Symbol | |
ID | 4068751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1707483 |
End bp | 1709033 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637983419 |
Product | histidine ammonia-lyase |
Protein accession | YP_590486 |
Protein GI | 94968438 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0341381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTC TTCATCTTAC TGGCAATACT CTTACCCTCG ACGAAGTGCG CGAGGTCGTT TACGAACAAC GTCCTGTGTT GCTGGATTCC GATGCGCGCG CCGCGGTAGA TCGCGCTCGC GCTGTGATCG AAGATGTCGT CGCCAACGAT CGCCTCGCGT ATGCAGTGAC GACGGGTGTC GGCAAGTTGA GCGATGTCCG CATTCCTCCC GCAGAAAATC GCACCCTGCA ACTCAACTTG ATGCGCTCCC ATGCCGTGGG TGTGGGCGAT CCACTCAGCG AGCAGGTCAG CCGCGCCATG ATGCTGCTGC GCGCCAACTC GCTTTGCAAA GGATGGTCAG GCGTACGTGG CCTGGTAATT GACACGCTCT GCGAGATGCT CAACCGCGGG GTGCATCCTG TGATTCCATC GCAGGGAAGC GTCGGCGCCA GCGGCGATCT CGCTCCTCTC GCGCACCAGG GGCTGGTGCT AATCGGCGAA GGCGAAGCCT TCTATCAAGG CAAACGTGTC AGCGGCGCAG AAGCGCTGCG CGCAGCAGGG ATTAAGCCGA TCACCCTCGA AGCCAAGGAA ACGATCTCGC TGATCAACGG CACCCAGGCG ATGCTTGCAG TCGGCCTGCT AGCAGTGCTC GACGCCGAAA TTCTTGCCGA GACCGCCGAT GCAGTCGGCG CGCTTGCCCT CGATGTACTG CAGGGAACTG ACGCTGCGTT CGACGAGCGC ATCCATAAAG CTCGCCCGCA CTCCGGACAG ATCCAAGTCG CGGCGAACCT GCGCCGCCTG CTCGCCGGCA GCCAGATTCA CGAATCGCAC AAAGACTGTG CCCGCGTGCA GGATGCCTAC TCGCTGCGCT GCATGCCGCA AGTGCACGGC GCCGTGCGCG ACACCATCCA CTATTGCCGC TCCGTCTTCG AAGTCGAGAT GAACTCCGCG GTGGACAATC CTCTGGTATT TCCAGAGCCG AAGAAGGTCG GCGAGCGCTC CGACGCGCCC GTCCATGGCG ACATCATTTC CGGCGGCAAC TTCCACGGTG AGCCGGTAGC GTTCGCGCTC GATTTCCTCG CGATCGCCTT GAGCGCGCTT GCCGGAATCT CCGAGCGCCG CATCGAGCGC CTGGTGAACC CGGCGCTGAG TGAAGGGCTG CCCGCCTTCC TCGCTCCCGG CGCAGGACTC AATTCCGGCT TCATGATGCC GCAGGTCACG GCCGCCGCTC TGGTCAGCGA GAACAAGGTG CTCTCACATC CGGCGTCGGT GGACTCGATC ACCACTTCGG GCAATAAAGA AGATTTCGTC TCGATGGGAA TGACGGCTGC GCTGAAACTG CAGCGCATCG TCCAGAACAC GCGCAATGTT ATGGCGATCG AAGCGCTAGC GGCCGCGCAG GCGCTCGACT TCAAAGCCCC GCTGAAAACA ACGAAGCTCC TGCAGAAGGT TCATGCTGCG GTTCGCGCGG TTTCACCGCA GATCACCGAA GACCGCATTC TCACGGCGGA TTTCGCAGCG GCGGAAGCGC TGATCCGAAG TGGAAAGCTC GCAGCGGCGG CGCGCAATTA G
|
Protein sequence | MKALHLTGNT LTLDEVREVV YEQRPVLLDS DARAAVDRAR AVIEDVVAND RLAYAVTTGV GKLSDVRIPP AENRTLQLNL MRSHAVGVGD PLSEQVSRAM MLLRANSLCK GWSGVRGLVI DTLCEMLNRG VHPVIPSQGS VGASGDLAPL AHQGLVLIGE GEAFYQGKRV SGAEALRAAG IKPITLEAKE TISLINGTQA MLAVGLLAVL DAEILAETAD AVGALALDVL QGTDAAFDER IHKARPHSGQ IQVAANLRRL LAGSQIHESH KDCARVQDAY SLRCMPQVHG AVRDTIHYCR SVFEVEMNSA VDNPLVFPEP KKVGERSDAP VHGDIISGGN FHGEPVAFAL DFLAIALSAL AGISERRIER LVNPALSEGL PAFLAPGAGL NSGFMMPQVT AAALVSENKV LSHPASVDSI TTSGNKEDFV SMGMTAALKL QRIVQNTRNV MAIEALAAAQ ALDFKAPLKT TKLLQKVHAA VRAVSPQITE DRILTADFAA AEALIRSGKL AAAARN
|
| |