Gene Information |
Locus tag | Caci_0254 |
Symbol | |
ID | 8331581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 286093 |
End bp | 287646 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 644953421 |
Product | histidine ammonia-lyase |
Protein accession | YP_003111048 |
Protein GI | 256389484 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
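The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes inclusive 1-based coordinates and that the CDS ends with a stop codon, which is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons; the terminal stop
    codon, if present, does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Caci_0254.
length_bp = gene_length(286093, 287646)
print(length_bp, protein_length(length_bp))
```

With the values from the record, this reproduces the listed gene length of 1554 bp and protein length of 517 aa.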
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.225314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG TAGTGGTGAT CGGTGAGGCG GATCTCACTT TCGGTGACGT TGTCGCCGTG GCCCGGGACG GTGCCCGCGT CGAATTGTCC GCGACGTCGC TGAAGGCGCT GGCCGAGGGG CGCGCCGTGG TGGACCGGCT GGCCGCCGCG CCGACCCCGG CCTACGGCAT CTCCACCGGC TTCGGCGCGC TGGCCACCCG GCACATCGAT CCGGAGATGC GCGCGCAGCT GCAGCGCTCC CTGATCCGCT CGCACGCCGC CGGCATGGGC CCGCTGGTCG AGCCCGAGGT GATCCGCGCG CTCACCCTCA TGCGGCTGAA GACGCTGGCC ACCGGGCACA CCGGCGTGCG CCCCGTGGTC GCCGAGACCA TGGCCGCGCT GCTGAACTCC GGCGTCACCC CGGCCGTGCG CGAGTACGGC TCGCTGGGCT GTTCCGGCGA CCTGGCGCCG CTGTCGCACG TCGCCCTGGT GCTCATGGGC GAGGGCGAGG TTGTCGGGGC GGACGGGGTT TCTGCGGTCG CCGCAGGACC GGTCCTGGCC GAGCACGGCA TCGAGCCGCT GGAGCTGGCG CCCAAGGAGG GCCTGGCGCT GATCAACGGC ACCGACGGCA TGCTCGGCAT GCTGATCCTG GCTCTTGGCG ACCTGACCGA GCTGGTGAAG GTCGCGGACA TCTCCGCGGC GATGTCGGTC GAGGCGCTGC TGGGCACGGA CAAGGTGTTC CGCCCCGAAC TGCAGGCCAT CCGCCCGCAT CCGGGTCAGG CCGCTTCCGC CGCGAACCTG GTGAAGGTGC TCGACGGCTC GCCGATCATG GAGTCGCACC GCGAGCCCAA CGAGTGCACC CGCGTCCAGG ACGCCTACTC GCTGCGCTGC GCCCCGCAGG TCGCCGGCGC CACGCGGGAC ACCATGGCGC ACGCCGCGAC GGTCGCCGAG CGCGAACTCG CCTCGATCGT CGACAACCCG GTGGTGCTGC TGGCCGACGG CCGCGTGGAG TCCAACGGCA ACTTCCACGG CGCCCCGGTC GCGATGGTCC TGGACTTCCT CGCCATCGCC GCCGCCGACC TCGGCTCCAT CGCCGAGCGC CGCACCGACC GCATGCTCGA CGTCGCGCGC TCGCACGGCC TGCCGCCGTT CCTCGCCGAC GACCCGGGCG TCGACAGCGG CCTGATGATC GCGCAGTACA CGCAGGCCGC TTTGGTCAGC GAGAACAAGC GGCTGGCGGT CCCGGCGTCG GTGGACTCGA TCCCGTCCTC GGCGATGCAG GAGGACCACG TCTCCATGGG CTGGTCGGCG GCGCGCAAGC TGCGCACCGC CGTGGACAAC CTGCGGCGCA TCCTGGCCGT CGAGCTGGTC GCCGCCGCGC GGGCGCTGGA ACTGCGCGCG CCGCTGCAGC CCGCCGCCGG GACCGGCGCG GCTGTCAGGG CCCTGCGGGA GGCCGGCGTC GGCGGCCCGG GCCCGGACCG CTTCCTGTCG CCGGAGCTGC GCGCCGCCGA GGACGCGCTG AAGTCCGGCG CGGTGGTCGC GGCCGTCGAG ACGGCCGTGG GTCCGCTGAA CTGA
|
Protein sequence | MSQVVVIGEA DLTFGDVVAV ARDGARVELS ATSLKALAEG RAVVDRLAAA PTPAYGISTG FGALATRHID PEMRAQLQRS LIRSHAAGMG PLVEPEVIRA LTLMRLKTLA TGHTGVRPVV AETMAALLNS GVTPAVREYG SLGCSGDLAP LSHVALVLMG EGEVVGADGV SAVAAGPVLA EHGIEPLELA PKEGLALING TDGMLGMLIL ALGDLTELVK VADISAAMSV EALLGTDKVF RPELQAIRPH PGQAASAANL VKVLDGSPIM ESHREPNECT RVQDAYSLRC APQVAGATRD TMAHAATVAE RELASIVDNP VVLLADGRVE SNGNFHGAPV AMVLDFLAIA AADLGSIAER RTDRMLDVAR SHGLPPFLAD DPGVDSGLMI AQYTQAALVS ENKRLAVPAS VDSIPSSAMQ EDHVSMGWSA ARKLRTAVDN LRRILAVELV AAARALELRA PLQPAAGTGA AVRALREAGV GGPGPDRFLS PELRAAEDAL KSGAVVAAVE TAVGPLN
|
| |
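The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a minimal sketch like the following. It assumes the sequences are pasted in as plain strings with the 10-character grouping spaces left in. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGAGCCAGG TAGTGGTGAT CGGTGAGGCG"
print(translate(prefix))  # first ten residues of the protein
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the listed protein sequence gives a quick integrity check on a copied record. `gc_content` returns a fraction, so multiply by 100 to compare with the GC content field.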