Gene Caci_0254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0254 
Symbol 
ID8331581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp286093 
End bp287646 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content74% 
IMG OID644953421 
Producthistidine ammonia-lyase 
Protein accessionYP_003111048 
Protein GI256389484 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.225314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG TAGTGGTGAT CGGTGAGGCG GATCTCACTT TCGGTGACGT TGTCGCCGTG 
GCCCGGGACG GTGCCCGCGT CGAATTGTCC GCGACGTCGC TGAAGGCGCT GGCCGAGGGG
CGCGCCGTGG TGGACCGGCT GGCCGCCGCG CCGACCCCGG CCTACGGCAT CTCCACCGGC
TTCGGCGCGC TGGCCACCCG GCACATCGAT CCGGAGATGC GCGCGCAGCT GCAGCGCTCC
CTGATCCGCT CGCACGCCGC CGGCATGGGC CCGCTGGTCG AGCCCGAGGT GATCCGCGCG
CTCACCCTCA TGCGGCTGAA GACGCTGGCC ACCGGGCACA CCGGCGTGCG CCCCGTGGTC
GCCGAGACCA TGGCCGCGCT GCTGAACTCC GGCGTCACCC CGGCCGTGCG CGAGTACGGC
TCGCTGGGCT GTTCCGGCGA CCTGGCGCCG CTGTCGCACG TCGCCCTGGT GCTCATGGGC
GAGGGCGAGG TTGTCGGGGC GGACGGGGTT TCTGCGGTCG CCGCAGGACC GGTCCTGGCC
GAGCACGGCA TCGAGCCGCT GGAGCTGGCG CCCAAGGAGG GCCTGGCGCT GATCAACGGC
ACCGACGGCA TGCTCGGCAT GCTGATCCTG GCTCTTGGCG ACCTGACCGA GCTGGTGAAG
GTCGCGGACA TCTCCGCGGC GATGTCGGTC GAGGCGCTGC TGGGCACGGA CAAGGTGTTC
CGCCCCGAAC TGCAGGCCAT CCGCCCGCAT CCGGGTCAGG CCGCTTCCGC CGCGAACCTG
GTGAAGGTGC TCGACGGCTC GCCGATCATG GAGTCGCACC GCGAGCCCAA CGAGTGCACC
CGCGTCCAGG ACGCCTACTC GCTGCGCTGC GCCCCGCAGG TCGCCGGCGC CACGCGGGAC
ACCATGGCGC ACGCCGCGAC GGTCGCCGAG CGCGAACTCG CCTCGATCGT CGACAACCCG
GTGGTGCTGC TGGCCGACGG CCGCGTGGAG TCCAACGGCA ACTTCCACGG CGCCCCGGTC
GCGATGGTCC TGGACTTCCT CGCCATCGCC GCCGCCGACC TCGGCTCCAT CGCCGAGCGC
CGCACCGACC GCATGCTCGA CGTCGCGCGC TCGCACGGCC TGCCGCCGTT CCTCGCCGAC
GACCCGGGCG TCGACAGCGG CCTGATGATC GCGCAGTACA CGCAGGCCGC TTTGGTCAGC
GAGAACAAGC GGCTGGCGGT CCCGGCGTCG GTGGACTCGA TCCCGTCCTC GGCGATGCAG
GAGGACCACG TCTCCATGGG CTGGTCGGCG GCGCGCAAGC TGCGCACCGC CGTGGACAAC
CTGCGGCGCA TCCTGGCCGT CGAGCTGGTC GCCGCCGCGC GGGCGCTGGA ACTGCGCGCG
CCGCTGCAGC CCGCCGCCGG GACCGGCGCG GCTGTCAGGG CCCTGCGGGA GGCCGGCGTC
GGCGGCCCGG GCCCGGACCG CTTCCTGTCG CCGGAGCTGC GCGCCGCCGA GGACGCGCTG
AAGTCCGGCG CGGTGGTCGC GGCCGTCGAG ACGGCCGTGG GTCCGCTGAA CTGA
 
Protein sequence
MSQVVVIGEA DLTFGDVVAV ARDGARVELS ATSLKALAEG RAVVDRLAAA PTPAYGISTG 
FGALATRHID PEMRAQLQRS LIRSHAAGMG PLVEPEVIRA LTLMRLKTLA TGHTGVRPVV
AETMAALLNS GVTPAVREYG SLGCSGDLAP LSHVALVLMG EGEVVGADGV SAVAAGPVLA
EHGIEPLELA PKEGLALING TDGMLGMLIL ALGDLTELVK VADISAAMSV EALLGTDKVF
RPELQAIRPH PGQAASAANL VKVLDGSPIM ESHREPNECT RVQDAYSLRC APQVAGATRD
TMAHAATVAE RELASIVDNP VVLLADGRVE SNGNFHGAPV AMVLDFLAIA AADLGSIAER
RTDRMLDVAR SHGLPPFLAD DPGVDSGLMI AQYTQAALVS ENKRLAVPAS VDSIPSSAMQ
EDHVSMGWSA ARKLRTAVDN LRRILAVELV AAARALELRA PLQPAAGTGA AVRALREAGV
GGPGPDRFLS PELRAAEDAL KSGAVVAAVE TAVGPLN