Gene Cpin_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2006 
Symbol 
ID8358157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2444871 
End bp2446439 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content51% 
IMG OID644964193 
Producthistidine ammonia-lyase 
Protein accessionYP_003121702 
Protein GI256421049 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.34717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.214546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACACC AATTCAAGTA CGGTATTGAT CAGCTCACCG TGGCGAAGGC GCTGGATATT 
GCGTCAGGAA AGCTCGCCGG TATACTGGTA CCGGAAGTGA CTGCACGCAT CAGGGCCAGC
GCTGCCCACG TGCAGACTAT TGTCGCCGCT CACAGCACCG TATATGGTAT CAATACAGGC
TTCGGGCCGC TCTGCGATAC CAAGATATCC GAAGAAGATA CCCGTGCCCT GCAATACAAC
ATTCTTCAAA GTCATAGTGT AGGGGTAGGT AACCCGATCC CCGAACTGGT AGCCCGCGTG
ATGCTGGTCA CCAAAGTACA GGCCCTGGCA CAGGGATATT CAGGGGTAGC CCTGGCTACG
CTTGAACGTA TCATCTGGCA TATAGAACAT CACGTAACGC CCGTGGTACC TGAAAAAGGT
TCTGTCGGCG CCTCCGGCGA CCTGGCGCCT TTATCACATC TCTTCCTGCC GCTGATCGGT
CTTGGCGAGG TGTATTACAA AGGACAGCGA CAGCCCTCAG CAGCTGTTTT ACAGGCAGAA
AACCTGGAGC CGGTTATCTT AGGTCCCAAA GAGGGACTGG CGCTTATAAA CGGTACCCAG
TTCATTCTCT CCTTTGCAGT CACCGCTTTA CAGCGTATGC ATAATGCACT GGAAGCTGCT
GATATCATTG GTGCATTATC GCTTGAAGGA CTCATGGGTA CCGCTCGTCC CTTTGATCCG
CGTTTACATG CGATCAGACC TTTCCCCGGT AATCAGCTGG TGGCACACAG GCTGAAAATT
ATGCTGGAAA ACTCCGGTAT CATGGCTGCT CACGTGGATT GCGGTCGTGT ACAGGATCCT
TATTCCCTGC GCTGTATGCC ACAGGTACAT GGCGCTTCCC GTACCGCATG GCATCACCTG
CGTGAACTGA CCGTAATTGA ATTGAACGCT GTGACAGATA ACCCTATCAT CTTTAGCGCG
GAGGATACGA TCAGCGGGGG TAACTTCCAT GGTCAGCCGC TGGCAATGCC GCTGGACTAT
GCGACTGTCG CCGCCGCAGA ACTGGGTAAT ATATCCGATC GCCGTTGTTA TATGATGATC
GAAGGCCGTT ATGGCTTACC CAAACTATTA ATCGAAGATG CCGGACTCAA TTCCGGTTTT
ATGATACCAC AATACACGAC AGCCGCCCTG GTAACAGAGA ATAAGACCCT TTGTTTCCCG
GCCAGTGCCG ATAGTGTACC GACTTCCCTT GGTCAGGAAG ACCATGTATC CATGGGTTCC
ATCAGTGGTC GTAAACTGCA CCAGGTTATT GACAACCTGG AATATATACT GGCAGTCGAA
CTGCTGTATG CTGCACAGGC GGTGGATTTC AGACGGCCTT TACAATCCGG ACCTATCCTG
GAAGCTGTAC ATTCCTTTGT ACGTGAAACC GTTCCATTCG CAGCCAAAGA CAGGATTTTT
GCCTATGATA TCAAACAGCT GCATGGACTG ATCACCAATC AATCTTTAGT GAACGTGGCT
AACAATGCCG CACTTGACAA TCACCTTTCT CTAAACGGTA TTTATCATGA ACAGTTCGGA
CTTTATTAA
 
Protein sequence
MIHQFKYGID QLTVAKALDI ASGKLAGILV PEVTARIRAS AAHVQTIVAA HSTVYGINTG 
FGPLCDTKIS EEDTRALQYN ILQSHSVGVG NPIPELVARV MLVTKVQALA QGYSGVALAT
LERIIWHIEH HVTPVVPEKG SVGASGDLAP LSHLFLPLIG LGEVYYKGQR QPSAAVLQAE
NLEPVILGPK EGLALINGTQ FILSFAVTAL QRMHNALEAA DIIGALSLEG LMGTARPFDP
RLHAIRPFPG NQLVAHRLKI MLENSGIMAA HVDCGRVQDP YSLRCMPQVH GASRTAWHHL
RELTVIELNA VTDNPIIFSA EDTISGGNFH GQPLAMPLDY ATVAAAELGN ISDRRCYMMI
EGRYGLPKLL IEDAGLNSGF MIPQYTTAAL VTENKTLCFP ASADSVPTSL GQEDHVSMGS
ISGRKLHQVI DNLEYILAVE LLYAAQAVDF RRPLQSGPIL EAVHSFVRET VPFAAKDRIF
AYDIKQLHGL ITNQSLVNVA NNAALDNHLS LNGIYHEQFG LY