Gene Cpin_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2007 
Symbol 
ID8358158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2446417 
End bp2448081 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content51% 
IMG OID644964194 
Producturocanate hydratase 
Protein accessionYP_003121703 
Protein GI256421050 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0339801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.22403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGTT CGGACTTTAT TAAGACATAT GCGGCACACC CGCATTATAA AGCGCCCCAT 
GGCAATCAGC TGCACGCGCG CTCCTGGCAG ACAGAAGCAC CCTTACGTAT GCTCCTGAAT
AACCTGGATG CCGAAGTGGC TGAGAATCCT GATGAACTGG TGGTGTATGG TGGTATCGGT
CAGGCTGCAC GTAACAAAGA ATCCTTACAG AAGATCATTG AGATCCTGCT GGAACTGGAC
GAAGATCATT CTTTGCTGGT ACAGTCGGGT AAACCCGTTG GCGTTGTTCG TACCCATCCG
CAGGCGCCCC GCGTTATGCT GGCGAATAGT AACCTGGTGC CTAAATGGGC TACCTGGGAA
CATTTCAACG AACTGCGTGC AAAAGGACTC ATGATGTACG GACAGATGAC AGCAGGTAGC
TGGATCTATA TCGGTACACA GGGTATCTTA CAGGGTACCT ACGAGACTTT TGTGGCTTGT
GGCCGTCAGC ATTTCAATGG CGACCTGAAA GGTAAACTGC TCGTGACGGC AGGTATTGGT
GGTATGGGTG GCGCACAGCC ATTGGCTGCT ACCATGGCCG GTGCTGTATT CCTGGGTGCA
GATGTGGATG AATCACGTAT CCGCAAGCGC CTGGCTACCC GTTATATCGA CCGTATTACC
CACTCTTATG AGGAGGCGAT TGCCTGGGCA ATGGACGCTA AAGCCAAAGG GGAAGCACTG
TCCATCGGGC TGGTAAGTGA TGCGGGAGAT ATGCTGGAAC GCTTACTGAA AGACAATATT
ATTCCTGATA TACTGACTGA CCAGACCTCC GCGCACGATC CTATTAACGG ATATGTGCCG
AATGGGCTTT CCCTGGAAGA AGCGACGGCA TTACGTAAAA AAGACCCGGC AGACTACAAA
GCCCGCTCTT TAAAGAGTAT GGCCCGTCAC GTATCTTTTA TGCTGGCTTT ACAGGGAAAG
GGCGCTGTTA CCTTTGACTA TGGTAATAAC CTGCGTGAGT TTGCACGTGA AGGTGGAGAA
CCTAACGCCT TCAACTTCCC GGGATTTACG CCTGCCTATA TCCGTCCCCT TTTCTGTGAA
GGGAAAGGAC CTTTCAGATG GGTGGCTTTA TCCGGCGATC CTGAAGATAT TTATACCACC
GACAAGGCAT TGATGGAAGC CTTTCCGGAG AATACGGCCC TGATCAACTG GCTGAAGAAA
GCACAGGCAC AGGTAGCCTT CCAGGGATTA CCTGCGCGTA TCTGCTGGCT GGGATTAGGC
GAAAGAGAAA AAGCCGGTCT TATTTTCAAT GAACTGGTGA GAACAGGTAA AGTGAAAGCG
CCTATTGTGA TCGGTCGCGA TCACCTGGAT TGTGGTTCTG TCGCATCTCC CAACAGGGAA
ACAGAAGCGA TGAAAGATGG TTCGGATGCG GTGTCTGACT GGACTTTATT AAACCTGATG
GCGAATACCG GCGGTGGTGC TACCTGGGTA TCTTTCCATC ATGGTGGCGG CGTTGGTATG
GGTTATTCAC AACATGCAGG CATGGTCGTA CTGGCAGATG GATCTGAACG TGCGGAAGCC
TGTCTGAAAA GAGTATTATT CAATGATCCG GCATTGGGCA TCTTCCGACA TGCGGATGCA
GGGTATGAAG AAGCAAAAGC AACTGCCAGA AAATTCAATA TCTGA
 
Protein sequence
MNSSDFIKTY AAHPHYKAPH GNQLHARSWQ TEAPLRMLLN NLDAEVAENP DELVVYGGIG 
QAARNKESLQ KIIEILLELD EDHSLLVQSG KPVGVVRTHP QAPRVMLANS NLVPKWATWE
HFNELRAKGL MMYGQMTAGS WIYIGTQGIL QGTYETFVAC GRQHFNGDLK GKLLVTAGIG
GMGGAQPLAA TMAGAVFLGA DVDESRIRKR LATRYIDRIT HSYEEAIAWA MDAKAKGEAL
SIGLVSDAGD MLERLLKDNI IPDILTDQTS AHDPINGYVP NGLSLEEATA LRKKDPADYK
ARSLKSMARH VSFMLALQGK GAVTFDYGNN LREFAREGGE PNAFNFPGFT PAYIRPLFCE
GKGPFRWVAL SGDPEDIYTT DKALMEAFPE NTALINWLKK AQAQVAFQGL PARICWLGLG
EREKAGLIFN ELVRTGKVKA PIVIGRDHLD CGSVASPNRE TEAMKDGSDA VSDWTLLNLM
ANTGGGATWV SFHHGGGVGM GYSQHAGMVV LADGSERAEA CLKRVLFNDP ALGIFRHADA
GYEEAKATAR KFNI