Gene Cpin_1504 details

Gene Information

Locus tag: Cpin_1504
Symbol:
ID: 8357645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 1847411
End bp: 1850428
Gene Length: 3018 bp
Protein Length: 1005 aa
Translation table: 11
GC content: 49%
IMG OID: 644963684
Product: amidohydrolase
Protein accession: YP_003121202
Protein GI: 256420549
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
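
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1847411
END_BP = 1850428
GENE_LENGTH_BP = 3018
PROTEIN_LENGTH_AA = 1005


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1850428 - 1847411 + 1 gives 3018 bp, and 3018 / 3 - 1 gives 1005 aa, matching the reported gene and protein lengths.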


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000736153
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATT TTTTTTCGCC GCGTAAGCGG TTGTCCCACC TGTGTCTGTT GCTGAGCGGT 
CTGCTGGCTG GCTCGTATGC TTCAGCGCAG GAAACTTTTC CGGTGAACGG CATCGCCGAT
CCCAGAGAAG GGTGTTATGC TTTTGTAAAA GCGACGATTG TAAAAAGCGC CGGTAATGTA
TTGAACAATG CCAGCCTCGT AATCCGTAAC GGACGTATCG TCAGTGTTGG CAATGGTCCT
GTTCCTAAAG ACGCGGTAGT GATTGATTGC GAGGGTAAAT ATATCTATCC TTCTTTCGTA
GATGCTTACA GCGATTATGG TACGCAGCCT GCAAAGAAAG CCGGCGGCGG ATACAGAGGT
GATCCGCAGT TCCTTTCTGC CACTAAAGGT GCATTCGGCT GGAACCAGGC AGTGAAGAGT
GAAATAAATG CGGCTTCCGT ATTCCAGGTA GATGGTACTG CTGCGGAGTC ATTGAGAAAT
GCAGGTTTTG GTACCGTATT GTCTCATCAG CAGGATGGTA TTGCCCGTGG TACGGGCGTA
CTGGTAACGC TGGCGGATGA CCGCGAAAAT AAAGTCCTGA TCAAAGAAAA AGCCAGTGCG
CAATTCTCAT TCGATAAAGG TAGTTCTACC CAGAATTATC CCAGTTCTTT AATGGGCAGC
ATCGCCGTAC TGCGTCAGAC ATTCCTGGAC GCCCAATGGT ATCAGTCACA TCCTGCTAAA
GAAGGGACAA ACCTGACTTT ACAGGCATGG AACGATAGCC GCGCTTTACC ACAGATCTTC
GACGTACAGG ATAAATGGGA TGCCCTGCGT GCAGATAAAG TAGGCGACGA ATTCGGAGTA
CAATATATCA TCAAAGCCAG TGGAAACGAA TACCAGCGTA TACCGGAAAT GGTAGCTACC
AAAGCTTCCT TTATCCTGCC GCTGAACTTC CCGCAGCCTA TTGACGTAGA AGATCCGAAC
GATGCCCGCT TCGTGGCCCT CAGCGATATG AAACACTGGG AACTGGCGCC GACAGAAGCT
GCTGCTTTTG AGAAAGCCAA CATCCCTTTC TGCCTCACCG CTACCGGTCT GAAAGACCTG
AAACAATTCC TGGGCGCTGT CAGAAAATCG ATTGAGTACG GCCTCAGTGA ACAAAAAGCC
CTGGATGCGC TGACCCTCTC TGCAGCACGT ATCATTAAAG CCGATGATCT CGTAGGCTCC
CTGGAGCCGG GTAAACTAGC CAACTTCCTG GTTACTTCCG GTACCATCTT CAATGAAAAT
ACCGTACTGT TCCAGAACTG GGTACAGGGT AAAAAATATA TCATCAAAGA CGAAGGCTGG
AAAGACGTGC GCGGCACTTA TACACTGACC CTGACGCCGG GTAATACGAA ATATACCCTG
CTGCTGAAAG GTACTGCCGC TACACCCACC TTGTCCCTCC TGCAACAGGA TACACTGACT
GGCTCGATCA CGATCAATGA TAAGCTGATC AAAATCGCCT TCCCTTTGAA AAAAGGTGGT
GCACAACTGC GTCTGAGTGG TGTAGCTGGT ACCAGCGAAT GGAGTGGTAC CGGTCTGGAC
ACAGCCGGCA AATGGGTAAA ATGGAATGCC TCCTTCAGTG CCGCTTACAC CAGACAGGAT
ACTGCAAAAT CAAAATCAGC GCCACAGCTG GGTAATATGT ACTATCCGTT CAATGGATAT
GGCTGGGAAG CCCTGCCTAA ACAACAGGAT ATCCTGATCA AAAATGCGAC TGTCTGGACC
AACGAAAAAG AAGGTAAACT CGAAAACACA GACGTGCTGG TACGCAACGG TAAGATCGCT
CAGATCGGTA AAAACCTGCC TGCCGGTAAT GCACGTTTAA TAGACGGTAC AGGCAAACAC
CTGACGCCGG GTATTATCGA CGAACACTCT CATATCGCTA TTTCCAAAGG CGTGAATGAA
GGAACGCAGT CCGTTACTTC AGAAGTGCGT ATCGCGGACG TTGTGAACCC GGATGATGTG
AATATCTACA GACAACTGAG TGGTGGGGTA ACCGCTTCCC ATCTGTTGCA TGGTTCTGCC
AATACGATCG GCGGACAAAG CCAGCTGATA AAACTGCGTT GGGGTGCTGA TGCGGAAGAA
CTCAAATTCG CAGGATCAGA TCCTTTTATC AAATTCGCAT TGGGCGAAAA CGTCAAACAA
TCCAACTGGG GCGAACGCCA GCGCGAGCGT TTCCCACAGA CGCGAATGGG GGTTGAACAG
GTACTCACAG ATGCATTCAC ACGTGCGCGT GATTACGAAA AAGAAGGCGC CGGCAAACGT
CGCGACCTGG AACTGGATGC ATTAGTAGAA ATCCTGAACA GCAAACGTTT CATTACCTGT
CACTCTTATG TACAGAGCGA AATCAATATG CTCATGCATG TGGCAGATAC CTTCCATTTC
AAAATAAACA CCTTTACACA CATCCTGGAA GGCTATAAGG TAGCCGATAA AATGAAGCAA
CACGGTGCAG GTGCAGGTAC CTTCGCTGAC TGGTGGGCTT ACAAGATGGA AGTACAGGAC
GCCATCCCTT ACAATGCCAC CATCATGCAA CGCGTAGGGG TGAATGTAGC GATCAACTCT
GATGATGCGG AAATGGCACG CAGACTGAAC CAGGAAGCCG CCAAGAGCAT CAAATACGGC
GATATGACCG AAGAGGAAGC CCTGAAACTG GTGACCATCA ACCCGGCAAA ACTGTTGCAC
GTAGCTGAGC GTACCGGTAG TATCAAAACC GGTAAGGATG CCGACCTGGT ACTCTGGAAC
GACAATCCGC TGAGCATTTA TGCAAAGGCG GAAAAGACAC TGGTAGACGG TATCGTATAT
TTCGACCGTG AGAAAGACCT GGAACTGCGT CAGCGTATCA GTGCTGAACG TAACCGCCTG
GTGCTGAAAA TGCTGAGTGA GAAGAAAAAA GGTACACCTA CACAGAAAGC AGCTGCTTCC
AGGGAGGAAA TGTATCACTG TGAAGATCTG CAGGCTGGTC ACCAGCAACG TCTGACAGAT
GGAAATAATG AATTGTAA
 
Protein sequence
MSNFFSPRKR LSHLCLLLSG LLAGSYASAQ ETFPVNGIAD PREGCYAFVK ATIVKSAGNV 
LNNASLVIRN GRIVSVGNGP VPKDAVVIDC EGKYIYPSFV DAYSDYGTQP AKKAGGGYRG
DPQFLSATKG AFGWNQAVKS EINAASVFQV DGTAAESLRN AGFGTVLSHQ QDGIARGTGV
LVTLADDREN KVLIKEKASA QFSFDKGSST QNYPSSLMGS IAVLRQTFLD AQWYQSHPAK
EGTNLTLQAW NDSRALPQIF DVQDKWDALR ADKVGDEFGV QYIIKASGNE YQRIPEMVAT
KASFILPLNF PQPIDVEDPN DARFVALSDM KHWELAPTEA AAFEKANIPF CLTATGLKDL
KQFLGAVRKS IEYGLSEQKA LDALTLSAAR IIKADDLVGS LEPGKLANFL VTSGTIFNEN
TVLFQNWVQG KKYIIKDEGW KDVRGTYTLT LTPGNTKYTL LLKGTAATPT LSLLQQDTLT
GSITINDKLI KIAFPLKKGG AQLRLSGVAG TSEWSGTGLD TAGKWVKWNA SFSAAYTRQD
TAKSKSAPQL GNMYYPFNGY GWEALPKQQD ILIKNATVWT NEKEGKLENT DVLVRNGKIA
QIGKNLPAGN ARLIDGTGKH LTPGIIDEHS HIAISKGVNE GTQSVTSEVR IADVVNPDDV
NIYRQLSGGV TASHLLHGSA NTIGGQSQLI KLRWGADAEE LKFAGSDPFI KFALGENVKQ
SNWGERQRER FPQTRMGVEQ VLTDAFTRAR DYEKEGAGKR RDLELDALVE ILNSKRFITC
HSYVQSEINM LMHVADTFHF KINTFTHILE GYKVADKMKQ HGAGAGTFAD WWAYKMEVQD
AIPYNATIMQ RVGVNVAINS DDAEMARRLN QEAAKSIKYG DMTEEEALKL VTINPAKLLH
VAERTGSIKT GKDADLVLWN DNPLSIYAKA EKTLVDGIVY FDREKDLELR QRISAERNRL
VLKMLSEKKK GTPTQKAAAS REEMYHCEDL QAGHQQRLTD GNNEL
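
The gene and protein sequences above can be cross-checked by translating the nucleotide block and comparing the result with the listed protein. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted as whitespace-separated 10-base groups, as shown above, and it uses the standard codon-to-amino-acid assignments. Translation table 11 shares these assignments and differs only in which codons may act as initiators, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = ("ATGAGTAATT TTTTTTCGCC GCGTAAGCGG "
                  "TTGTCCCACC TGTGTCTGTT GCTGAGCGGT")
    last_line = "GGAAATAATG AATTGTAA"
    print(translate(first_line))  # MSNFFSPRKRLSHLCLLLSG
    print(translate(last_line))   # GNNEL
```

The two printed fragments match the first 20 and last 5 residues of the protein sequence listed above. Passing the full gene sequence block to `translate` and `gc_percent` would allow the complete protein and the reported GC content to be checked in the same way.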