Gene Information |
Locus tag | Cpin_4387 |
Symbol | |
ID | 8360560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 5453973 |
End bp | 5457143 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644966546 |
Product | hypothetical protein |
Protein accession | YP_003124034 |
Protein GI | 256423381 |
COG category | |
COG ID | |
TIGRFAM ID | |
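The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the annotated span includes the stop codon. Both assumptions are consistent with the values listed but are not stated by the record itself. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that includes
    the stop codon (assumptions, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(5453973, 5457143)
print(gene_len, protein_len)  # 3171 1056
```

The result matches the Gene Length (3171 bp) and Protein Length (1056 aa) fields.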
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.95296 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00457818 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGTTGT ATCAATCCCT GCCGGGAACA GGCCGCCTGC CCGGTTTTAT CACCTTATTA TCACTATTGC TGCTCACCGC TTTTGCTACT CAGGCACGCC AGCAACAGCA TACGCTCCCC CGGCTGATCC CCAGGTCTCA TGACGCTGTT GCTGTCAGCA AGGCCAATCG CATTCCTGCG GGAGAAAGTG CCGGTATCGC TGATGTAAAC ATTACCCTGG CAGACCTGCA GATCGGTTCC TTTGCATTGC CCATCACACT CCGTTACCAC AGTAATGGTC TCAAAGTAAA TGAAGTGCCT TCCTCCTTAG GCGATGGCTG GTCTTTGCAT TACGGGGGTA TGATCAGCTT TCGGCAGCAT GGGCTCAATG ATTTTAAAAC CGCAGGTCTC TTCGACGGCG GTATCAGCTC CGCTTCCATG ACAATGCTGA AGTCCTTCCT GCGGGGGCAT ATGACGACCG GTCAGCAGCA TACCTGGCTG GATAATCTGA TCAACAGCAA TGCCGACGCT GAATTTGATC AGTATCAGTA CAGTTATCCC GGCAAATCCG GCGCTTACTA TTATGATACC CTCATGAATG TAGTAACGGT GCCCAAAAAC GATATCCGGG TATTCCGGGG GAATAACGAA ATACGTGTGC TCGATAATCA GAACAACAAC TACTACTTTG GTACACTGGA ACAGAGCACC GTGACAGATG CCTCCGTATA CGGATATACA CCTGCTTTTG ATGATGTGGC TACTTACTAT TTATCCAGGA TAGTTACCAG TCAAAACAGG ACGATCCTGT TCAGGTACAG AAAATACAGC TATTCCGTAC AGCAACAGAA ATCCGCTGTG GCATTTTCCG CGGTAGCAGG TACCTGTAGC GCCGACAACG GACAAGCCCT CTATACAAGA ACAGTACAGA TCAATTGCCT CTTACCTGAC TCTGTCATTT ACGATCAGGG CTATGTTAAA TTCGCTATCA GTACTATTCC CCGCGAAGAC ATCAAGGCAC TCCAGGATAC TGCCGCCATC CCTTCCATCA CAGGACTCAG CATCTTTGCC GGAAACAACA AAAAGATCAA ATCCTATTCT TTTCTGCAAG GATATTTTGA TACCAATAAA CGATTGAAAT TATCAGGTGT ACAGGAATGG AATGGCGATG CAACAGATAA ATTATGGCAG TTCCACTATT ACCGTGAACA GGATGCCTTC CCGGCAATGT CCTCAAATGA CCAGGACCAT TGGGGATATT ACAATGCAGC AGGTAACGCA GGCATGATCC CTGATATAGA CTATGCATCT TTGATTCCTG GCTGGAATGC ACCGGCTGCC AACTATGGCA ACCGGAGTGC CAATATAGCC GCCCGCTATG GTATGTTGCA AACTATCATC CATCCTACAG GTGGTAGTAC CGCTTTTGAA TATGAACCCA ACCAGGTCAG GGTAGAGGAT TACAACACAC TCACCACCTT GTCTCCATTC CTGACAGTCC CTGGTGGTAC GCCAGTGCCT TATGCAGTAG GTGGTGTACG CATAAAGACG ATTATTACCA ATGATAGTAT AGGTACGCCT TCCAGCTACC GGACGTACCA GTACGCCGAT AGTCTCAACC AGGTGGCCTT CCTGGATATT CCTTACTACA TCGCAAAGAT GGAATATAAC AGGGATAGTG CAGGCGCTTG TCTTGCCTGT GGACAGTCGG CAACTGTTTT TGATGAAAGT GTCTGGCCCT TAGATGCGAT CCCTGTTATT TACAGCCACA TCACCGTGTC TGATTCCAGT GTTGCCGGCA GACAAGGTAA AACAGAAAGT 
GTCTATTTAT TGCCGGAGAA TCATACCGTC AGCAATACCG CTCCCTACGT CACACCGCTT AATACTTCCT GGCAGATCGG CACGCTCGTC AACAGAAAAC TATACGCACT CAAAGACAAC GTCGATGAAC TGGTAAAAGA ATACCGCTAC ACTTACAAAG CCCTGGAAGA GAGTTATCAG ACAACAGGCG TGAAAACTAA CTACGCACAA TACTGTACAC TGAATACACC CGATAACAAC AACTATAATA CTTCCCTATC TACTTTCTTC TCAGACCGTT TTTACCTGCA GCAATCAAAG GAAATAGATT ATCTGTCGGG ACAAACTCTG ACGAAACAGA TAGATTATGC TGTTAGCGGG ATCAGACATA ATTTACCCGC TGTTGTCAGC TACCCGGACA GCAAGGGAGA CACCATCAAA GAACGAATTA TTTACTCCTG GGATTATGAT ACTACGACTA CAACCGGTGG CGACGCTTTA GCGATCCGCA ACCTGGGCAG ATTAAATGTC CTGGTGCCAA TTGAAAGAAT ACAAATCCGT ACCATTGATG GCGTAGACTA CGTAGTCGGT GCAACGTTAA CTAATTACCG GACAGACCGC CCCTTTCCGG ACAGGATCTA TGAGCTGAAC GTGCCGGCGC CTATTCCATT GTCTGCCTAC ACGACAAGCA GAATCAGTGG CAACTTCGTC AGAGACAGTC GCTATGAATT GGTCACCACC TTTAGTATGT ACGACAGCAA CAATAATGTG ACGGAGACAA CCGGCATTGA CGGTGTCCCT GTCAGTTACC TGTACGGATA TAAACAGCTA TACAAGATAG CGGAGATTAC CACTTCAAGC CGGCCATACG TCGCCTATAC GTCCTTTGAA ACAGATGAAC GTGGCGGCTG GGCATATTCA GGTACGCCTG TGGTGGATGC TACATCTCCT GTCGGTGAGC GTTGTTATGA CCTGTCCACC GGTACATTTG GCGTCGGTAA TATCGTGTTG CCCACAGCTG TACTGACCTA TTGGTCCAAA AGTCCTACGC CTTTCGTATT TCCAAGGCAG ATTCCTTTTC AGCCTGCTAC AAGCGGGAGA ACAGTCAATG GATGGACTTT TCATATGCAC TGGTTGTCGG GGCAGTTTCC ACAGGATATC GTTTTTCCTT CTTCTGGTTT CATAGATGAT GTAAGAGTCT TCCCGCTGAG ATCTCAGATT AAATCATATA GTTATCAACC TCATACAGGG ATGACAAGTG AGTGTGATGA CCGTGGCAAC ATCACCTACT ATTCCTACGA TGAAACAGGA CGGTTAAAAA TGGTACAGAA TGGGGATCGT GCGATCCTCA AACTAATCGA TTACCAGTAC ATGAAACCTA TTACAGAATA A
|
Protein sequence | MSLYQSLPGT GRLPGFITLL SLLLLTAFAT QARQQQHTLP RLIPRSHDAV AVSKANRIPA GESAGIADVN ITLADLQIGS FALPITLRYH SNGLKVNEVP SSLGDGWSLH YGGMISFRQH GLNDFKTAGL FDGGISSASM TMLKSFLRGH MTTGQQHTWL DNLINSNADA EFDQYQYSYP GKSGAYYYDT LMNVVTVPKN DIRVFRGNNE IRVLDNQNNN YYFGTLEQST VTDASVYGYT PAFDDVATYY LSRIVTSQNR TILFRYRKYS YSVQQQKSAV AFSAVAGTCS ADNGQALYTR TVQINCLLPD SVIYDQGYVK FAISTIPRED IKALQDTAAI PSITGLSIFA GNNKKIKSYS FLQGYFDTNK RLKLSGVQEW NGDATDKLWQ FHYYREQDAF PAMSSNDQDH WGYYNAAGNA GMIPDIDYAS LIPGWNAPAA NYGNRSANIA ARYGMLQTII HPTGGSTAFE YEPNQVRVED YNTLTTLSPF LTVPGGTPVP YAVGGVRIKT IITNDSIGTP SSYRTYQYAD SLNQVAFLDI PYYIAKMEYN RDSAGACLAC GQSATVFDES VWPLDAIPVI YSHITVSDSS VAGRQGKTES VYLLPENHTV SNTAPYVTPL NTSWQIGTLV NRKLYALKDN VDELVKEYRY TYKALEESYQ TTGVKTNYAQ YCTLNTPDNN NYNTSLSTFF SDRFYLQQSK EIDYLSGQTL TKQIDYAVSG IRHNLPAVVS YPDSKGDTIK ERIIYSWDYD TTTTTGGDAL AIRNLGRLNV LVPIERIQIR TIDGVDYVVG ATLTNYRTDR PFPDRIYELN VPAPIPLSAY TTSRISGNFV RDSRYELVTT FSMYDSNNNV TETTGIDGVP VSYLYGYKQL YKIAEITTSS RPYVAYTSFE TDERGGWAYS GTPVVDATSP VGERCYDLST GTFGVGNIVL PTAVLTYWSK SPTPFVFPRQ IPFQPATSGR TVNGWTFHMH WLSGQFPQDI VFPSSGFIDD VRVFPLRSQI KSYSYQPHTG MTSECDDRGN ITYYSYDETG RLKMVQNGDR AILKLIDYQY MKPITE
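The gene sequence above is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so the protein sequence can be reproduced by direct translation. The sketch below is an illustration, not the database's own pipeline. It builds a codon table whose amino-acid assignments are shared by the standard code and translation table 11, strips the spaces between the 10-base blocks, and stops at the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block-separating spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGTCGTTGT ATCAATCCCT GCCGGGAACA"))  # MSLYQSLPGT
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ATGAAACCTA TTACAGAATA A"))  # MKPITE
```

Both outputs agree with the first and last residues of the protein sequence listed above.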