Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_4197 |
Symbol | |
ID | 8360370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 5246614 |
End bp | 5249805 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644966366 |
Product | peptidase domain protein |
Protein accession | YP_003123855 |
Protein GI | 256423202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000751128 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.547659 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAAC TTCTACTCCT CGCAACTGCA CTACTGTGCA GTACACTGTC GTTTGCACAG GACTACTGGA GGCCACACAC CGACGGCGCC AGAATTAGTA CCGATAAAGC TGTAGCCCGT CTTGCTTTCC CTACTGAGTT TAAACTGTTT GATCTCAATT TCACCCCTTT CAGAGATCAG GCTTTCAGAT CGGTCGGTAA TAAGTCCGCA CATGCTACCA TTATTTCTTT ACCCAATGCC GACGGACAGA TTGAACAATT CGAAATTACA GAAGCATCCA ATTTCGAACC TGCTTTACAA GCCCGGTTCC CGGATATCAG AGCCTTTTCG GGAAAGGGTA TTACCGACAA ATATGCCACG TTAAAACTGA GCATATCTCC CGAAGGTATA CAGACGACTG TATTCCGTAC CGGGAAAGAA AATGAATTTA TTGAGCCGTA TTCTGCTGAC CATACAGTAT ATACTGTATT CAAGAAAAGA GCGAATACCT TACCTTGGAA ATGTTCCACT CCTGAGCAAC AACTGGCGAC TGACATAGGC AGCCGCATAC CTGATGTTGC AGGCCGGTCC ACCGGTGATG CCAAGACATT ACGGCTGGCA CAATCGGTAA CCGCTGAGTA CTCCAATTAC TTTGGCGCTA CCAGCGCTTC ACAGGTAGCC TTAGTACTGG CCGCTGTAAA CGCTACGCTG ACCCGTTGTA ACGGGGTGTA TGAGAAGGAT CTTGCGATCC ATCTTAACCT GGTGGCTTCC ACTACCAGTG TATTCTATTA TAATGCTTCC ACAGATCCGT ATTCTTCTGC CAGTTCAGGA GCCGGTGGCG CCTGGAACGG TGAATTACAG AGTACCCTGA ACTCCGTAAT CGGTGCTGCC AACTACGATA TCGGTCACCT GTTTGGTGCA TCCGGTGGTG GCGGTAACGC TGGTTGTATC GGTTGTATCT GTGTTGATAA TTCAAAAGGA AGTGGATTCA CCTCTCCTGC TGATGCGATT CCACAGGGCG ATAATTTCGA TATCGACTAT GTGGTACACG AAGTAGGTCA CCAGTTAGGC GCTAACCACA CTTTTTCTAT GAGCAATGAA GGTACAGGTG TGAACGTTGA ACCTGGTTCA GGTATCACGA TCATGGGGTA TGCCGGTATC ACCAGTCAGG ATCTGGCGCC GCATTCTATT GATATTTTCC ATGCGGCTTC CATTGCACAG ATCCAGGCAA ACCTGGCAAC TAAGAGTTGT CCTGTAACAA CCGTTATATC TGGCAATAAC GCTACGCCAG TAGTAAGTGC CGGTGGTAAT TTTACCATTC CTATCAGCAC GCCGTTTGTC CTGACTGGTT CCGCTACTGA CGCTAATCCA TCTGATGTAC TGACATATGC CTGGGAACAA TTTGACAATG CCTCTTCTTC ACAAACAGGC TCCAGCAGTG TGGCCAGTCC GACCAAAGCT ACTGGTCCGA ACTGGATCTC ATTGCCGCCA GTTACCTCAC CTACCAGGTA CTTCCCTAAA CTGGCGACCG TCCTCGCCGG TAACTTAGTC TCTGGTCCTT TAACCGGTGG TGATGCAGGC GCGAATACAG AGGCATTAAG TGCTGTGTCC AGGACCCTGC GTTTCCGTTT AACAGTAAGA GATAATGCCC CTTATAGTTC TACCGCGCCG GTGACTATCG GACAGACCAA CTTCTCGGAT ATGACGGTTA CTGTCAGCAA TACCTCCGGA CCATTCTCCG TAACGGCTCC TAATACGGCT GTTTCCTGGG CGGGTAATTC TTCCCAGAAT ATCACCTGGA ATGTGGCCAA TACAACCGCT TCTCCCGTAA GTTGTGCGAA TGTTAAGATC TCCATTTCTA CTGACGGAGG TAATACATTC AGCACCTTAG TTAGCAGTAC ACCGAATGAT GGTAGTCAGT CGGTTATCAT TCCTAACACA CCAACAACGA CAGCCAGAAT AAAAGTCGAA TCTGTAGGAA ATATCTTCTT TGACATTTCC AATACCAACT TCACGATTGT ATCCGGATCA TCCTGTACTT CTCCGACAGG ACTGAGCGAT TCAGCGATTA CCGCTACTTC TGCTACGATC ACCTGGGCAG ATGTAAGTGG AGCAGTATCG TATGACGTTG ATTACAAAGC AGTCGCAGAT ACTACCTGGA TCAGTGCTGC CGCAGGCATT ACTGTTACAG GCGTTAACCT GACAGGTCTG CTGGCAGGAC ATACATATGA TTACAGAGTA AGGTCACATT GTGCGAGCGA TAGTAGTTCT TATGCTGTCG GACAGCTCAC AACTACGCCA AGTGCATCCT GTAGCGCACC TGGTGGTTTA AGCAGTACAT CGGTAACTAC GACTGCTGCT GTTATCAGCT GGACGGCTGT AAGCGGTGCA TTGAGTTATG ATGTTGATTA TAAACCAGCT GCAAGTGCTA CCTGGATCAA TGTCCAGGCT GGAGCGACAG CAACATTTGC GAACCTGACT GGATTAACCG CTACCACTAC ATACGACTGG AGAGTGAGGA CAAACTGCTC CGATGGTAAT GGCGCTTATG CGAGTGCACA GTTCACTACC CAAACACCGC CAACCTGTGC TAATAACAAG GATACTGCTA CCAATGGTAA TACTGCCGGT GCAGCTACTA TTCCATTCAA TACGGATATT ACGGGTCTTA TCAGTCCGTC TGGTGACATC GATCATTATA AGTTTGTGAT CACCACTGCA GGTACGATCA CTATCACCCT GGGTACCTTA CCAGGCGACT ATGACCTGAA ACTGCTGAAC AGTGCCGGTA CACAGCTGGC CATTTCACAG GCGGGTGGTA CGAGCAGTGA AACGATCAGC AGAAATATGA CGCCAGGTAC TTATTATGCA CAGGTATATG GTTACAATGG CGCTAACAGT ACTACATCTT GTTATACATT GCGCGTACAG TTAGGTACTG CCAGCCGTAG CACGGATGTA AGCAGCGGAG ATCTCCCTAA AGTAGCGGTA TTCCCGAACC CTGCTAATAA CGTAGTGAAC GTTAATCTGA CAGGCTTCAA AGGAAAATCT GCCGTAACGA TGTATGATGT CAATGGTCGT GTGGTATTAC GCCGTGAAAT GAATCCGGTG AATACGCAGC TGGACATTTC CACATTACCG ACAGGTGTTT ATATTCTGAA GATCAGGAAC GGTGCGAAAG AGGTGAATAT GACGAAGATC ATCAAGCAAT AA
|
Protein sequence | MRKLLLLATA LLCSTLSFAQ DYWRPHTDGA RISTDKAVAR LAFPTEFKLF DLNFTPFRDQ AFRSVGNKSA HATIISLPNA DGQIEQFEIT EASNFEPALQ ARFPDIRAFS GKGITDKYAT LKLSISPEGI QTTVFRTGKE NEFIEPYSAD HTVYTVFKKR ANTLPWKCST PEQQLATDIG SRIPDVAGRS TGDAKTLRLA QSVTAEYSNY FGATSASQVA LVLAAVNATL TRCNGVYEKD LAIHLNLVAS TTSVFYYNAS TDPYSSASSG AGGAWNGELQ STLNSVIGAA NYDIGHLFGA SGGGGNAGCI GCICVDNSKG SGFTSPADAI PQGDNFDIDY VVHEVGHQLG ANHTFSMSNE GTGVNVEPGS GITIMGYAGI TSQDLAPHSI DIFHAASIAQ IQANLATKSC PVTTVISGNN ATPVVSAGGN FTIPISTPFV LTGSATDANP SDVLTYAWEQ FDNASSSQTG SSSVASPTKA TGPNWISLPP VTSPTRYFPK LATVLAGNLV SGPLTGGDAG ANTEALSAVS RTLRFRLTVR DNAPYSSTAP VTIGQTNFSD MTVTVSNTSG PFSVTAPNTA VSWAGNSSQN ITWNVANTTA SPVSCANVKI SISTDGGNTF STLVSSTPND GSQSVIIPNT PTTTARIKVE SVGNIFFDIS NTNFTIVSGS SCTSPTGLSD SAITATSATI TWADVSGAVS YDVDYKAVAD TTWISAAAGI TVTGVNLTGL LAGHTYDYRV RSHCASDSSS YAVGQLTTTP SASCSAPGGL SSTSVTTTAA VISWTAVSGA LSYDVDYKPA ASATWINVQA GATATFANLT GLTATTTYDW RVRTNCSDGN GAYASAQFTT QTPPTCANNK DTATNGNTAG AATIPFNTDI TGLISPSGDI DHYKFVITTA GTITITLGTL PGDYDLKLLN SAGTQLAISQ AGGTSSETIS RNMTPGTYYA QVYGYNGANS TTSCYTLRVQ LGTASRSTDV SSGDLPKVAV FPNPANNVVN VNLTGFKGKS AVTMYDVNGR VVLRREMNPV NTQLDISTLP TGVYILKIRN GAKEVNMTKI IKQ
|
| |