Gene Information |
Locus tag | Cpin_5110 |
Symbol | |
ID | 8361286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 6372424 |
End bp | 6375324 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644967258 |
Product | Two component regulator three Y domain protein |
Protein accession | YP_003124743 |
Protein GI | 256424090 |
COG category | |
COG ID | |
TIGRFAM ID | |
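The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(6372424, 6375324)
print(length, protein_length_aa(length))  # prints: 2901 966
```

The result matches the Gene Length (2901 bp) and Protein Length (966 aa) rows, which is what one would expect for an unspliced CDS on a single replicon.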
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.58811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.312264 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC TTTTTCTGCT TGTGCTGTCT ATCCTATGGA GCAACCTGCT CATCGCACAA CAAATAATCG GGCTACCGCC TGTGATCAAT TATACCCGTC AGCAATATCC TGGTGGCGGG CAAACCTGGG ACCTGGATAT GGACCGGAAT GGCATCCTTT ACTTTGCCAA CAATGAAGGG CTGATGACCT TCAACGGTCA TTCCTGGAAA TTACTGCCCG TTCCTAACCA TACCCGGCTA AGGGCTGTCA AAACAGCTAA AGACGGCCGG ATCTATATCG GCGCACAGGA TGATTTCGGT TATTACCTGG CGGATGCGGA TGGCGTGTTA CGGTATACCT CCCTCAAACC GGTCGTCAGG AACCGGAAGG ATCAATTTGC CGATATCTGG GATATTGTGA TCGACGACAG CGGAATCTTC TTCAGAAGCA CTAACAAGAT CTTTCACTAC GATCATAAAA CATTACATGT TTATCCCGCC CAGGCGGAAT GGCAGGTGTT ACGCCGTGCC GGCGGACAAC TATATGCCCA GGATGCAGCC AGGGGATTAT TACGATTCCG GAACGGCCAA TGGCAACCTG TCTGTACAGC GATCGCCAGT CAGCGCCTGC TGATCAGGGA AATCCTGGAT TACGGAAAGG ACACCCTATT GATCTGCACC TTAAAAAACG GTCTATTCAA ATTACATGGC GGTGAGCTGT TACCGTTCCG GACAGCAGCG GATGGCATAT TTACCGAAAA ACAAATCTAC TGTGCGAAGA ACTGGGGCGA TAGTGACAAA CAATTACTTG TAGGCACCTC CTATGGTGGT TGCTATATTA TTAATAGTAC TACAGGGCAT ATTATCCAAC GGTTTACGGT GGATGAAGGC TTACAGCATA ACAGTGTACT GAAGATCTTT ACAGATCCTG CCAGGAATAT CTGGCTAACG CTGGATAATG GTATTGATAT GGTGCGGTAT AATACGGCGG TGAAACAGAT TCTGCCGGAT GGCCAGCGAT ATCTTACTTC TTATACGGCG GCGATATTTA AGGAGCAGTT GTATATAGGT ACTTCGGATG GCGTTTTTGC GACACCGCTG GGTAACCGGA AAGACCTGAG TTTTCAGGAA GGAAATTTCC GGCGGATCAG ACATACGCAG GGACAGGTAT GGAACCTGAG TCAGATAGGC TCACATTTAC TAATGGGGCA TCATGAAGGC GCTTTTGAGA TTGACGGACA GGACGCCAGA TTAGTCTCCA AACTGACAGG AAGCTGGTTG TTCCGCCCCT TCTCTTCCCA TATACTCTCA GGAGGATATA ACGGACTGCA ATTGATCAAA GAGGATGGTA GTCAGCTAGG TGGCGCTGTC AGGGTACGAG GTTTGTTTGA ATCGTTCCGC TTTCTGACAG TGGAAGACGA TAGTATTGTC TGGTCTTCGC ATCCTTACAG GGGGGTATTC AGGATGCATG TAGCAGAAGA CAGTCTGCTA CAGTACACGC TTTTTACCAG CAAAGACGGT CTGCCTTCCG ACTATAACAA TTTTGTATTC CGGATCAGGA ACCGGACCCT GGTGGCCACA CAGGCAGGGG TATATGAATA TGATGCCAGG AAGCGGCATT TTGTCCCTTC TCCCTGGTTA TATCCTGTAC TAAAGAATAT TCCTTTACAA CACCTGGCGG AAGATGCTAC GGGTAATGTC TGGTTTGTTT CTGATAAGAT GCCGGGTGTA ATTGATTTTC ACAAGCCGTT ACCGGAGCAC CCTTATAGTA TTGTGTATTT CCCGGAACTG AAAGGTCAGG TGGTGAGCGG ATTTGAGTTC 
CTTTATCCAT ATGACGACGA GAACATTTTT GTGGGTGCGG AGAAAGGGAT GTATCACATT AACTATAAGC ATTACCGTCA TTCGAGGGAG TTGCCGGTTG TGCTGATTGG TCAGGTCAGC GCACATGGGA AAAGGGACAG TCTTTTATTC GGCGGGTATC ATCCCCCGGG AAAGCATTCT TTCAAGGCGA TGCCCAATGC ATTTTCGGGT TTTCACTTTG AGTATGCTTC ACCGATATAC AGTCAGGGGA ATACCATCTT CTACAGTTAC CGGCTGGCCG GTTTTGACCA GGGATGGAGT GAGTGGTCGG TAAAAACGGA AAAAGACTAC ACCAATCTGC CGCATGGTTA TTACGCTTTT TCTGTGCGGG CGAAGGACAA CCTGGGGAAT GTATCCCGTC CGGCAGTATA TGCATTCCGG ATATTACCTG CCTGGTATCA GACAGGGCTC GCTAAAAGTC TGTATGTGTT GAGCGGACTC GCCTTACTGC TCTGGGGCTA TATCTATCAG AAGAAAAAAT TCATCCGTCA GCGAAAGCGT TATCAGCAGA AACAGGAGCA GCTGATCTTA CGGTACCGGT TTGAAAAAGA GCAAAGAGAG AAAGACCTGA TCAGTCTGCA GAATGAAAAG CTGACAGCGG AAGTTCGTTT TAAAAACAGG GAGCTGGCGA CGGCTACGAT GTACCTGCTA CACAGGGGCA AGGTCTTATC GAATATCAAG GAAGAACTGT TGAGTGCTAT GAAGAAGTTA GACTCACCGG AAGGGGCTTT TAAAAAAGTA ATGCGTTTGT TTGAAGAGGC GGAAAACAAT GAAGAGGACT GGGAGCAATT TTCCCGGCAT TTTGATGAAG TGCATAACAA TTTCCTGTTT AAGCTGAAAC GTCGTTACCC GGAGCTGAGT ACGACTGATC TGAAACTCTG TGCATATCTG CGCATCAATC TGACGACGAA AGAAATTGCA CAGTCACTGG GGATTTCTGT CCGTGGAGTA GAAACAAGCA GATACCGGCT GCGCAAGAAA CTGGAATTAC CGGCGGAGGT CAGCCTATAT GACTTTCTGC TGGCGGTGGC GGATGATCAG GTAAGACCGG CACAATCATG A
Protein sequence | MNKLFLLVLS ILWSNLLIAQ QIIGLPPVIN YTRQQYPGGG QTWDLDMDRN GILYFANNEG LMTFNGHSWK LLPVPNHTRL RAVKTAKDGR IYIGAQDDFG YYLADADGVL RYTSLKPVVR NRKDQFADIW DIVIDDSGIF FRSTNKIFHY DHKTLHVYPA QAEWQVLRRA GGQLYAQDAA RGLLRFRNGQ WQPVCTAIAS QRLLIREILD YGKDTLLICT LKNGLFKLHG GELLPFRTAA DGIFTEKQIY CAKNWGDSDK QLLVGTSYGG CYIINSTTGH IIQRFTVDEG LQHNSVLKIF TDPARNIWLT LDNGIDMVRY NTAVKQILPD GQRYLTSYTA AIFKEQLYIG TSDGVFATPL GNRKDLSFQE GNFRRIRHTQ GQVWNLSQIG SHLLMGHHEG AFEIDGQDAR LVSKLTGSWL FRPFSSHILS GGYNGLQLIK EDGSQLGGAV RVRGLFESFR FLTVEDDSIV WSSHPYRGVF RMHVAEDSLL QYTLFTSKDG LPSDYNNFVF RIRNRTLVAT QAGVYEYDAR KRHFVPSPWL YPVLKNIPLQ HLAEDATGNV WFVSDKMPGV IDFHKPLPEH PYSIVYFPEL KGQVVSGFEF LYPYDDENIF VGAEKGMYHI NYKHYRHSRE LPVVLIGQVS AHGKRDSLLF GGYHPPGKHS FKAMPNAFSG FHFEYASPIY SQGNTIFYSY RLAGFDQGWS EWSVKTEKDY TNLPHGYYAF SVRAKDNLGN VSRPAVYAFR ILPAWYQTGL AKSLYVLSGL ALLLWGYIYQ KKKFIRQRKR YQQKQEQLIL RYRFEKEQRE KDLISLQNEK LTAEVRFKNR ELATATMYLL HRGKVLSNIK EELLSAMKKL DSPEGAFKKV MRLFEEAENN EEDWEQFSRH FDEVHNNFLF KLKRRYPELS TTDLKLCAYL RINLTTKEIA QSLGISVRGV ETSRYRLRKK LELPAEVSLY DFLLAVADDQ VRPAQS
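The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string might be cleaned up and its GC fraction computed is shown below. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence for display."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungapped nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGAACAAGC TTTTTCTGCT TGTGCTGTCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints: 30 0.4
```

Applying the same two steps to the full gene sequence should give a value close to the 48% in the GC content row, which is presumably rounded.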