Gene Cpin_5110 details

Gene Information

Locus tag: Cpin_5110
Symbol:
ID: 8361286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6372424
End bp: 6375324
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 48%
IMG OID: 644967258
Product: Two component regulator three Y domain protein
Protein accession: YP_003124743
Protein GI: 256424090
COG category:
COG ID:
TIGRFAM ID:
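
The start and end coordinates can be cross-checked against the reported gene and protein lengths. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The record does not state either convention; the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6372424
end_bp = 6375324

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.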


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.58811
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.312264
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC TTTTTCTGCT TGTGCTGTCT ATCCTATGGA GCAACCTGCT CATCGCACAA 
CAAATAATCG GGCTACCGCC TGTGATCAAT TATACCCGTC AGCAATATCC TGGTGGCGGG
CAAACCTGGG ACCTGGATAT GGACCGGAAT GGCATCCTTT ACTTTGCCAA CAATGAAGGG
CTGATGACCT TCAACGGTCA TTCCTGGAAA TTACTGCCCG TTCCTAACCA TACCCGGCTA
AGGGCTGTCA AAACAGCTAA AGACGGCCGG ATCTATATCG GCGCACAGGA TGATTTCGGT
TATTACCTGG CGGATGCGGA TGGCGTGTTA CGGTATACCT CCCTCAAACC GGTCGTCAGG
AACCGGAAGG ATCAATTTGC CGATATCTGG GATATTGTGA TCGACGACAG CGGAATCTTC
TTCAGAAGCA CTAACAAGAT CTTTCACTAC GATCATAAAA CATTACATGT TTATCCCGCC
CAGGCGGAAT GGCAGGTGTT ACGCCGTGCC GGCGGACAAC TATATGCCCA GGATGCAGCC
AGGGGATTAT TACGATTCCG GAACGGCCAA TGGCAACCTG TCTGTACAGC GATCGCCAGT
CAGCGCCTGC TGATCAGGGA AATCCTGGAT TACGGAAAGG ACACCCTATT GATCTGCACC
TTAAAAAACG GTCTATTCAA ATTACATGGC GGTGAGCTGT TACCGTTCCG GACAGCAGCG
GATGGCATAT TTACCGAAAA ACAAATCTAC TGTGCGAAGA ACTGGGGCGA TAGTGACAAA
CAATTACTTG TAGGCACCTC CTATGGTGGT TGCTATATTA TTAATAGTAC TACAGGGCAT
ATTATCCAAC GGTTTACGGT GGATGAAGGC TTACAGCATA ACAGTGTACT GAAGATCTTT
ACAGATCCTG CCAGGAATAT CTGGCTAACG CTGGATAATG GTATTGATAT GGTGCGGTAT
AATACGGCGG TGAAACAGAT TCTGCCGGAT GGCCAGCGAT ATCTTACTTC TTATACGGCG
GCGATATTTA AGGAGCAGTT GTATATAGGT ACTTCGGATG GCGTTTTTGC GACACCGCTG
GGTAACCGGA AAGACCTGAG TTTTCAGGAA GGAAATTTCC GGCGGATCAG ACATACGCAG
GGACAGGTAT GGAACCTGAG TCAGATAGGC TCACATTTAC TAATGGGGCA TCATGAAGGC
GCTTTTGAGA TTGACGGACA GGACGCCAGA TTAGTCTCCA AACTGACAGG AAGCTGGTTG
TTCCGCCCCT TCTCTTCCCA TATACTCTCA GGAGGATATA ACGGACTGCA ATTGATCAAA
GAGGATGGTA GTCAGCTAGG TGGCGCTGTC AGGGTACGAG GTTTGTTTGA ATCGTTCCGC
TTTCTGACAG TGGAAGACGA TAGTATTGTC TGGTCTTCGC ATCCTTACAG GGGGGTATTC
AGGATGCATG TAGCAGAAGA CAGTCTGCTA CAGTACACGC TTTTTACCAG CAAAGACGGT
CTGCCTTCCG ACTATAACAA TTTTGTATTC CGGATCAGGA ACCGGACCCT GGTGGCCACA
CAGGCAGGGG TATATGAATA TGATGCCAGG AAGCGGCATT TTGTCCCTTC TCCCTGGTTA
TATCCTGTAC TAAAGAATAT TCCTTTACAA CACCTGGCGG AAGATGCTAC GGGTAATGTC
TGGTTTGTTT CTGATAAGAT GCCGGGTGTA ATTGATTTTC ACAAGCCGTT ACCGGAGCAC
CCTTATAGTA TTGTGTATTT CCCGGAACTG AAAGGTCAGG TGGTGAGCGG ATTTGAGTTC
CTTTATCCAT ATGACGACGA GAACATTTTT GTGGGTGCGG AGAAAGGGAT GTATCACATT
AACTATAAGC ATTACCGTCA TTCGAGGGAG TTGCCGGTTG TGCTGATTGG TCAGGTCAGC
GCACATGGGA AAAGGGACAG TCTTTTATTC GGCGGGTATC ATCCCCCGGG AAAGCATTCT
TTCAAGGCGA TGCCCAATGC ATTTTCGGGT TTTCACTTTG AGTATGCTTC ACCGATATAC
AGTCAGGGGA ATACCATCTT CTACAGTTAC CGGCTGGCCG GTTTTGACCA GGGATGGAGT
GAGTGGTCGG TAAAAACGGA AAAAGACTAC ACCAATCTGC CGCATGGTTA TTACGCTTTT
TCTGTGCGGG CGAAGGACAA CCTGGGGAAT GTATCCCGTC CGGCAGTATA TGCATTCCGG
ATATTACCTG CCTGGTATCA GACAGGGCTC GCTAAAAGTC TGTATGTGTT GAGCGGACTC
GCCTTACTGC TCTGGGGCTA TATCTATCAG AAGAAAAAAT TCATCCGTCA GCGAAAGCGT
TATCAGCAGA AACAGGAGCA GCTGATCTTA CGGTACCGGT TTGAAAAAGA GCAAAGAGAG
AAAGACCTGA TCAGTCTGCA GAATGAAAAG CTGACAGCGG AAGTTCGTTT TAAAAACAGG
GAGCTGGCGA CGGCTACGAT GTACCTGCTA CACAGGGGCA AGGTCTTATC GAATATCAAG
GAAGAACTGT TGAGTGCTAT GAAGAAGTTA GACTCACCGG AAGGGGCTTT TAAAAAAGTA
ATGCGTTTGT TTGAAGAGGC GGAAAACAAT GAAGAGGACT GGGAGCAATT TTCCCGGCAT
TTTGATGAAG TGCATAACAA TTTCCTGTTT AAGCTGAAAC GTCGTTACCC GGAGCTGAGT
ACGACTGATC TGAAACTCTG TGCATATCTG CGCATCAATC TGACGACGAA AGAAATTGCA
CAGTCACTGG GGATTTCTGT CCGTGGAGTA GAAACAAGCA GATACCGGCT GCGCAAGAAA
CTGGAATTAC CGGCGGAGGT CAGCCTATAT GACTTTCTGC TGGCGGTGGC GGATGATCAG
GTAAGACCGG CACAATCATG A
 
Protein sequence
MNKLFLLVLS ILWSNLLIAQ QIIGLPPVIN YTRQQYPGGG QTWDLDMDRN GILYFANNEG 
LMTFNGHSWK LLPVPNHTRL RAVKTAKDGR IYIGAQDDFG YYLADADGVL RYTSLKPVVR
NRKDQFADIW DIVIDDSGIF FRSTNKIFHY DHKTLHVYPA QAEWQVLRRA GGQLYAQDAA
RGLLRFRNGQ WQPVCTAIAS QRLLIREILD YGKDTLLICT LKNGLFKLHG GELLPFRTAA
DGIFTEKQIY CAKNWGDSDK QLLVGTSYGG CYIINSTTGH IIQRFTVDEG LQHNSVLKIF
TDPARNIWLT LDNGIDMVRY NTAVKQILPD GQRYLTSYTA AIFKEQLYIG TSDGVFATPL
GNRKDLSFQE GNFRRIRHTQ GQVWNLSQIG SHLLMGHHEG AFEIDGQDAR LVSKLTGSWL
FRPFSSHILS GGYNGLQLIK EDGSQLGGAV RVRGLFESFR FLTVEDDSIV WSSHPYRGVF
RMHVAEDSLL QYTLFTSKDG LPSDYNNFVF RIRNRTLVAT QAGVYEYDAR KRHFVPSPWL
YPVLKNIPLQ HLAEDATGNV WFVSDKMPGV IDFHKPLPEH PYSIVYFPEL KGQVVSGFEF
LYPYDDENIF VGAEKGMYHI NYKHYRHSRE LPVVLIGQVS AHGKRDSLLF GGYHPPGKHS
FKAMPNAFSG FHFEYASPIY SQGNTIFYSY RLAGFDQGWS EWSVKTEKDY TNLPHGYYAF
SVRAKDNLGN VSRPAVYAFR ILPAWYQTGL AKSLYVLSGL ALLLWGYIYQ KKKFIRQRKR
YQQKQEQLIL RYRFEKEQRE KDLISLQNEK LTAEVRFKNR ELATATMYLL HRGKVLSNIK
EELLSAMKKL DSPEGAFKKV MRLFEEAENN EEDWEQFSRH FDEVHNNFLF KLKRRYPELS
TTDLKLCAYL RINLTTKEIA QSLGISVRGV ETSRYRLRKK LELPAEVSLY DFLLAVADDQ
VRPAQS
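
The sequences above are printed in blocks of ten characters, so they need to be rejoined before any length or composition check. A minimal sketch follows, assuming whitespace is the only formatting in the listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C characters in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAACAAGC TTTTTCTGCT TGTGCTGTCT ATCCTATGGA GCAACCTGCT CATCGCACAA"

seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Running the same two helpers on the full gene sequence would allow a comparison against the Gene Length and GC content fields of the record.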