Gene Cpin_5648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5648 
Symbol 
ID8361825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7204082 
End bp7207207 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content47% 
IMG OID644967789 
ProductTonB-dependent receptor plug 
Protein accessionYP_003125273 
Protein GI256424620 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTAA AAAATGCTTC TAAGAGGCTA TGGATGCTCC TGTCCTTCAT AGCTATGCTG 
ACCGGCGCCA TGCCTGCCTT TGCCCAGAGT GGCAAAATCT CCGGGACCGT CACGGACATT
CAAGGACAAG CGCTTCCCGG TGTATCGGTG AAGCTGACCG GCACCAAAAC CGGTACTGTC
ACGGACCTGA ATGGTGCTTT CTCGCTGAAT GCTACTGAAA AAGGTACGCT GGAATTCAGT
TTCCTGGGAT TTGAAATACA AACGGTCGTT TTTTCCGGCT CGCAGCCGAT CAAAGTCAAA
CTACAACAGG CGACAGCCAA TGTGGATGAA GTGGTCGTAG TGGGCGCCAG CATGAAAAAG
TCTGATCTGA CTGGTTCAGT GGCTACAGTA GATTCAAAGA AACTGCTGGA GCGTCCTGCT
ACCAACATCA ACCAGGCGTT ACAGGGTAAT GCTGCCGGCG TATTCGTAAG TAATGGTACC
CGTCCTAGCG ACGATGCTAC CATCCGGGTA CGTGGTATCA ATACGATCAA CGCTGGTTCT
TCTCCTATCT ACGTTGTGGA TGGTGTCATC ATGGAAAACA ACCAGGGTGG CTTTAACTCC
GTAAACGTGA ATGACGTTGC TTCCGTACAG GTACTGAAAG ACGCTTCTGC TACCGCATTA
TACGGTTCCC GTGGTGCAAA CGGCGTTGTG GTGATCACCA CCAAGAAAGG TCAGCGCAGA
GGTGGAGACG GTCTTGTGAC CTACGATGCC TGGGCTGGCG TGTCTGACTT TACCCGCATT
CCTAAAACGA TGAATGCACA GCAGCTGTTC GATCTGCGTA TCGACGCTTA TGCGAATGGT
TATATGAAGG ATAATCCGAA CGCTAATCGC CAGGATTATA TCGATAATAC CCTGATGAAG
ACAAACATCG CTTTCTCTAA TCAGGAGTTC GATACGCATA ACAAAAACCA AAGCTTTAAC
TGGCTCGACC AGGTAACCCG TACGGGCTTT CAGCAGAACC ATGCTTTGAG CTTCTCTGGT
GGATCTGACC GTGGTATCTT CTATCTCAGC CTTGGTTACG CAGGTGTTAA AGGTGTGGTA
GAAAATACTG ACCAGAAAAA ATATACCGGC CGTTTCAATG CCGAATACAA TATCAAGAAA
TGGTTAAAAG TAGGTACCAA CACCGGCTTT ACCCGTCTAA ATGATGGTAT GCCATCTGAT
GATGTATATG GTAAAGCACT GAACGGTAAC CCGCTGCTGG ACTATGCGCC TTACCGCGAT
CCGGCTACCC GTTTTACTTA CGACTACCTC ACACTTTATT ACCGTTCACA TGGTGAGCAG
AACAACAACG ATTTCAACCC GTTCAACTCT CTGTTAATGC AGCGCGATCG TACGCGTAGC
CGTGTAACTA CTGCTAACTA TGTAGACATC TCTCCTATAA AAGGTCTGAA CTTCCGTTCC
AGCTATTCCC TGGATTACGG TGCACAGGAC TGGTTTGAAT TCACACCACA TAACATCCAG
GAAGCGATCC GTCACTACAA TGGTGATGCG CGCGCTAAAC ATGAAAGATG GAGCGATACT
TACTGGCAGT GGGATAACAC CCTGACATAT AATACCATGT TGGGAACTGA CCATAAACTG
ACAGCGTTGG TGGGTACCAG CTCCAGCAAA CGTTCTTCTA ACTATTCCAA AGCACAGGGC
GACCGTTTTG CAAGTGATGA TCTGGGTTAT TATGACCTGG GTGGCGCTGC TGCACTGGAA
AAAGCAGTAC TCGGTTCTGA CTTCTATGCT TACAGTCTGA TGTCTTTCAT CGGTCGCGTA
AACTATAGCT ATAAAGATAA ATATCACCTG ACGGCTACTG CCCGTTATGA TGGTTCTTCC
CGTTTCGCTG CAGGCAACCG CTGGGGTATC TTCCCGTCTA TGTCCGCTGC ATGGGATATC
ATCAAGGAAG ATTTCATGAA AGACGTGCCG GTATTCACAC AGTTGAAACT GCGTGCAGGT
TATGGTGTGG TTGGTAACCA GGATATCGGT AACTACGCCT TCCAGACACT GTACGGTTCC
AAGATCGATA ACGGTAATGC ACTGATTGCA AATGATGGTC GTCGTGGTAA TCCGAATATT
ACCTGGGAAA AACAGAAACA GACCAACCTG GGGCTGGATA TGGGCTTCCT GAAATCAAGA
TTGACTGTCA CTGCCGACTT CTTCTACATC AATAACGACA ACCTGTTACT GGACCGTTCC
CTGGCATTTA CTACTGGTTA TAGCAAACAG TGGGAAAACG TAGGACGTGT AAATAACAAG
GGTATGGAGT TCGCTGTAAA TGGTGAAGTG ATCCAGACGA AAAACTTCAA CTGGAACGTA
TCGGCTAACA TCTCTTTCGA CAAGAACAAA GTGACCCGTC TGTATGGTAA TGCTACAGAG
ATCTATAATC TCTATGAAAA CTCCATCCAA CGTGAAGGCA ACATCTTCCT GGGTCAGTCT
ATCCATAGCA TCTACACCCT GAAATCTGGT GGTATTGCAC AGGAATCTAA CCGTCGTGAC
TGGGAAAATA TCAATTACAA TGGTAAGACG GTTAATCCGG GCGACCTCTT TGCAAAAGAC
ATCTCTGGTC CTAACGGTGT ACCTGATGGT ATAGTAGACC TGACCTATGA CAGAACCGTC
GTAGCGAAAA CAGATCCTAA ATTTTATGGT GGTTTCTCTA CCAACCTCGG TTACAAAGCA
TTCTCCGTGA GCGCAATGTT CACTTACTCT TACGGCGCTA AAAAGATCAG CAGTTACTAT
GAAGGTCTGG CGAACAGTAT CGGTGAGAGC ATGGCATCTG TAGACCTGCT GAACCGCTGG
ACGCCAACGA ATACCAATAC AACTGTTCCA CGTGTGATCG GCAATACCAG CTACAACCGT
TTTAATCCGT CCGATCTGGA CTACGCTGTC CAGAACGCTT CATTCCTGCG TCTGTCTGCA
CTCACGCTGT CTTACACACT GCCTGAGAAA ATGCTGAGCA GCTGGAAAAT GAACAACCTG
CGTCTGTATG CAACTGGTTC TAACCTGTTC TGTATCACAA AGTACAAGGG TATGGATCCG
GAAACCGGCG ACTACGGTTA CCCTCCGGTG AAAATGTATG TATTAGGATT GAATGTAGGT
TTCTAA
 
Protein sequence
MHVKNASKRL WMLLSFIAML TGAMPAFAQS GKISGTVTDI QGQALPGVSV KLTGTKTGTV 
TDLNGAFSLN ATEKGTLEFS FLGFEIQTVV FSGSQPIKVK LQQATANVDE VVVVGASMKK
SDLTGSVATV DSKKLLERPA TNINQALQGN AAGVFVSNGT RPSDDATIRV RGINTINAGS
SPIYVVDGVI MENNQGGFNS VNVNDVASVQ VLKDASATAL YGSRGANGVV VITTKKGQRR
GGDGLVTYDA WAGVSDFTRI PKTMNAQQLF DLRIDAYANG YMKDNPNANR QDYIDNTLMK
TNIAFSNQEF DTHNKNQSFN WLDQVTRTGF QQNHALSFSG GSDRGIFYLS LGYAGVKGVV
ENTDQKKYTG RFNAEYNIKK WLKVGTNTGF TRLNDGMPSD DVYGKALNGN PLLDYAPYRD
PATRFTYDYL TLYYRSHGEQ NNNDFNPFNS LLMQRDRTRS RVTTANYVDI SPIKGLNFRS
SYSLDYGAQD WFEFTPHNIQ EAIRHYNGDA RAKHERWSDT YWQWDNTLTY NTMLGTDHKL
TALVGTSSSK RSSNYSKAQG DRFASDDLGY YDLGGAAALE KAVLGSDFYA YSLMSFIGRV
NYSYKDKYHL TATARYDGSS RFAAGNRWGI FPSMSAAWDI IKEDFMKDVP VFTQLKLRAG
YGVVGNQDIG NYAFQTLYGS KIDNGNALIA NDGRRGNPNI TWEKQKQTNL GLDMGFLKSR
LTVTADFFYI NNDNLLLDRS LAFTTGYSKQ WENVGRVNNK GMEFAVNGEV IQTKNFNWNV
SANISFDKNK VTRLYGNATE IYNLYENSIQ REGNIFLGQS IHSIYTLKSG GIAQESNRRD
WENINYNGKT VNPGDLFAKD ISGPNGVPDG IVDLTYDRTV VAKTDPKFYG GFSTNLGYKA
FSVSAMFTYS YGAKKISSYY EGLANSIGES MASVDLLNRW TPTNTNTTVP RVIGNTSYNR
FNPSDLDYAV QNASFLRLSA LTLSYTLPEK MLSSWKMNNL RLYATGSNLF CITKYKGMDP
ETGDYGYPPV KMYVLGLNVG F