Gene Cpin_4667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4667 
Symbol 
ID8360841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5823660 
End bp5826890 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content49% 
IMG OID644966818 
Producthypothetical protein 
Protein accessionYP_003124305 
Protein GI256423652 
COG category[S] Function unknown 
COG ID[COG5305] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0836755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000312371 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTTTA AAAGGACCAA CAATATTGTA GGTTGGGTTG TCTGTATCAT TGCCTGCACT 
GTCTACATTA TGACTATGGA GGCCACCGGA AGTCTGTGGG ACTGCGGCGA GTTTATTTCT
AGTGCCTACA AGGTACAGGT TCCGCACCCT CCGGGAGCTC CTTTATTTGT GCTGTTGGGT
AGATTGTTTA CCATTCCGTT TCCGCCCTCT CAGGCAGCTA TCGGCGTAAA CCTGATGTCT
GCACTGTCAA GCGGATTCAC TATCCTCTTC CTCTTCTGGA CCATCACACA CTTCGCTCGT
CGTCTGATGG TGAAAGCCGG AGAAGTTATT TCCAGCGAAA AAATGATCGC CATCATGGGC
GCCGGCACAG TAGGTGCATT AGCTTACACC TTCTCTGACT CTTTCTGGTT CTCTGCCGTT
GAAGGCGAGG TGTACGCAAT GTCCTCCCTC TTCACCGCCA TCGTATTCTG GGCGATCCTG
AAATGGGAAC ACGAATCAGA TGAAGCATAT GCTGACCGCT GGATCGTTTT CATCGCTTTC
ACAATGGGTC TCTCTATCGG TGTCCATCTG CTGAACCTCC TCACCATCCC GGCTATCGTA
ATGGTGTACT ACTTCAAACG CTCTCCTAAA GTAACGCCTA TCGGTACTTT CTGGGCATTT
ATCATTGGTT GCGCTATCAC CGGTCTCGTG CAGAAGTTCG TTATCCAGGA TACCGTTAAG
GCTTCCGGTC TGATGGACGT ATTCTTCGTG AATAGCCTCG GCCTGCCATT CTACAGTGGC
TTCGCTTTCT ACTTCCTCGC ACTGGCTGCA GTCCTGTTAT ACGGATTGAA AAATCCTAAA
TTCGGTCTCT ACGCGCCACT GATCCTGATC GCTTCCGTTA TCGTCATCCC TGCCTTTAAC
GACGCTTCAG GTGCTGGTAT CAAAATATTA AAGCTGATCA TCTCAGCCGC AATCGTATTT
ATACCTTATC TGGTAAAACT GTTCGATGTT AAAATCGAAT CATCCAGCTT TACTCACGCT
ATCAAAGTAA CCATCTACTC TATCATCTTC CTGCTGCTCG GTTACTCCAC TTATATTACT
ACAATGATCC GCTCTACCGC GAATCCATCT GTAGATATGT ACAACGTGGA CAACCCGATC
TCCCTGGTAG GTTACCTGGG TCGTGAACAG TATGGTGATT TCCCGCTGAT CTATGGTCAG
GTATTCACTG CCCGCCCTAC TTCTTATGAA GATGCAGGAA ACATCTACGC ACGTGGTAAA
GACAAATATG AGATCGCTGG TAAAAAACAG GTTCCTGTTT ACGCTGCTGA AGATAAAATG
CTGTTCCCGC GTGTCTGGGA CGCCAGCAAC GATCAGGGGC ATGCGGATTA CTACCGCGAC
TGGCTGGGGC TCGATGCCAA CGCACGTCCT AGCTTTAAAG ATAACGTCAG CTTCTTCGTG
ACATACCAGG TTTACTTCAT GTACTTCCGC TACTTTATGT GGAACTTCTC CGGTAAACAA
AACGATACAC AAGGTTACGG TAACAAACGT GACGGTAACT GGATCACTGG TATCTCCTTT
ATAGATAATA TTATGTACGG CGACCAATCC ATGATGCCGG ACAGCTTAAA GAACAACAAA
GGACACAATA CCCTGTTCCT CCTGCCGTTC GTACTGGGTG TGATTGGTTT CTTCTATCAG
TATAACAATC ACCGTAAAGA TACACTGGTC GCTTCCCTCC TGTTCTTCTT CACCGGTTTT
GCCATCGTGC TCTACCTGAA CCAGGCGGGT AACCAGCCAC GTGAACGTGA CTATGCATAC
GTAGGTTCCT TCTATGCATT CGCCATCTGG ATCGGTCTGG GTGTACTGTC TGTGGCTGAA
TTCCTGAAGA AAAAGACTAA ATCAGCGATC TCCGCTCCGG CAGCTGCACT CGTTTGTCTG
CTGGCAGTGC CTGTCCTGAT GGGCTTCCAG GAATGGGATG ACCACGACCG TTCTACCAAA
ACCATCGCCC GTGATGTTGC TGCTGACTAC CTGAACTCCT GTGCTGAAAA TGCAATCCTG
TTCACCGTCG GCGATAACGA TACCTACCCG CTGTGGTATG CACAGGAAGT AGAAGGTATC
CGTCCTGATG TTCGTGTGAT CAACCTCAGC CTCCTCGGTG TAGACTGGTA TATCGATCAG
CAACGTCATA TGGTTAACAA GAGCGCAGGC GTTCCAATGT CCTGGACTCC TGATAAATAC
CAGGGCGAAA ACCGCAACTA CATCCAATAC TATGATGGCG GTAGCTTCCC GCAGGATAAA
TTCTATAACC TGAGAGAAGT AATGGCATTC ATGGGTTCTG ATGATCCTCG TGCTAAACTG
TCTACTACCG ACGGTTCACA GATCAACTAC CTGCCTGCTA AAAAACTCTT CGTACCGGTA
AACGTAGCAG AAGTGCTGAA AAACGGTACC GTTGATATCC ACGATAGCGC ACGTGTAATG
CCACAGCTGC CATTCCAGAT CAGCAAATCT TACCTGCTGA AAAATGACCT GGCGGTATAC
GACATCATCG CCGCTAACGA CTGGAAACGT CCTATCTACT TCACCAGCCC AACCGATCTC
GGTCTGAACG ACTACCTCCG TCCGGATGGT CTGACCTACC GCCTCGTGCC GCTGGCTAAG
ACAGAAAGCA ACGATCCGAT GGGTGCCGAC AATAACGTTA ACATCCCTGT GATGTACAAA
AACCTGATGG AAAAATTCGC TTTCGGTGGT GCTAACGTTC CAGGTACTTA CTTCGACGAA
CCTAACCGTA AACTGCTGCA ATACCTGCGT AACGCTTACA CTAAACTGGG TACCGCTATG
GCCCTGGCCG GTGACAAAGA TTCCGCACTC GCAGTACTGA ATAAGAGCGA TAAAAACCTC
CTGCAGGGTA ACTTCCCTTA CGCAATGACC ACGCCAGGAC AGATGCACAA CTACAGCTCT
ATGCAGACTG TATACGCTTA CTACCTGGCT GGCGATGCGA AGAAAGCAGA CGAGATCTCT
CAGCTGATCA TCAAAGATTG TACACAGCAG TTACAATACT ATCGCTGCCT GCCACCTTCT
AAAATGAATG GTTTGCAGCG TGATATGCAG ATGGCTGAAC AGTTCATCAC CCTGCTGCAG
CGTATGAAGG AAGATTTTAC ACATCCTGAA CGCCGCCAGT CACTGGAGCA ACCGGGTGGT
GTCAATATTG ACACTGTTGA GCCGGAAGCT GATGGTGCAC AAACAAAATA A
 
Protein sequence
MNFKRTNNIV GWVVCIIACT VYIMTMEATG SLWDCGEFIS SAYKVQVPHP PGAPLFVLLG 
RLFTIPFPPS QAAIGVNLMS ALSSGFTILF LFWTITHFAR RLMVKAGEVI SSEKMIAIMG
AGTVGALAYT FSDSFWFSAV EGEVYAMSSL FTAIVFWAIL KWEHESDEAY ADRWIVFIAF
TMGLSIGVHL LNLLTIPAIV MVYYFKRSPK VTPIGTFWAF IIGCAITGLV QKFVIQDTVK
ASGLMDVFFV NSLGLPFYSG FAFYFLALAA VLLYGLKNPK FGLYAPLILI ASVIVIPAFN
DASGAGIKIL KLIISAAIVF IPYLVKLFDV KIESSSFTHA IKVTIYSIIF LLLGYSTYIT
TMIRSTANPS VDMYNVDNPI SLVGYLGREQ YGDFPLIYGQ VFTARPTSYE DAGNIYARGK
DKYEIAGKKQ VPVYAAEDKM LFPRVWDASN DQGHADYYRD WLGLDANARP SFKDNVSFFV
TYQVYFMYFR YFMWNFSGKQ NDTQGYGNKR DGNWITGISF IDNIMYGDQS MMPDSLKNNK
GHNTLFLLPF VLGVIGFFYQ YNNHRKDTLV ASLLFFFTGF AIVLYLNQAG NQPRERDYAY
VGSFYAFAIW IGLGVLSVAE FLKKKTKSAI SAPAAALVCL LAVPVLMGFQ EWDDHDRSTK
TIARDVAADY LNSCAENAIL FTVGDNDTYP LWYAQEVEGI RPDVRVINLS LLGVDWYIDQ
QRHMVNKSAG VPMSWTPDKY QGENRNYIQY YDGGSFPQDK FYNLREVMAF MGSDDPRAKL
STTDGSQINY LPAKKLFVPV NVAEVLKNGT VDIHDSARVM PQLPFQISKS YLLKNDLAVY
DIIAANDWKR PIYFTSPTDL GLNDYLRPDG LTYRLVPLAK TESNDPMGAD NNVNIPVMYK
NLMEKFAFGG ANVPGTYFDE PNRKLLQYLR NAYTKLGTAM ALAGDKDSAL AVLNKSDKNL
LQGNFPYAMT TPGQMHNYSS MQTVYAYYLA GDAKKADEIS QLIIKDCTQQ LQYYRCLPPS
KMNGLQRDMQ MAEQFITLLQ RMKEDFTHPE RRQSLEQPGG VNIDTVEPEA DGAQTK