Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_4667 |
Symbol | |
ID | 8360841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 5823660 |
End bp | 5826890 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644966818 |
Product | hypothetical protein |
Protein accession | YP_003124305 |
Protein GI | 256423652 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0836755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000000312371 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTTTA AAAGGACCAA CAATATTGTA GGTTGGGTTG TCTGTATCAT TGCCTGCACT GTCTACATTA TGACTATGGA GGCCACCGGA AGTCTGTGGG ACTGCGGCGA GTTTATTTCT AGTGCCTACA AGGTACAGGT TCCGCACCCT CCGGGAGCTC CTTTATTTGT GCTGTTGGGT AGATTGTTTA CCATTCCGTT TCCGCCCTCT CAGGCAGCTA TCGGCGTAAA CCTGATGTCT GCACTGTCAA GCGGATTCAC TATCCTCTTC CTCTTCTGGA CCATCACACA CTTCGCTCGT CGTCTGATGG TGAAAGCCGG AGAAGTTATT TCCAGCGAAA AAATGATCGC CATCATGGGC GCCGGCACAG TAGGTGCATT AGCTTACACC TTCTCTGACT CTTTCTGGTT CTCTGCCGTT GAAGGCGAGG TGTACGCAAT GTCCTCCCTC TTCACCGCCA TCGTATTCTG GGCGATCCTG AAATGGGAAC ACGAATCAGA TGAAGCATAT GCTGACCGCT GGATCGTTTT CATCGCTTTC ACAATGGGTC TCTCTATCGG TGTCCATCTG CTGAACCTCC TCACCATCCC GGCTATCGTA ATGGTGTACT ACTTCAAACG CTCTCCTAAA GTAACGCCTA TCGGTACTTT CTGGGCATTT ATCATTGGTT GCGCTATCAC CGGTCTCGTG CAGAAGTTCG TTATCCAGGA TACCGTTAAG GCTTCCGGTC TGATGGACGT ATTCTTCGTG AATAGCCTCG GCCTGCCATT CTACAGTGGC TTCGCTTTCT ACTTCCTCGC ACTGGCTGCA GTCCTGTTAT ACGGATTGAA AAATCCTAAA TTCGGTCTCT ACGCGCCACT GATCCTGATC GCTTCCGTTA TCGTCATCCC TGCCTTTAAC GACGCTTCAG GTGCTGGTAT CAAAATATTA AAGCTGATCA TCTCAGCCGC AATCGTATTT ATACCTTATC TGGTAAAACT GTTCGATGTT AAAATCGAAT CATCCAGCTT TACTCACGCT ATCAAAGTAA CCATCTACTC TATCATCTTC CTGCTGCTCG GTTACTCCAC TTATATTACT ACAATGATCC GCTCTACCGC GAATCCATCT GTAGATATGT ACAACGTGGA CAACCCGATC TCCCTGGTAG GTTACCTGGG TCGTGAACAG TATGGTGATT TCCCGCTGAT CTATGGTCAG GTATTCACTG CCCGCCCTAC TTCTTATGAA GATGCAGGAA ACATCTACGC ACGTGGTAAA GACAAATATG AGATCGCTGG TAAAAAACAG GTTCCTGTTT ACGCTGCTGA AGATAAAATG CTGTTCCCGC GTGTCTGGGA CGCCAGCAAC GATCAGGGGC ATGCGGATTA CTACCGCGAC TGGCTGGGGC TCGATGCCAA CGCACGTCCT AGCTTTAAAG ATAACGTCAG CTTCTTCGTG ACATACCAGG TTTACTTCAT GTACTTCCGC TACTTTATGT GGAACTTCTC CGGTAAACAA AACGATACAC AAGGTTACGG TAACAAACGT GACGGTAACT GGATCACTGG TATCTCCTTT ATAGATAATA TTATGTACGG CGACCAATCC ATGATGCCGG ACAGCTTAAA GAACAACAAA GGACACAATA CCCTGTTCCT CCTGCCGTTC GTACTGGGTG TGATTGGTTT CTTCTATCAG TATAACAATC ACCGTAAAGA TACACTGGTC GCTTCCCTCC TGTTCTTCTT CACCGGTTTT GCCATCGTGC TCTACCTGAA CCAGGCGGGT AACCAGCCAC GTGAACGTGA CTATGCATAC GTAGGTTCCT TCTATGCATT CGCCATCTGG ATCGGTCTGG GTGTACTGTC TGTGGCTGAA TTCCTGAAGA AAAAGACTAA ATCAGCGATC TCCGCTCCGG CAGCTGCACT CGTTTGTCTG CTGGCAGTGC CTGTCCTGAT GGGCTTCCAG GAATGGGATG ACCACGACCG TTCTACCAAA ACCATCGCCC GTGATGTTGC TGCTGACTAC CTGAACTCCT GTGCTGAAAA TGCAATCCTG TTCACCGTCG GCGATAACGA TACCTACCCG CTGTGGTATG CACAGGAAGT AGAAGGTATC CGTCCTGATG TTCGTGTGAT CAACCTCAGC CTCCTCGGTG TAGACTGGTA TATCGATCAG CAACGTCATA TGGTTAACAA GAGCGCAGGC GTTCCAATGT CCTGGACTCC TGATAAATAC CAGGGCGAAA ACCGCAACTA CATCCAATAC TATGATGGCG GTAGCTTCCC GCAGGATAAA TTCTATAACC TGAGAGAAGT AATGGCATTC ATGGGTTCTG ATGATCCTCG TGCTAAACTG TCTACTACCG ACGGTTCACA GATCAACTAC CTGCCTGCTA AAAAACTCTT CGTACCGGTA AACGTAGCAG AAGTGCTGAA AAACGGTACC GTTGATATCC ACGATAGCGC ACGTGTAATG CCACAGCTGC CATTCCAGAT CAGCAAATCT TACCTGCTGA AAAATGACCT GGCGGTATAC GACATCATCG CCGCTAACGA CTGGAAACGT CCTATCTACT TCACCAGCCC AACCGATCTC GGTCTGAACG ACTACCTCCG TCCGGATGGT CTGACCTACC GCCTCGTGCC GCTGGCTAAG ACAGAAAGCA ACGATCCGAT GGGTGCCGAC AATAACGTTA ACATCCCTGT GATGTACAAA AACCTGATGG AAAAATTCGC TTTCGGTGGT GCTAACGTTC CAGGTACTTA CTTCGACGAA CCTAACCGTA AACTGCTGCA ATACCTGCGT AACGCTTACA CTAAACTGGG TACCGCTATG GCCCTGGCCG GTGACAAAGA TTCCGCACTC GCAGTACTGA ATAAGAGCGA TAAAAACCTC CTGCAGGGTA ACTTCCCTTA CGCAATGACC ACGCCAGGAC AGATGCACAA CTACAGCTCT ATGCAGACTG TATACGCTTA CTACCTGGCT GGCGATGCGA AGAAAGCAGA CGAGATCTCT CAGCTGATCA TCAAAGATTG TACACAGCAG TTACAATACT ATCGCTGCCT GCCACCTTCT AAAATGAATG GTTTGCAGCG TGATATGCAG ATGGCTGAAC AGTTCATCAC CCTGCTGCAG CGTATGAAGG AAGATTTTAC ACATCCTGAA CGCCGCCAGT CACTGGAGCA ACCGGGTGGT GTCAATATTG ACACTGTTGA GCCGGAAGCT GATGGTGCAC AAACAAAATA A
|
Protein sequence | MNFKRTNNIV GWVVCIIACT VYIMTMEATG SLWDCGEFIS SAYKVQVPHP PGAPLFVLLG RLFTIPFPPS QAAIGVNLMS ALSSGFTILF LFWTITHFAR RLMVKAGEVI SSEKMIAIMG AGTVGALAYT FSDSFWFSAV EGEVYAMSSL FTAIVFWAIL KWEHESDEAY ADRWIVFIAF TMGLSIGVHL LNLLTIPAIV MVYYFKRSPK VTPIGTFWAF IIGCAITGLV QKFVIQDTVK ASGLMDVFFV NSLGLPFYSG FAFYFLALAA VLLYGLKNPK FGLYAPLILI ASVIVIPAFN DASGAGIKIL KLIISAAIVF IPYLVKLFDV KIESSSFTHA IKVTIYSIIF LLLGYSTYIT TMIRSTANPS VDMYNVDNPI SLVGYLGREQ YGDFPLIYGQ VFTARPTSYE DAGNIYARGK DKYEIAGKKQ VPVYAAEDKM LFPRVWDASN DQGHADYYRD WLGLDANARP SFKDNVSFFV TYQVYFMYFR YFMWNFSGKQ NDTQGYGNKR DGNWITGISF IDNIMYGDQS MMPDSLKNNK GHNTLFLLPF VLGVIGFFYQ YNNHRKDTLV ASLLFFFTGF AIVLYLNQAG NQPRERDYAY VGSFYAFAIW IGLGVLSVAE FLKKKTKSAI SAPAAALVCL LAVPVLMGFQ EWDDHDRSTK TIARDVAADY LNSCAENAIL FTVGDNDTYP LWYAQEVEGI RPDVRVINLS LLGVDWYIDQ QRHMVNKSAG VPMSWTPDKY QGENRNYIQY YDGGSFPQDK FYNLREVMAF MGSDDPRAKL STTDGSQINY LPAKKLFVPV NVAEVLKNGT VDIHDSARVM PQLPFQISKS YLLKNDLAVY DIIAANDWKR PIYFTSPTDL GLNDYLRPDG LTYRLVPLAK TESNDPMGAD NNVNIPVMYK NLMEKFAFGG ANVPGTYFDE PNRKLLQYLR NAYTKLGTAM ALAGDKDSAL AVLNKSDKNL LQGNFPYAMT TPGQMHNYSS MQTVYAYYLA GDAKKADEIS QLIIKDCTQQ LQYYRCLPPS KMNGLQRDMQ MAEQFITLLQ RMKEDFTHPE RRQSLEQPGG VNIDTVEPEA DGAQTK
|
| |