Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_1052 |
Symbol | |
ID | 8357166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 1259652 |
End bp | 1262396 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644963206 |
Product | surface antigen variable number repeat protein |
Protein accession | YP_003120751 |
Protein GI | 256420098 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000329593 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TATTTCCTAA GAGCCTACTG GCCATAGTAT TATGTTGCAG TGCCGGCATT CGTGTGTCCG CCCAGATCAG GGACACTAGT GTTACGCCGG TTCCAACACC TGACCAGCAA GCGGCGGGAT TGAATTTATC AGGTCCGTTA CAGGCACAGC CTTACGAAAT TGCTGACATT ACCATTGTAG GGACGCAATA CCTGGATAAA TCTCTGCTGA TTTCCCTGTC AGGCCTCAAT GTCGGTGATA AAGTGGTTTA TCCCGGCGGT GACCAGTTCG CAAAGGCCAT CCAGAGTCTC TGGGGCCAGC GTTTGTTTGC CAACGTTGCT ATTTATGTTA CCAAAATAGA AGACGGGAAG ATTTGGCTGG AAATAGAACT CCAGGAACGC CCTCGTCTGA ATAACTTCGT TTTCAGAGGT GTTAAAAAGT CTGAAACAGA AGAACTGATA AAAAAAGCCG GCCTGCGTAA AGGATCCGTT GTAACCGAGA GCATGAAACA GAATGCAATC GGCGTGATCT CTAAACACTA TGGCGATAAA GGCTTCCGCA ATGCTACTGT TAACATTACC GAAAGAACAG ATACCTCCCA GGTAAATGCT TCCGACCTGG TAATTACCGT AGTAAAAGGT GGTAAAGCAA AAGTGAACGA TATCCAGATC GTGGGTAACG ACAATATTGA CGATACAAGA GTGAAGAAGA AAATGAAAGG TACCAAAGAG CGTACCCGCT TCACTTTATA TCCGGATATC GAGTCGGTAT ATGAAGATTC AACCGAACTG CAGGAAAACT ACTGGAAAAC CTTCGGTTTC CTGTATCCTT CCCGCACTAT GGAACAACTG GATCCGTATT TCCGTTTCAA ATTATTCTCT TCTGCCAAGT TCAATGAGAA CAAATACGCA GAAGACAAAG AGAAAGTAAT CGCTTATTAT AACACACAGG GTTACCGTGA CGCCGTTCTG GTGAGAGATA CTACTTATCG TGCGCACAAT GGTGGTGTAA ACGTAGCCAT GCAGATCAGC GAGGGTAAGA AATATTACTT CGGTAACATT ACCTGGAAAG GTAACAGCCG TTATAACGAC TCCCTGCTGA CCCGTGTACT CGGTATCAGA AAAGGAGATA CCTACAACCA GGAACTGCTG CAGAAACGCT TATTGTCTTC TGAAGGTGGC GACATCGGTG GTATGTACAT GGACTTCGGT TACCTGTTCT TCCGTGCTGA CCCTGTAGAG GTGGCGATCC ATGGCGATAC CATCGACTAT GAGATCCGTA TCTCTGAAGG TCCGCAGGCA ACCATCAAAG AAGTACGTAT TGCCGGTAAC GAAAAAACCA ACGAACACGT AATCCGTCGT GAGTTGCGTA CGCTGCCTGG TGAGAAATTC AGCCGTACTG ACCTGATCCG TTCCAACCGC GAGATCGCGA ACCTCGGTTT CTTCAACCCT GAGAAGATCG GTATGGACCC TATCCCGAAC ATCCAGGATG GTACCGTGGA TATCAACTAC ACTGTAGAAG AGAAAGCGAA TGACCAGCTG GAATTATCTG CAGGATGGGG TGGTTACATC GGTCTGACCG GTACTTTGGG TGTGACCTTC AACAACTTCT CCCTGCGTAA CATTTTAAGA AAGGAAACAT GGGATCCATT ACCAAGTGGT GACGGCCAGA AATTATCTGT GCGTGTATCA TCCAACGGTA AAGCCTATCG CTCCTATAAC TTCTCATTTA CCGAGCCATG GTTAGGTGGT AAAAAGCGTA ACCAGTTCTC TGTCAGCTTC TACAGCAGCT ACCAGAACCC GAACGCTTAT TCTGCTTATA TATATGGTAG CACCCTGTCT AACAATGCTT ACTTCAAAGT ATTGGGTGGT TCCGTATCCC TCGGTAAACA GCTGAAATGG CCTGATGACT TCTTCACGCT GATCTACTCC CTGAACTATC AGCAGTATAA ACTGAAGAAC TATAACTATT TCAATATTCC TGGTTTCTCC AGCGGTACTT CCAATAACGT CAACATTAAA CTGACCCTCG CCCGTTCATC TGTGAACCAG CAGATCTATC CGAGCAGTGG TTCTAACTTC CTGTTGTCCG GACAGTTCAC GCCTCCTTAC TCCATATTTA ATCCTAACAG GGATTACAAA CTGGAGTCTA TTCAGGATCA GTTTAACCTG ATTGAATACC AGAAATATCG TTTCAACGCG GAGTGGTACG TTCCATTGAG CAGACCTAAA GGTTCTGATA ACAAAGTGTT CGTACTGAAA GTAGCCGCTA AATTCGGTTA CATCGGCCGC TACAACAACC GCACAACATT GTCTCCATTT GGTCGTTTTG AGCTGGGTGG CGATGGTCTG AGTAACTTCG CGATCTATGA CCGTGATATC ATCTCCCAGA GAGGTTATCC GGTATACTAT ACCTCGGATC CTAGGATGAA CTCAGAAACC GGACAACCAA CAGGTTATGA AGGTTTCACC GTGTTCAACA AATATGTGAT GGAGTTACGT TATCCGTTCA GTCTGAACCC AAGCTCTACC ATCTTTGGTC TGGCGTTCGT GGAGGCAGCC AATGGTTACC GCGATGTAAG AGACTTCAAT CCGTTCAGAT TACGTCGTTC AGCAGGTCTC GGTATGCGTT TCTACCTGCC GATGTTTGGT CTGCTTGGAT TTGACTATGG TATCGGTTTC GACCGTCTGC AGTCCGGCAA TGGTCTGAAA GATGCTGCTA AGTTCACCTT CATGCTGGGC TTCGAACCGG AATAA
|
Protein sequence | MKKLFPKSLL AIVLCCSAGI RVSAQIRDTS VTPVPTPDQQ AAGLNLSGPL QAQPYEIADI TIVGTQYLDK SLLISLSGLN VGDKVVYPGG DQFAKAIQSL WGQRLFANVA IYVTKIEDGK IWLEIELQER PRLNNFVFRG VKKSETEELI KKAGLRKGSV VTESMKQNAI GVISKHYGDK GFRNATVNIT ERTDTSQVNA SDLVITVVKG GKAKVNDIQI VGNDNIDDTR VKKKMKGTKE RTRFTLYPDI ESVYEDSTEL QENYWKTFGF LYPSRTMEQL DPYFRFKLFS SAKFNENKYA EDKEKVIAYY NTQGYRDAVL VRDTTYRAHN GGVNVAMQIS EGKKYYFGNI TWKGNSRYND SLLTRVLGIR KGDTYNQELL QKRLLSSEGG DIGGMYMDFG YLFFRADPVE VAIHGDTIDY EIRISEGPQA TIKEVRIAGN EKTNEHVIRR ELRTLPGEKF SRTDLIRSNR EIANLGFFNP EKIGMDPIPN IQDGTVDINY TVEEKANDQL ELSAGWGGYI GLTGTLGVTF NNFSLRNILR KETWDPLPSG DGQKLSVRVS SNGKAYRSYN FSFTEPWLGG KKRNQFSVSF YSSYQNPNAY SAYIYGSTLS NNAYFKVLGG SVSLGKQLKW PDDFFTLIYS LNYQQYKLKN YNYFNIPGFS SGTSNNVNIK LTLARSSVNQ QIYPSSGSNF LLSGQFTPPY SIFNPNRDYK LESIQDQFNL IEYQKYRFNA EWYVPLSRPK GSDNKVFVLK VAAKFGYIGR YNNRTTLSPF GRFELGGDGL SNFAIYDRDI ISQRGYPVYY TSDPRMNSET GQPTGYEGFT VFNKYVMELR YPFSLNPSST IFGLAFVEAA NGYRDVRDFN PFRLRRSAGL GMRFYLPMFG LLGFDYGIGF DRLQSGNGLK DAAKFTFMLG FEPE
|
| |