Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_13730 |
Symbol | |
ID | 8375578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | - |
Start bp | 1559043 |
End bp | 1561934 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644994289 |
Product | putative collagen-binding protein |
Protein accession | YP_003151730 |
Protein GI | 256827771 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 120 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAGAA AGTTATGTTG CCTCATAAGT GCTGTACTGG CATTCAGTAT CTGTATGCCT GCGGGAGCCT TGGAGGCGGT TGCCCTTGAT AACGAAGGAA GCACAAGCTC TGCGAGTGCA ACGCAGGAAA GTCCTGCCAC TTCTATAGAG AACAGTGAAA ACAGCCGTGC AGTTGTAAAT GCTGAGGCTC CCATGGCACA AGATGAGTGG CAAGCGAGTC AATCTGATAC AGCTAACGGA AGCAAACAGA CATCATCTAC GGGCACCGTT GACTTGACAA GGATGATCGA GCCGGTAGAT CTTAATACTT CTCACAAGGT GTCTAAGCGT GCTGCTTCTG GATCAAGTGG CGTTGAGCTT CCTAATGTTG TTACGAGTGC CAAAGTTACT ACCGCTACTG GCACGACACC GGTTAATGTG GGGGCGTGGC AAGCATTCAA GCTTAATTTC CATTATGCAC TTCCAAATCT TGGCGTGCAT GCGGGCGACA CAACTACGAT AGAGCTTCCA GCCGGGTTCA AAAGCGCTCC GCCAACTGAC TTTGTCATTC AAGACGGATC AGGCAATATT ATTGCGCGCG GTAAGACTGA CCCGACCAAC ACCAAGTTTA TTATTACATA TACGGACTAC GCTGAGGGTA AATCGGATAT CTCTGGAGAC TTCTCGGTTA ATGTTCAGAT CGATAATGAT GTACACACTC ATTCTGGCGT CTTACCAGTT AATCCGGTGA TAAGCGGTGA AACAGTTCCG GCTGGCAATG TCAACTACAC GGTACATACT GAAACAGCTG TGCCTATTAT TAAATCTGGC TGGGCTAACG CTTTCGATAC CACTAAGGGC GTTTGGCAGG TAAAGATTAA TCAAGATGGC AAGGCGTATA CGAATGCAGT ACTTGATGAC AGCCTTTTAA CTCCAGGCGT GTCGTTTATT CCTGGTACAC TTGAAGTATT CGAAGGAACC TGGCAGCTTC AAGGCACGAG CTATAAACTT GTTGGGCAGA CCAATGTTAC AAGTCAGTAT GCTTCAAAAA TTACTTATAA CGGAACAAAC TTTAAGCTGA ATCTGGGAAA TATCCCTGCC ACTAAAGGGC TATTGGTACG TTTCCAAACC AAGATTAACT ACACACCTCT TCCCGGTGAA AAGTTCGAAA ATAAAGCAAG TCTCACTGAT AACGGCGTAA CTAAAGAAAG TAAGGCCTAT TACCTTCTAC CAACCTCTGG TGGCACTGGT GAAGGTTACA AGTACAAGAT TAATATCAAA AAGACTGACG AATCAGGAAA TCCACTTGCC GGAGCAGTTT TTGATATTGT TCGTGCTCGT TCTGGCGCGG TTGTTGGCCA AGTAACAACA AATGCGTCTG GCGAAGCAAG CCTGGGCGGT CTTTTGCGTG ATGGTTATAT CATTAAGGAA ACTACTCCGC CTTCTGGATA TCTTGCTGCA GCTGATCAGA CGATTGCCGA TACTGATTTC TCAACAGCCA CTCAAGACGT CACGCGCACG TTTGTCGATA AAGCCATTCC GCCTACTGTT AATGTCTCCG TTGAAAAGAC GTGGAGCGAT GCCGACAATC AAGACGGAAT GCGTCCGACT TCGGTGACGG TTCATTTATA TGCTGATGGT GTCGATACCG GTAAGACAGT AACGCTTGAT GGAAGCAATT CGTGGAAAGA TACTTTCGCG AGCCTCGATA AGAAGAATGC AGCTGGTAAC GATATCGTTT ATACCGTGGC TGAGGATCCA ACTCCGTCAG GGTATATCGC TGCTGTTACT GGTTCTGCGG TCGCAGGCTT TACCATCACC AACACCCATA CCCCTGAAAC CATCAATATC CCAGTAACCA AGAAGTGGGT TGGGCCAGAA GGCTCATCAG TTACTGTCAA GCTTTTGGCT GATGGGGTAG ATAGTGGCAA GTCTGTCACG CTTTCCTCAG CTAATAGCTG GAGCGATACG TTTACTAACC TGCCTAAGTA CAAAAATGGT ACTGCTATTA CCTACACCGT TGATGAATCA TCAGTTACTG GTGTGGATGC AACCAAGTAC ACAACAGCTA TAAGCGGCAG TGCTACAGCG GGATACACTA TCACTAACAC CAATAAAGAA AAGATTGACA TCTCAGGAAC AAAGACCTGG AATGATGATG GCAATCGTGA TGGAGCGCGT CCGTCTTCTA TCACCATCAA CCTTCTGGCC GATGGCACCC AAGTAGACTC AAAGGCAGTC ACCCCGGATG CATCAGGTGC ATGGAGCTAT AGCTTTGCTG GTCTTGCTAA GTATTCTGCA ACCGATGGTC ACCAGATTGC CTACACGATC ACTGAGAATG CTGTTGCTGA TTATTCAACA ACCATTACTG ATTATGATGT CACAAACACT CATACACCAG CTCAAACATC GCTGACAGTT ACTAAAGCAT GGAGTGATGA TAATGACCGC GATGGCGTGC GTCCTTCCTC TGTGGAAGTA GTGCTCTATG CCAATGGAGT AGCAAAGGGA ACACCTGTTA CTCTCAATGC TGCCAATAAT TGGTCATATA CGTGGACTGG CCTTGACCAG AAGGACAATG GCACCAACAT TGTCTACACG GTCGATGAGC CCACTGTTCC CACTGGATAC ACCAAAGAGG TAACGGGGGA TGCCACCAGT GGCTTTACCA TCACCAACAC CCATACCCCC ACTCCTCCAG AGCCGGGCCC AAATCCTGCT CCCGAGCCCG ATCCAACACC AGCACCGAGC CCAGATCCTG ATCCAGACCC CAATGGCAAA GGACCAGCTT CCATCCTTCC CAAAACCGCT GATGAAGGAA CTCTGTTTGC TGGAGCTGCT GGTTTAGCTA TTCTCTCTGC TGTCGGTGGA GCGATTGCTG TGACTGCTCG CCGTCGCGAA GAGCAGGATT AG
|
Protein sequence | MKRKLCCLIS AVLAFSICMP AGALEAVALD NEGSTSSASA TQESPATSIE NSENSRAVVN AEAPMAQDEW QASQSDTANG SKQTSSTGTV DLTRMIEPVD LNTSHKVSKR AASGSSGVEL PNVVTSAKVT TATGTTPVNV GAWQAFKLNF HYALPNLGVH AGDTTTIELP AGFKSAPPTD FVIQDGSGNI IARGKTDPTN TKFIITYTDY AEGKSDISGD FSVNVQIDND VHTHSGVLPV NPVISGETVP AGNVNYTVHT ETAVPIIKSG WANAFDTTKG VWQVKINQDG KAYTNAVLDD SLLTPGVSFI PGTLEVFEGT WQLQGTSYKL VGQTNVTSQY ASKITYNGTN FKLNLGNIPA TKGLLVRFQT KINYTPLPGE KFENKASLTD NGVTKESKAY YLLPTSGGTG EGYKYKINIK KTDESGNPLA GAVFDIVRAR SGAVVGQVTT NASGEASLGG LLRDGYIIKE TTPPSGYLAA ADQTIADTDF STATQDVTRT FVDKAIPPTV NVSVEKTWSD ADNQDGMRPT SVTVHLYADG VDTGKTVTLD GSNSWKDTFA SLDKKNAAGN DIVYTVAEDP TPSGYIAAVT GSAVAGFTIT NTHTPETINI PVTKKWVGPE GSSVTVKLLA DGVDSGKSVT LSSANSWSDT FTNLPKYKNG TAITYTVDES SVTGVDATKY TTAISGSATA GYTITNTNKE KIDISGTKTW NDDGNRDGAR PSSITINLLA DGTQVDSKAV TPDASGAWSY SFAGLAKYSA TDGHQIAYTI TENAVADYST TITDYDVTNT HTPAQTSLTV TKAWSDDNDR DGVRPSSVEV VLYANGVAKG TPVTLNAANN WSYTWTGLDQ KDNGTNIVYT VDEPTVPTGY TKEVTGDATS GFTITNTHTP TPPEPGPNPA PEPDPTPAPS PDPDPDPNGK GPASILPKTA DEGTLFAGAA GLAILSAVGG AIAVTARRRE EQD
|
| |