Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_1492 |
Symbol | |
ID | 6123170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010508 |
Strand | - |
Start bp | 1655819 |
End bp | 1657507 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641638070 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001764789 |
Protein GI | 170732842 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000276986 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000188028 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACATTA AGGTTATCCC TCATGGCGTG TTCCGGACGA CTCTGATCGC GGCCACCGTT GCAGCCATGC TTTCCCTCTC CGCGTGCGGC GGCTCCGGCT CCATCAGCCA GGGGCTCGGC GGCGGCGGTT CGAGCTCCGG CGGCGGCGAT ACGATCTCCA CGTCGGGCGG CAACGGCTCG AGCGGCACCT CGGGTACGAG CGGTACCTCC GGCACCAGCG GTACTTCGGG CACGAGCGGC ACGTCCGGTA CGTCCGGCAC CAGCGGAACC TCTGGTACGA GCGGCACGTC GGGCACCAGC GGCACCTCGG GCTCGAGCGG GACGTCCGGC ACCTCGGGTG TGTCGGCGAA TCCAGTGGGA AATGTCCTCG CACAAGGCAG CAATATCATC ACGAATGCCG GCGACACCGT CTCCGGTCTC GGCTCCGTCA TCGGCAGCCA GACGCTGCCG GGCGTCAATC CGGCCACCAC GCAAGCCGCG GGCGGCATCG TGCAAAGCGT CGGCGGCGCG GTCACCGCAC TGGGCAACGG CCTCGGCAAC GGGCTCGGCC AGCTCGGCGC GACGAAGGAC CCGGTCGGCA CCACGGTCGC CAGCACCGGC GGCGTGGTCA ATCAGCTCGG CGGCGCAGTC ACGCAAACGG GCAACCTGGT CACGAGCCTC GGCAGCGGCC CGCTGTCGCC GCTCGCGCCG GTCACCGGCG CCGTCGGCGG CCTCGTCACG ACGCTCGGCG GCGCCGTGTC GAACGGCGGC ACCACGCTCA CCAATGTGCT GTCGACCGGC CCCATCCAGC AGGTCACGCA AACGGTCAGC TCGGCCATCA CGCCGATCAC GACGATGGTC GGCCAGACGA CCCAGACGAT CGGCACCGCA ACCGGCCTCG GCGCACCGGT CAACACGCTG CTCGGCACGG TCGGCAACGG GCTGAACCAG GCCGGCGCCC TGCTCGCATC GACGGGCGGC AACCCGGTCA CCACGGGTCT CGGCAACACC GTCTCGTCGA CGGGCAACAC CGTGAAGGCC GTCGGCGGCC TGCTCACGGG CAGCACCGGC GGCGTGACCA ACCCGCTCGC CCCGCTCACG GGCCTCGTCT CGACGGTCAC CGGCGCACTC GGTGGCGCGA CGGGTGGCGG CAGCGGCCCG CTCGCACCGG TCACAGGTCT TGTCTCCACG GTCACCGGCG CGCTCGGCGG CGCAGCAGGC GGCGGCAGCG GCCCGCTCGC TCCGGTCACG GGTCTCGTCT CCACGGTCAC CGGCGCACTC GGCGGCGTAG CAGGCGGCGG CAGCGGTCCG CTCGCTCCGG TCACGGGTCT CGTCTCCACG GTCACCGGCG CACTCGGCGG CGCAGCAGGC GGCGGCAGCA GCCCGCTCGC ACCGGTTACG GGCCTCGTCT CCACGGTCAC CGGCGCGCTC GGCGGCGCAA CGGGCGGTAC CAGCGGCGGC CCGCTCGCGC CGGTCACGGG CCTCCTCGGC GCAGTCACGG GCGCCCTCGG CGGCGCGACG GGGGGCGCAG GCGGCAGCAG CCCGCTCGCC CCGGTGACGA ACGCCGTCTC CACGGTGACG AGCACGGTCA GCGTACCGGC ACTGAGCGGC GGCACGGGTG CCGTCACGAA CTCGGGCGCG TCGTCGAATC CGCTTGCGCC CGTCACGTCG CTGATCGGCG GCCTGCTCGG CGGCACGCAC GGCAAGTAA
|
Protein sequence | MDIKVIPHGV FRTTLIAATV AAMLSLSACG GSGSISQGLG GGGSSSGGGD TISTSGGNGS SGTSGTSGTS GTSGTSGTSG TSGTSGTSGT SGTSGTSGTS GTSGSSGTSG TSGVSANPVG NVLAQGSNII TNAGDTVSGL GSVIGSQTLP GVNPATTQAA GGIVQSVGGA VTALGNGLGN GLGQLGATKD PVGTTVASTG GVVNQLGGAV TQTGNLVTSL GSGPLSPLAP VTGAVGGLVT TLGGAVSNGG TTLTNVLSTG PIQQVTQTVS SAITPITTMV GQTTQTIGTA TGLGAPVNTL LGTVGNGLNQ AGALLASTGG NPVTTGLGNT VSSTGNTVKA VGGLLTGSTG GVTNPLAPLT GLVSTVTGAL GGATGGGSGP LAPVTGLVST VTGALGGAAG GGSGPLAPVT GLVSTVTGAL GGVAGGGSGP LAPVTGLVST VTGALGGAAG GGSSPLAPVT GLVSTVTGAL GGATGGTSGG PLAPVTGLLG AVTGALGGAT GGAGGSSPLA PVTNAVSTVT STVSVPALSG GTGAVTNSGA SSNPLAPVTS LIGGLLGGTH GK
|
| |