Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_4702 |
Symbol | |
ID | 4094226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | + |
Start bp | 1971197 |
End bp | 1972507 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 638017989 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_624555 |
Protein GI | 107027044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGGCGC TCGGTACGAC GGTCGTAAAC GGCGGCGGTC AGATCGGCGG CGTGCAGATT CCCGGCACGA ACCCGACGAC GGCCACCAGC ATCGGCAACG CGGTCGGCAG CCTCGGCAAC GGCGTGCAAT CGCTCGGCAA CGGCATCGCG GCCGGCCTGG GTTCGATCGG CGTGTCGGCG AACCCGCTCG GGCCGACGAT CACGTCGACC ACCGGCCTGC TGACCGGCGC CGGCGGCGCG GTCAACAACC TCGGCAATGC CGTGACGAGC CTCGGCACCG GTCCGCTGTC GCCGCTCGCG CCCGCGACGA CCCTGGTCGG CAGCCTCGTC AACACGGTCG GCACCGCGGT CAACTCGACC GCATCGGCGC TGAACACGGC ACTGAACAGC TCGCCGGTCC AGCAGCTCGA AACGCAGCTC GGCAAGGTGA TCAACCCGAT CACGAACACG TTGACCGGCG GCGTCACGAC GCCGGGCGCC ACGCAGACGC TCGGCGGCGT GACGCTGCTC GGCACGCCGC TCAACGGCCT GCTGAGCACG CTCGGCAGCG GCCTCGGGCT CGCGGGCACG AAGGTCGGCG GCGCGACCGA CAACCCGGTC GGCGCGGGCC TCGGCGGCGT CGTGACGCAG CTCGGCAACA CGGTGACGTC GACGGGCGGC CTCGTCCACG ACAACAACGC GGGCAGCTCC AGCAGCGGCA CGGGCGGCAG CAATCCGCTC GCGCCGATCA CGGGCCTGCT GGGCACGTTG ACGGGCGGCC TCGGCGGCGG CAGCTCGAGC GGTTCGGGCG GCACCAGCGG CACCAGCAGC GGCGGCCCGC TCGCACCGAT CACCGGCCTG CTCGGGACGG TGACGGGCGC ACTCGGCGGC ATCGGTTCGA GCGGCACCAG CGGCACGGGC GGAACGAGCG GGACCGGCGG CACCAGCGGT ACGGGTGGCG CAGGCCTCGG CGGCCTGCTG GCGCCGGTCA CCAATCTCGT CAACTCGCTG ACGCCGCTCG GCGCGAGCCT CACCGGCACC GTCACGACGC CGGGCGGCAA CCTGTCCGGC ACGCTCGGCG GGGTGCTGAC GAGCGGCCCG GTCGGCACGC TCACGGGTGC GCTGGGCACG CCGGCCGGCT CGGCCGGCGC CACCGGCACG GTCAGCCCCG GCGGCGCAGC AGGCACCGTG ACGACGCCGG GCGGCAGCGG CTCCGTGGTG ACCGGCCTGA CCGGCAGCAC GGGCGGCGCC GCCGCGGGCG GCACCGGCAA CCTGCTGTCG CCGGTGACCA ACCTGCTCGG CGGTCTGCTG GGCGCCGGCA CCAAGAAGTA A
|
Protein sequence | MTALGTTVVN GGGQIGGVQI PGTNPTTATS IGNAVGSLGN GVQSLGNGIA AGLGSIGVSA NPLGPTITST TGLLTGAGGA VNNLGNAVTS LGTGPLSPLA PATTLVGSLV NTVGTAVNST ASALNTALNS SPVQQLETQL GKVINPITNT LTGGVTTPGA TQTLGGVTLL GTPLNGLLST LGSGLGLAGT KVGGATDNPV GAGLGGVVTQ LGNTVTSTGG LVHDNNAGSS SSGTGGSNPL APITGLLGTL TGGLGGGSSS GSGGTSGTSS GGPLAPITGL LGTVTGALGG IGSSGTSGTG GTSGTGGTSG TGGAGLGGLL APVTNLVNSL TPLGASLTGT VTTPGGNLSG TLGGVLTSGP VGTLTGALGT PAGSAGATGT VSPGGAAGTV TTPGGSGSVV TGLTGSTGGA AAGGTGNLLS PVTNLLGGLL GAGTKK
|
| |