Gene Information |
Locus tag | BamMC406_3565 |
Symbol | |
ID | 6180174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | - |
Start bp | 499965 |
End bp | 501254 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641683334 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001810248 |
Protein GI | 172062597 |
COG category | |
COG ID | |
TIGRFAM ID | |
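The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the convention that reproduces the listed 1290 bp) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 499965
END_BP = 501254


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1290 429
```

Because the gene lies on the minus strand, the coding sequence reads from the end coordinate back toward the start coordinate; the length calculation is unaffected by strand.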
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.274791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0384386 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCACGA CGGTCGCGAA CGGCGGTAGC CAGATCGGCG GCGTGCAGGT CCCCGGCACG AGCCCGACGA CGGCCACCAG CATCGGCAAC GCCGTCACGA GCCTCGGCAT CGGCGTGCAG TCGCTCGGCA GCGGCGTCGC GGCCGGCCTC GGGTCGATCG GCGTGTCGCC GAACCCGCTC GGCCCGACGC TCACGTCGAC CACCGGCCTG CTGACGGGCG CCGGCGGCGC GGTCAACAAC CTCGGCAACG CGGTGACGAG CCTCGGCAGC GGCCCGCTGT CGCCGCTCGC GCCGGCCACG ACCCTGGTCG GCGATCTGGT CAACACGGTC GGCACCGCGG TCAACTCGAC CGCATCGGCG CTGAACACGG CGCTGAACAG CGCGCCGGTC CAGCAACTCG AGACGCAGCT CGGCAGCGTG ATCAACCCGA TCACGAACAC GCTGACCGGC GGCGCCACGA CGCCGGGCGC CACGCAGACG CTCGGCAACG CAACCCTCCT CGGCGCCCCG CTCAGCGGCC TGCTGAGCAC GCTCGGCAGC GGCCTCGGCC TCGCCGGCTC GCAGGTCGGC AGCGCGACCG GCAACCCGGT CGGCACGAGC GCCGGCGGCA GCGTGTCCCA GCTCGGCAAC ACGGTGGCGT CGGCCGGCGG CCTGCTGTCA AGCGGCAGCT CCAGCAGCGG CACCAACCCG CTCGCACCGA TCACCGGTCT GCTCGGTACG CTGACGGGTG GCCTCGGCGG CGGCAGCAGC TCGAGCGGGT CGGGCGGCAC CAGCGGAACG AGCGGCACGA GTGGTACCAG CGGCGGCCCG CTCGCACCGA TCACCGGCCT GCTCGGCACG GTGACGGGTG CGCTCGGCGG CCTCGGTTCG AGCGGAACGA GCGGAACGAG CGGAACGAGT GGCACCGGCG GCACCAGCGG CTCGGGTGTC GGAGGTCTGC TTGCGCCGGT CACCGGCCTC GTCAACTCGC TGACCCCGCT CGGCGCCAGC CTGACCGGCA CGGTCACGAC GCCGGGCGGC GGCCTGACGG GCACCCTCGG CGGTGCGCTG ACGAGCGGCC CCGTCGGCAC GCTGACGGGT TCGCTCACGT CGCCGGCCGG CGCGTTGGGC GCGCTCGGCA CGGCCAGCCC GAGCGGCGCG GCCGGCACGG TGACGACGCC GTCCGGCAGC AGCGTCGTCA CGACCGGCCT GACCGGCAGC TCGACCGGTG CCGCGGGCGG CACGAACAAC CTGCTCGCGC CGGTGACGAA CCTGCTCGGC GGCCTGCTCG GCGCCACCCC GAAGAAGTAA
Protein sequence | MGTTVANGGS QIGGVQVPGT SPTTATSIGN AVTSLGIGVQ SLGSGVAAGL GSIGVSPNPL GPTLTSTTGL LTGAGGAVNN LGNAVTSLGS GPLSPLAPAT TLVGDLVNTV GTAVNSTASA LNTALNSAPV QQLETQLGSV INPITNTLTG GATTPGATQT LGNATLLGAP LSGLLSTLGS GLGLAGSQVG SATGNPVGTS AGGSVSQLGN TVASAGGLLS SGSSSSGTNP LAPITGLLGT LTGGLGGGSS SSGSGGTSGT SGTSGTSGGP LAPITGLLGT VTGALGGLGS SGTSGTSGTS GTGGTSGSGV GGLLAPVTGL VNSLTPLGAS LTGTVTTPGG GLTGTLGGAL TSGPVGTLTG SLTSPAGALG ALGTASPSGA AGTVTTPSGS SVVTTGLTGS STGAAGGTNN LLAPVTNLLG GLLGATPKK
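The gene sequence begins with TTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The sketch below illustrates this with a hand-built codon table. It is not the pipeline that produced this record, and the function names are hypothetical. A small GC-fraction helper is included for comparison with the GC content field.

```python
# Elongation codons of table 11 match the standard code; the amino-acid
# string is in the conventional TCAG x TCAG x TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGGCACGA CGGTCGCGAA CGGCGGTAGC"))  # prints: MGTTVANGGS
```

Passing the full gene sequence to `translate_cds` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.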