Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_3403 |
Symbol | |
ID | 6244964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010623 |
Strand | - |
Start bp | 335679 |
End bp | 337427 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642595191 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001859603 |
Protein GI | 186472261 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000154784 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACGC CGACCGTCGA GATCATCGAT GTCGTGGCGA TCGACGCCGA ACGCATGACG GTCGCCGTTA CCGTCGATAT CGAGCCCGAG GAAGTCGACA TCACTGACGT GGAAGCGGCG AATGCCATCC TGATCGAGAT CATCGAGCAG TCCAACCGTT CAGGGCCGCC CGGCCCGATA GGTCCGCAGG GTCCGGCCGG TCCGCAAGGT GTCCAGGGCA TTGCTGGCAA CGATGGCAAG GATGGCGCGA CCGGCCCCGC CGGCGCGAGA GGTCCTGCAG GCCCGCAAGG CGACGCGGGT CCGCAGGGTG TCGCCGGTCC TGCTGGCACA ACGGGTCCGC AGGGTGCGAC GGGGCCGGCC GGACCCAGGG GCGATCCCGG CGTGGCCGGT CCGCAAGGCG CACAGGGTCT CAAGGGCGAT CAGGGTCCGA AAGGCGATGC AGGTCCGCAA GGTCCGGCCG GTGCGACGGG CTCGCAGGGT CCGGCCGGTG CGATGGGTCT CACGGGTCCG CAAGGAGCGA AAGGCGATCC CGGTCCCGTG GGTCCGATGG GTCCCGAAGG CGACACGGGT CCGGCTGGCG ATCAGGGTCC GGAGGGAGAC ACAGGCGCGC AGGGTCCGCA GGGTGCAACG GGTTCACAGG GACCCGCTGG TCCGGTCGGG CAGACTGGCC CGCAGGGTGT CAAGGGCGAC ACCGGCGCGA CCGGCGAGCA GGGTCCGAAA GGCGACACGG GTGCGCAAGG TCCGCAAGGC CCGGTTGGTC CAGCCGCCGA TACTTCGACC TTCGTGCAGA AGTCCGGCGA CACGATGACG GGGCAATTGC GGATTGCGCC GGCGTCGGGT GACGCGTCTC TATTGCTCAA TGCGCAGGGT TCAAACCTGC CCCGTATCTA TTTTCAGAAA GGCGGGGCTG GCCCCAGCAT GCTCTACGAT GGTACGTCGA TAGGGTTTGC AAACGCCACG CTCAACGCGT GGAATATGGC TATTAACGAC TCTGGCTCTG TCTCGTTTCG CAACACGGTC AATATCAACA ACGGAACCGA CCTCTGGTTA AGCGCACAGT CCGGCAGCTA TAGCGGCAGG CTCATGCTAA ACGCGAACGG CTACGCGCCG TTCCTTCGAT CCAACGGGGC CAACGGCAAT ATTGAAGTTG TCAATTCGGC CAACAGCGCC GTTAACTTTA CGATCTGGGA TGACGGGCAC GTAACCGCAC GCGGTAACGT CTACGCGTCG AGTGCTTTCC TGCAAACCAA TGGCGACGTT AACGGCTCGC AATGGGTAGG CGGCTGGTTA AGTGCAGGCA TTAACAATAT CTGCATGACG CGCACCACGG GCAACACGTA CAAAATCGGT TGGGATGGCG GCGCAGGCAA CCTCAACTTC TATGTCGACG GCACATTTAT AGCGTATCTG CACAGCAACG CCTCGGACTC CGCGCTTAAA GCCAACGTCG AGGAAGTTAC GCCCGACTCG CTTACCCTCA TTGAACAGAT AGACTTTTGC AGTTACGACA TTGCCGGGCG GCATGTCGAC ATGGGCATCA TTGCGCAGCA GCTACAGACC ATCACGCCGC GTTGGGCGTA TAAACCGCCG ACGCCTGATA CGCCCGAACC GTACTACGGC GACCCGGAAC CGCAACCCTA TACCCCTTCG CCTCTGGCGT ATGACCGCGA CGCGCTGCTG TTCGATGCGT TGCGCGCCGT GCAGCAGCTA TCGGCACGCG TGACACAGCT TGAGGCGATG CGGCCATGA
|
Protein sequence | MSTPTVEIID VVAIDAERMT VAVTVDIEPE EVDITDVEAA NAILIEIIEQ SNRSGPPGPI GPQGPAGPQG VQGIAGNDGK DGATGPAGAR GPAGPQGDAG PQGVAGPAGT TGPQGATGPA GPRGDPGVAG PQGAQGLKGD QGPKGDAGPQ GPAGATGSQG PAGAMGLTGP QGAKGDPGPV GPMGPEGDTG PAGDQGPEGD TGAQGPQGAT GSQGPAGPVG QTGPQGVKGD TGATGEQGPK GDTGAQGPQG PVGPAADTST FVQKSGDTMT GQLRIAPASG DASLLLNAQG SNLPRIYFQK GGAGPSMLYD GTSIGFANAT LNAWNMAIND SGSVSFRNTV NINNGTDLWL SAQSGSYSGR LMLNANGYAP FLRSNGANGN IEVVNSANSA VNFTIWDDGH VTARGNVYAS SAFLQTNGDV NGSQWVGGWL SAGINNICMT RTTGNTYKIG WDGGAGNLNF YVDGTFIAYL HSNASDSALK ANVEEVTPDS LTLIEQIDFC SYDIAGRHVD MGIIAQQLQT ITPRWAYKPP TPDTPEPYYG DPEPQPYTPS PLAYDRDALL FDALRAVQQL SARVTQLEAM RP
|
| |