Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_6591 |
Symbol | |
ID | 6278282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | - |
Start bp | 2889876 |
End bp | 2894666 |
Gene Length | 4791 bp |
Protein Length | 1596 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642617624 |
Product | alpha-2-macroglobulin domain protein |
Protein accession | YP_001890261 |
Protein GI | 187921229 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.509097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0153598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAA TGATTTCCGT CACACGATCA AGCGGTTGGT TGGGCAAGGC GCGCCGTCGC GCGGCATGCA CGTTCGCGGC GCTAGTCGCA GCGTTGACAC TGGTCGCCGC ACCCGCCGCT TATGCGGACG ACGAAGCGTC GAGCGCGGCC GCCGCCGATG CCAACATCGC CGGCAATCCG ACGCTGCAGA CGGCGCCGTC GAGCAACTTC AGCTCGCAGA AAGTGGACGG TCAGCCGTTC TTCCTGCTTT CGGATGCGAG CTTCGGCAGC GATCAACCGG CACAAGTGCG GCTCGAAGCG CCGGGCCGCG ATTTCAAGGA TGCATTGCAA GCGTACGGCG GCGCGGACAT CGTCGTGTAT CGCGTGCCGA AGCCGCTGGA CTTTCTCAAG GCGCAAAAGA ATCTGCATCG GCTGAATGTC GCGCCGAATT ATCAGGGCGA AGGGCTCGCG AATACGCTCG CGTATCTGTG GGACCGCTGG TTTACCGAAG CACGTCGCGC ATGGCAGCGG GTGCTGTCGT TCGCAACCCG CAGCAAGGCG ACCGAAGCGG CGCCGCAATT CAAGCTCGGC GAACAGACCG GCAAGCAAAC GCAGTTCGAG TCGAACTCGC AATTCGCGCC GCTCAAAGGC TACGACATGG TCGCGCGTTT TCGCTATCCG ATCTGGGACG CGAAAGTGAT CGAGCCGCCG AAGGGCGTGA ATCTCGCGGG CAGCAGCAGT AACTTTATCG AAGCCAGTTC GGGCAACGTG ATGATTCCGG TCGGCAAACT GCCGCCGGGG CTGTATATCG TCGAGGCGGT GATCGGCAAC TATCGCGCGC ATACGCTGCT GTTCGTGTCG GACACGGTGG CGGTCGTGAA GGCGGCGTCG AGCGGCATGT TGGTGTGGAC CACGCGGCGC GATAACGGCA AGCCCGTGGC GAATACGGAA GTGAACTGGA CCGACGGCGT GGGCGTTCTG CAAAGCGGCA CGACCGGCGG CGACGGCGCG CTGGTGCTGC AACACGTGAG CCCCGAGCGC AGCTATGTGC TCGGCGCCGA TCCGCAGGGC GGCGTGTTCG TCTCCGAGAA CTTCTACTAC GACAGCGAGA TCTACAACAC GAAGATCTAC GCAGTGACCG ATCGGCCGAT GTACCGGCCG GGCGATCCGG TGCATGTGAA GTTCATCGGC CGCACGTTCC AGAATGCGAC GCAATCGTCG GCGCCGGCCG AGGCCGATAT CAAGCTCGAC GTGCTCGATC CGAACGGCGC GCCGGTGGCG ACCAGCAAAG TGCATTTCGC CTCCGATACC GGCGCGGATA CCGCGTTCAC GCTGCCCGCC GACGCCACCG CCGGCGGCTA CACGCTGCGC TTCGACTACA TCGGCGATGT CTACGGCAGC GCGTTTCGCG TCGCCGAATA TGTGAAGCCG CACTTCGACG TGAACTTGTC GATGGATAAA GCCGACTACG CAACCGGCGA GGCGCTGAAG GGCAAGATTC AGTTGCGCTA TCCGGACGGC AAGCCGGTGC GCGACAGCAA GATTTCGGTC ACGCTGCGCG CGCAGAAAGT GACGATCGTC GACGGTGAAT TGCGTTATGC GGGACTGTTC CCGGTCAAGC TCGACCAGCA GGAACTGAAG ACAGATAGCG ACGGCAACGC GACCCTGACG CTGCCGGCCG CGAAGGAGCC GAGCCGCTAC GCGTTGACGC TGTTCGCACA GGACGGCGCG GCGTACAAGG TGCGCGTGAC GCGCGAAGTG CTGATCGCGC GCGGCGCCAC GCCGTATCGT TTGACCACGG CCAAATCGTT CTCGCAACCG AAGCAAGCAG TCAGCTTCGA TTTGCAGGCG CTAGGCGCGA TCGATCCTTC GTCACATGCG CCGTCGAAAT ACGAGTGGAC GCGTCTCGAA TCGCAAACGC ATGGCGAAGG CGCGCTAAAG GGCGCGAGCG CGGGCGATAA GTTGTCGTTC CCGGTGCAGT TCGACGAGCC GGGTTCGTAC ATGATGTCGG TGAAGGACGA CTTGGGCAAC CTGCTCGCCG CCACCAGCCA CTGGGTCGCG GGCGACGGCC TGAAGGCGAT TCCGGGCAGC GTCGAGATCG TGTTCGATCG CGACAAGTAC AAGATCGGCG ACACCGCAGA AGCATTGATC ACATTCCCGA TGCCGGTCGA CGACGCGCTG CTCACGCTCG AACGCGACAG CGTCGAGCGT CGCGCGTTGC TGACGGCGGG CGGCGACTGG CTGCAATTGC AACGCGTGGC GCCTTCCCAA TGGAAAGCGC GCATCAAGGT CGGCGCTGAC TTCGCGCCGA ACATGACGTT CTCTGTGTTG TACGTGCACG CGGGCGAATA TGTGTTCCAG AACGCGGGCA TCGTGGTCGC GCAACCGCAG ATCGAATTGA ACGTGAAAAG CGATAAGCCG GTGTACGGAC CGGGCGACAC GGTCACGCTG AATTTCGACA GCACGTTGAA CGGCAAGCCG GAAGCGGCGA ACCTGACTGT GAGCGTGGTC GACGAAATGG TCTACGTGTT GCAGCCGGAA ATCGCGCCGA ATATCGTCGA CTTTTTCTAT CATCCGCGCC GCAATAACGT GCGGACTTCG TCGAGCCTGT CGTTCATTAC CTACGACCTC GCGCGCTCGC CGTTGAAGGG CGCGCCGGGC GGACCGCAAC GCGCCAACTA CAACGAGCGC GGCGTGAAGG TGCTCGAACG GCCGCGCCGC GACGATCAGG ACACGGCAGC GTGGGAAGGC AATCTGAAAA CCGACGCGAG CGGCCACGCC ACCATGACGT TCAAGATGCC CGATTCGCTG GCGCGCTGGC GCATTACAGT GCGCGCGGCG GCGCCGGATG GCATGGTCGG TCAACGCACT GCTTATGTGC GCTCGGACAA GGCGTTGTAT CTGAAGTGGA GCGGGCCGTC GCACTTCCGC GTCAACGATC AGCCGGCCAT CGATATGATC GCGTTCAATC AGACCGACGC GGATATGGAC GCGCAGTGGG TGGTGGACGG CGGCGGGTTG TCGCTCAATC AGAAGGTCAC GCTCAAGCGC GGCGCGAACT ATCTGCGGCT GCCGGGCGGC GCGCTGAAGG ACGGCGTGAT CAACGCGTCG CTGCGCAGCG CGGGTAAGGA CGTCGATCGT CTGCAAACCA CGATTCATCT CGACGCGAGC GGCTGGCTCG ATCTGCATCA GAACACGGTG TCGCTCGACG GCTCGAACAA GCCGCTCAAT CTGCCGCTAG ACGCACAGGA TGTGAGCGTG CGTTTCATCG GCAACGCGCA GAGCCAGTTC ATGCGCGTCG CCGACGATCT GATCAACTAT CCGTACGGTT GCGCCGAGCA GACTTCGAGC CGGTTGATTC CGCTGGCGCT TGCGCACGAC GCGATCGGCC ACGGCGACAA TGCGAGTTCG ACGCAAGGGC TCGAAGCGCT GCTGCGCAAT CAGCGGCAGC GTCTGGCTAT GCTCGCGGGC GTCGGCGGTA CGTTCGGCTG GTGGGGCGAT ACCACCGGCG GCAGCGCGCT GATTACGGCG TACGCGTATT ACGCGGATTG GCTCGCGAGC CGCAGCCTCG GCATCAGCCT GCCCGCCGAC AACTGGCAGC ACGCCATGGA CGTGTATCGC GACGCCGGGG CCAAAGAGCC GCTGTTGCAT CGCGCGCTGG CTTTGTGGCT CATGCAGCAA ATGGGCCTGC CGGTCGCGAC GCCGGTCGCG GGCGTGGCCG CGGATCTGAT GAGCGATGCC GCAGCCACCG CCGCTAAAGC GCCCGCGCGC AGCGGCACGT ACGGTGCGTC GGACAGCATC GTGTTCGCGC AGGCGGATTC TCCGCGTGGC AAGCAGATGG CGACGCTGCT GATCTCGAGC ATGGCGCGCA GCACGAGCGC GCAAATTCCC GACGGATTCG ACGCCGCGGC GAGCGCGGCG CGCCTCACGC TGGTGAACGA TCCCGCGCCG CTGGTGCAAA GCCTCATCGC GATGACGGGC GACGGCGGAG CGAGCGCCGA CGCCCCGGCA TTACTGGCAA AAAGCAGCGC CGACTATCCG ACGCTCGATC GCGCCTTGAC GCTGCTGTGG CTGCGCAAGT CGCTCGGCGC GGATCCGGCT GCTGCGTCGC TGCCGTCGCT CCAGGGCACG GGCTGGACTC GCGCCACGAC GTTGACGGGC ACGCCGCTCT ACAAGTGGAC CGGCGCGGCG CAGCCGACCA CGCTTGATGC CGGCGCCGCG CGTACCGACG TCAACGCGCT GGTTTCGTTC CGCAGCCACA CGAGCGAAGA CAGCCGCTTG AACATCACCG TCGAGCGGCG TTTCTACAAG CTCGAACCGG TCGAAGTCGC GGTCGACATG AAGAAGGAAG CCGCTGGTGA GAGCCAGTTG GGCCGCTCGG CCTTCACCGC GCGTCTGATG AAGCAGGGCG ACGCGATCGA CAGCAATGCG CTCTACGTCG ACGAAGTCAC GCTTACGCCG CGCTCCGGCA ACGCGTACCA CTATGGTCTG CTCGACGTGC CGCTGCCGCC GGGCGGCGAT GTCGAGGCAA CGAGCTGGGG CGTGTCGATC GACGGTCTGC CGGGCGCGAA AGACGGCGCC AGCGGTCCGC AGCCGTTCCA GCGCGTGGCG TCGTACGAGA TGGGTGAGCT GGCTTATCAC CAGCCGGTGC CCTTGCTCGA CCGGCCGGTC ACGCTGCGGC AACTGGTGCG CTTCGCATTG CCCGGCACGT TCGCGTTGCC GCCCGCGCGC TACTTCCGCA TGTATCAGCC GGACGCGAAG GCGTTCGAAG GCGGCAAGAG CGATCGCGTG ACGACGCTGC GCATCCAGTA A
|
Protein sequence | MTRMISVTRS SGWLGKARRR AACTFAALVA ALTLVAAPAA YADDEASSAA AADANIAGNP TLQTAPSSNF SSQKVDGQPF FLLSDASFGS DQPAQVRLEA PGRDFKDALQ AYGGADIVVY RVPKPLDFLK AQKNLHRLNV APNYQGEGLA NTLAYLWDRW FTEARRAWQR VLSFATRSKA TEAAPQFKLG EQTGKQTQFE SNSQFAPLKG YDMVARFRYP IWDAKVIEPP KGVNLAGSSS NFIEASSGNV MIPVGKLPPG LYIVEAVIGN YRAHTLLFVS DTVAVVKAAS SGMLVWTTRR DNGKPVANTE VNWTDGVGVL QSGTTGGDGA LVLQHVSPER SYVLGADPQG GVFVSENFYY DSEIYNTKIY AVTDRPMYRP GDPVHVKFIG RTFQNATQSS APAEADIKLD VLDPNGAPVA TSKVHFASDT GADTAFTLPA DATAGGYTLR FDYIGDVYGS AFRVAEYVKP HFDVNLSMDK ADYATGEALK GKIQLRYPDG KPVRDSKISV TLRAQKVTIV DGELRYAGLF PVKLDQQELK TDSDGNATLT LPAAKEPSRY ALTLFAQDGA AYKVRVTREV LIARGATPYR LTTAKSFSQP KQAVSFDLQA LGAIDPSSHA PSKYEWTRLE SQTHGEGALK GASAGDKLSF PVQFDEPGSY MMSVKDDLGN LLAATSHWVA GDGLKAIPGS VEIVFDRDKY KIGDTAEALI TFPMPVDDAL LTLERDSVER RALLTAGGDW LQLQRVAPSQ WKARIKVGAD FAPNMTFSVL YVHAGEYVFQ NAGIVVAQPQ IELNVKSDKP VYGPGDTVTL NFDSTLNGKP EAANLTVSVV DEMVYVLQPE IAPNIVDFFY HPRRNNVRTS SSLSFITYDL ARSPLKGAPG GPQRANYNER GVKVLERPRR DDQDTAAWEG NLKTDASGHA TMTFKMPDSL ARWRITVRAA APDGMVGQRT AYVRSDKALY LKWSGPSHFR VNDQPAIDMI AFNQTDADMD AQWVVDGGGL SLNQKVTLKR GANYLRLPGG ALKDGVINAS LRSAGKDVDR LQTTIHLDAS GWLDLHQNTV SLDGSNKPLN LPLDAQDVSV RFIGNAQSQF MRVADDLINY PYGCAEQTSS RLIPLALAHD AIGHGDNASS TQGLEALLRN QRQRLAMLAG VGGTFGWWGD TTGGSALITA YAYYADWLAS RSLGISLPAD NWQHAMDVYR DAGAKEPLLH RALALWLMQQ MGLPVATPVA GVAADLMSDA AATAAKAPAR SGTYGASDSI VFAQADSPRG KQMATLLISS MARSTSAQIP DGFDAAASAA RLTLVNDPAP LVQSLIAMTG DGGASADAPA LLAKSSADYP TLDRALTLLW LRKSLGADPA AASLPSLQGT GWTRATTLTG TPLYKWTGAA QPTTLDAGAA RTDVNALVSF RSHTSEDSRL NITVERRFYK LEPVEVAVDM KKEAAGESQL GRSAFTARLM KQGDAIDSNA LYVDEVTLTP RSGNAYHYGL LDVPLPPGGD VEATSWGVSI DGLPGAKDGA SGPQPFQRVA SYEMGELAYH QPVPLLDRPV TLRQLVRFAL PGTFALPPAR YFRMYQPDAK AFEGGKSDRV TTLRIQ
|
| |