Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_0418 |
Symbol | |
ID | 6284704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | - |
Start bp | 460835 |
End bp | 463273 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642619982 |
Product | filamentous haemagglutinin family outer membrane protein |
Protein accession | YP_001894067 |
Protein GI | 187922425 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.740243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.11734 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA CTCCTCCTCG TATCATTGAC AGCAAGAGCC GGACGGTCTG GTTCGCCGCA CGCGCCAGCG CATTTGCCGC GTTGTGCGCA TTCGGCATGC AGCCGCTTGT CGCAAGCGCG CAAGCAAAGC TGACCATTAC CCCCGACGCG GGCGCGGCCA CGCGCCCGAC GATCGGCACA TCGAGCAACG GCACCCAGGT CGTCAATATC GTTGCGCCTA ATGCCGCAGG TGTGTCGAGC AATCGCTTCT CGGACTACAA CGTCGGCACG GGCGGCGTGA TCATCAACAA CGCCACGCAG GCCGCGCAAA CGCAGATCGG CGGCACGGTC CAGGCAAACT CCGTGCTCGG CAAGCAGGGC GCGAAACTGG TCCTGATGCA GGTTACGTCG GGCGCGCAGT CGCAGTTGCT CGGCACCACT GAGATCGCCG GCAACAGTGC CAATCTCGTG CTCGCCAACC CGGCAGGCAT CACGTGCTCG GGCTGCGGCT TCCTGAACGC GCCGCGCGTC ACGTTGGCAA CCGGCACGCC GACGCTGAAA AGCGACGGCT CGCTGGATAC GATCGACGTG AAGCAAGGCA CGCTTGCGGT GGACGGCGGC GGCCTGAACG GCTCGACGAG CGCGGTCGAT CTGATCGCGC GTGCGATCAC GATCAACGGC AAGGTGCAAG GCAAATCGAT CGATGCGATT GCTGGCGCCA ACCGCGTCAA TTACGCGTCG AAGTCCGCGC TCGCTCAGGC CGGCACGGGC AGCGCTCCGC AGGTGGCGAT CGATGTGCAG TCGCTCGGAA GCATGTACGG CGACGGCGCC GTGCGTCTGC TCGGCACCGA AGCGGGTGTC GGCGTGCGCG ACAACGGCAC GCTGACTTCA CTGACCGGCA ACCTGAGCGT CGGTGCCAAC GGCGACGTCA CGATCGCTGT GCCCGCGAGC ATCAAGGCGG CAGGCAACGC TGCGATCAAC GGCGCGAACG TGACGAACGA CGGCTCCATC GCCGCCAGTG GCAGCATGGA CGTTCACGCT ACTCAGGCGC TCGCCAATCG CGGCACGATC ACGGCCGACA GCATGAGCCT GATCGCCAAC ACGCTATCGA ACGCGGGCAA AGTCGCCGCG AACGGCATCC GCATGGGTGG CGACCGGTCG CTGATTAACA CCGGCACAAT CGAAGCTGCG CAACAGGCGG AACTGGCGGG CGACAGCATG ACGCTTGCGG CCGATAGCAG CGTGAACAGT GCAAATGTGA CCTTGAGGGG CGGCACCATC ACGAACCAGA GCAACGCGGT GAACGCGAGC CGATTCCTCA ATATCAACGC GGACCACATC GACAACGCAG GCACGCTGAG TTCGGGTGGC GGTGCCTATG TCAACGCGAT GAGCACGTTC GCCAACGGCG CCAACGGCAC GCTGACTGCG CAGGACAACG TTCAGATCGG CGGTGGCAAT GTAACAAACG ACGACCTGAT CGCCAGCGCG CGCTCGTTGG ACGCCAACGG TGCCTTGAGC TTGACCAATC GCGGCACCCT CACGGCCGGT GATGCCATGA ACCTGTCGAC CCGCGGCTCG CTGCTGAACA GCGGCAAGAT GTCGGCGGCG AGTCTCAGCA TGACCGCCGA CCAGTCGTTG ACGAATAGCG GCACGATCGA CGCGACGACG CAGGCAACGC TGGCATCCAA CAATCTGATG CTGTCGGGCG GCAGCGTGAA AGGTGGCACG GTAACGCTGA ACGGAGGCAC CGTCACGAAC CAGAACGCAA CGGTGAACGC CGGCCAGCTT CTCGACATTC GCGCGGACAA CGTGGAGAAC ACCGGCACGC TGAGTTCGAA CGGCGATGCA CGCGTCAACG CGATGAACAC GTTCATGAAC GGCGCGAACG GCACGCTGAC CGCACAGAAC AACGTGCAGA TCGGCGGGAC GACGCTGAAC AACAGCAACG GCAGCATCGA GGCGGTGAAG GGCACGCTAT CCGTACAGGC CAGCACGATC GCCAACCTGA ACGGCAAGCT GTCGTCGGGC AACGCGATGT CGATCAATAC GCGCGGCGAC CTCGACAATA CGGGCGGCAC GATCACCGCG GGGCGTGATG GGCAGATAGA CGTCGGCGGC AAGTTGACGA ACGACAACGG CTCGATCACG TCGCAAGCGG CGGTACGCGG CACGGCGGGG TCGATGTCGA ATGTCGGCGG CACGATCTCG GCGCCGATCT CCGCGGAAAT TCGTGTCGTC GGCGATAACG ACCGCAACAA CGGCGGCTTC ATCCCGCCGG TCAATCCGAC GCCGGAGCCG GAGCGGACGC CGGCGCCGAA GCCGGAGCCG ACTCCGACAA CGTTCGTTCC GCCTCCCGGC TTTGTTCGCA TGGATCCCCA TTCGCCTGAT CTTAACAAGT ACAGCGGTTT CTACGATCAG GACGGCTACT TCTACGCGCG AATCTTCATG CCGAGCTAA
|
Protein sequence | MQKTPPRIID SKSRTVWFAA RASAFAALCA FGMQPLVASA QAKLTITPDA GAATRPTIGT SSNGTQVVNI VAPNAAGVSS NRFSDYNVGT GGVIINNATQ AAQTQIGGTV QANSVLGKQG AKLVLMQVTS GAQSQLLGTT EIAGNSANLV LANPAGITCS GCGFLNAPRV TLATGTPTLK SDGSLDTIDV KQGTLAVDGG GLNGSTSAVD LIARAITING KVQGKSIDAI AGANRVNYAS KSALAQAGTG SAPQVAIDVQ SLGSMYGDGA VRLLGTEAGV GVRDNGTLTS LTGNLSVGAN GDVTIAVPAS IKAAGNAAIN GANVTNDGSI AASGSMDVHA TQALANRGTI TADSMSLIAN TLSNAGKVAA NGIRMGGDRS LINTGTIEAA QQAELAGDSM TLAADSSVNS ANVTLRGGTI TNQSNAVNAS RFLNINADHI DNAGTLSSGG GAYVNAMSTF ANGANGTLTA QDNVQIGGGN VTNDDLIASA RSLDANGALS LTNRGTLTAG DAMNLSTRGS LLNSGKMSAA SLSMTADQSL TNSGTIDATT QATLASNNLM LSGGSVKGGT VTLNGGTVTN QNATVNAGQL LDIRADNVEN TGTLSSNGDA RVNAMNTFMN GANGTLTAQN NVQIGGTTLN NSNGSIEAVK GTLSVQASTI ANLNGKLSSG NAMSINTRGD LDNTGGTITA GRDGQIDVGG KLTNDNGSIT SQAAVRGTAG SMSNVGGTIS APISAEIRVV GDNDRNNGGF IPPVNPTPEP ERTPAPKPEP TPTTFVPPPG FVRMDPHSPD LNKYSGFYDQ DGYFYARIFM PS
|
| |