Gene Bphyt_0418 details


Gene Information

Locus tag: Bphyt_0418
Symbol:
ID: 6284704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010681
Strand:
Start bp: 460835
End bp: 463273
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 64%
IMG OID: 642619982
Product: filamentous haemagglutinin family outer membrane protein
Protein accession: YP_001894067
Protein GI: 187922425
COG category:
COG ID:
TIGRFAM ID: [TIGR01731] adhesin HecA family 20-residue repeat (two copies)
            [TIGR01901] filamentous haemagglutinin family N-terminal domain
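
The start, end, gene length and protein length reported above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes the stop codon. Neither convention is stated in the record, but both are consistent with the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(460835, 463273)
print(length_bp, protein_length(length_bp))  # prints "2439 812"
```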


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.740243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.11734
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAAA CTCCTCCTCG TATCATTGAC AGCAAGAGCC GGACGGTCTG GTTCGCCGCA 
CGCGCCAGCG CATTTGCCGC GTTGTGCGCA TTCGGCATGC AGCCGCTTGT CGCAAGCGCG
CAAGCAAAGC TGACCATTAC CCCCGACGCG GGCGCGGCCA CGCGCCCGAC GATCGGCACA
TCGAGCAACG GCACCCAGGT CGTCAATATC GTTGCGCCTA ATGCCGCAGG TGTGTCGAGC
AATCGCTTCT CGGACTACAA CGTCGGCACG GGCGGCGTGA TCATCAACAA CGCCACGCAG
GCCGCGCAAA CGCAGATCGG CGGCACGGTC CAGGCAAACT CCGTGCTCGG CAAGCAGGGC
GCGAAACTGG TCCTGATGCA GGTTACGTCG GGCGCGCAGT CGCAGTTGCT CGGCACCACT
GAGATCGCCG GCAACAGTGC CAATCTCGTG CTCGCCAACC CGGCAGGCAT CACGTGCTCG
GGCTGCGGCT TCCTGAACGC GCCGCGCGTC ACGTTGGCAA CCGGCACGCC GACGCTGAAA
AGCGACGGCT CGCTGGATAC GATCGACGTG AAGCAAGGCA CGCTTGCGGT GGACGGCGGC
GGCCTGAACG GCTCGACGAG CGCGGTCGAT CTGATCGCGC GTGCGATCAC GATCAACGGC
AAGGTGCAAG GCAAATCGAT CGATGCGATT GCTGGCGCCA ACCGCGTCAA TTACGCGTCG
AAGTCCGCGC TCGCTCAGGC CGGCACGGGC AGCGCTCCGC AGGTGGCGAT CGATGTGCAG
TCGCTCGGAA GCATGTACGG CGACGGCGCC GTGCGTCTGC TCGGCACCGA AGCGGGTGTC
GGCGTGCGCG ACAACGGCAC GCTGACTTCA CTGACCGGCA ACCTGAGCGT CGGTGCCAAC
GGCGACGTCA CGATCGCTGT GCCCGCGAGC ATCAAGGCGG CAGGCAACGC TGCGATCAAC
GGCGCGAACG TGACGAACGA CGGCTCCATC GCCGCCAGTG GCAGCATGGA CGTTCACGCT
ACTCAGGCGC TCGCCAATCG CGGCACGATC ACGGCCGACA GCATGAGCCT GATCGCCAAC
ACGCTATCGA ACGCGGGCAA AGTCGCCGCG AACGGCATCC GCATGGGTGG CGACCGGTCG
CTGATTAACA CCGGCACAAT CGAAGCTGCG CAACAGGCGG AACTGGCGGG CGACAGCATG
ACGCTTGCGG CCGATAGCAG CGTGAACAGT GCAAATGTGA CCTTGAGGGG CGGCACCATC
ACGAACCAGA GCAACGCGGT GAACGCGAGC CGATTCCTCA ATATCAACGC GGACCACATC
GACAACGCAG GCACGCTGAG TTCGGGTGGC GGTGCCTATG TCAACGCGAT GAGCACGTTC
GCCAACGGCG CCAACGGCAC GCTGACTGCG CAGGACAACG TTCAGATCGG CGGTGGCAAT
GTAACAAACG ACGACCTGAT CGCCAGCGCG CGCTCGTTGG ACGCCAACGG TGCCTTGAGC
TTGACCAATC GCGGCACCCT CACGGCCGGT GATGCCATGA ACCTGTCGAC CCGCGGCTCG
CTGCTGAACA GCGGCAAGAT GTCGGCGGCG AGTCTCAGCA TGACCGCCGA CCAGTCGTTG
ACGAATAGCG GCACGATCGA CGCGACGACG CAGGCAACGC TGGCATCCAA CAATCTGATG
CTGTCGGGCG GCAGCGTGAA AGGTGGCACG GTAACGCTGA ACGGAGGCAC CGTCACGAAC
CAGAACGCAA CGGTGAACGC CGGCCAGCTT CTCGACATTC GCGCGGACAA CGTGGAGAAC
ACCGGCACGC TGAGTTCGAA CGGCGATGCA CGCGTCAACG CGATGAACAC GTTCATGAAC
GGCGCGAACG GCACGCTGAC CGCACAGAAC AACGTGCAGA TCGGCGGGAC GACGCTGAAC
AACAGCAACG GCAGCATCGA GGCGGTGAAG GGCACGCTAT CCGTACAGGC CAGCACGATC
GCCAACCTGA ACGGCAAGCT GTCGTCGGGC AACGCGATGT CGATCAATAC GCGCGGCGAC
CTCGACAATA CGGGCGGCAC GATCACCGCG GGGCGTGATG GGCAGATAGA CGTCGGCGGC
AAGTTGACGA ACGACAACGG CTCGATCACG TCGCAAGCGG CGGTACGCGG CACGGCGGGG
TCGATGTCGA ATGTCGGCGG CACGATCTCG GCGCCGATCT CCGCGGAAAT TCGTGTCGTC
GGCGATAACG ACCGCAACAA CGGCGGCTTC ATCCCGCCGG TCAATCCGAC GCCGGAGCCG
GAGCGGACGC CGGCGCCGAA GCCGGAGCCG ACTCCGACAA CGTTCGTTCC GCCTCCCGGC
TTTGTTCGCA TGGATCCCCA TTCGCCTGAT CTTAACAAGT ACAGCGGTTT CTACGATCAG
GACGGCTACT TCTACGCGCG AATCTTCATG CCGAGCTAA
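
The gene sequence is printed in space-separated blocks of 10 bases, 60 per row, so the whitespace has to be removed before computing its length or GC content. The following is a minimal sketch of that clean-up. The helper names are hypothetical, and only the first row of the sequence is used as sample input. Pasting in the full block should let the reported gene length and GC content be checked.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCAGAAAA CTCCTCCTCG TATCATTGAC AGCAAGAGCC GGACGGTCTG GTTCGCCGCA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```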
 
Protein sequence
MQKTPPRIID SKSRTVWFAA RASAFAALCA FGMQPLVASA QAKLTITPDA GAATRPTIGT 
SSNGTQVVNI VAPNAAGVSS NRFSDYNVGT GGVIINNATQ AAQTQIGGTV QANSVLGKQG
AKLVLMQVTS GAQSQLLGTT EIAGNSANLV LANPAGITCS GCGFLNAPRV TLATGTPTLK
SDGSLDTIDV KQGTLAVDGG GLNGSTSAVD LIARAITING KVQGKSIDAI AGANRVNYAS
KSALAQAGTG SAPQVAIDVQ SLGSMYGDGA VRLLGTEAGV GVRDNGTLTS LTGNLSVGAN
GDVTIAVPAS IKAAGNAAIN GANVTNDGSI AASGSMDVHA TQALANRGTI TADSMSLIAN
TLSNAGKVAA NGIRMGGDRS LINTGTIEAA QQAELAGDSM TLAADSSVNS ANVTLRGGTI
TNQSNAVNAS RFLNINADHI DNAGTLSSGG GAYVNAMSTF ANGANGTLTA QDNVQIGGGN
VTNDDLIASA RSLDANGALS LTNRGTLTAG DAMNLSTRGS LLNSGKMSAA SLSMTADQSL
TNSGTIDATT QATLASNNLM LSGGSVKGGT VTLNGGTVTN QNATVNAGQL LDIRADNVEN
TGTLSSNGDA RVNAMNTFMN GANGTLTAQN NVQIGGTTLN NSNGSIEAVK GTLSVQASTI
ANLNGKLSSG NAMSINTRGD LDNTGGTITA GRDGQIDVGG KLTNDNGSIT SQAAVRGTAG
SMSNVGGTIS APISAEIRVV GDNDRNNGGF IPPVNPTPEP ERTPAPKPEP TPTTFVPPPG
FVRMDPHSPD LNKYSGFYDQ DGYFYARIFM PS