Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_3717 |
Symbol | |
ID | 6179669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | + |
Start bp | 681295 |
End bp | 684066 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641683487 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001810400 |
Protein GI | 172062749 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.123962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.562424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATC AACAACACCA TCGCCCGCGT GCGGCGACGC CAGCCGCGCG CACTACGCTC GCCGCAGCGC TGACGCTCGC GATGCCGATC CTCGCGGCGC AGCCCGCGTT TGCGCAACCG GGGCTCGTCG TCGATCCGAA CGCGGCGAAC CGTCCCGGTA TCACGAACGG GCCGAACGGC ACGCCGATCG TCGGCATTAA CGCACCGGAT GCCGCCGGCA TTTCGGCCAA CCGCTTCACC GAATACAACG TCGGGCCGGC AGGTCTCGTG CTGAACAACA GTGTGAACGG CACGAACACG CAGCTGGCGG GCAGCATCGA CGGCAATGCG CGGCTCGGCG GGCGTTCGGC GCAGGTCATC CTGAACCAGG TGACGGGCGG CAACGCATCG GTGCTCGCAG GGGCGACCGA AGTCGCGGGT CAGCAGGCGC GCGTGATCGT CGCGAACCCG AACGGGATCG GCGTGAACGG CGCGAGCTTC ATCAACGCGA GCCGCGTGTC GCTCGTCGCG GGCACGACGG AATTCGACGA GGACGGCCGT ATCGCACGAT TCCGGACCGA GAACGGCCGC ATCACGATCG ACGGCGCCGG GCTCGACGCG CGCAATCTGG ACCAGCTCGA TCTCGTGTCG CGCAGCCTGA AGGTGAACGC CGCGTTGCAC GCGAAGAAAC TGGTCGCGGT CGCGCAACAA GGGACGGCGG CCATCGAGAA TCCTCAGGCG ATGTCGTTCG ATGCGTCGAC GAGCGGGGAA CTGCCGCGCG TCGCGATCGA CGTGTCGCGA CTCGGCAGCG TGCATGGAGA GGATTCGATC GTCATGCGCG GCACGTCGGC GGGCGTCGGC ATCAATATCT CGGGCAAGGT CGAGGCGCTC ACGGGCTCGG TGACGTTGCT GTCGGATGGC AGGGTCCGGA TTTCGGGCGG CGGATCGCTG CGCGCGGGGA CGCTCTCGGC TCCGTCCGGT CTGCAGTACG GCGGGAACTG GACGGACGAT GACGAGGCGG CGAAGCCGCC GGTCGAGCCG GAGCCGCCGG TAGCGGAGAC GAAGCCGCCG GTCGAGCCGG AGCCGCCGGT AGCCGAGACG AAGCCGCCGG TCGAGCCGGA ACCGCCGGTA GCCGAGACGA AGCCGCCGGT CGAGCCGGAG CCGCCGGTAG CGGAGACGAA GCCGCCGGTC GAGCCGGAAC CGCCGGTAGC CGAGACGAAG CCGCCGGTCG AGCCGGAACC GCCGGTAGCG GAGACGAAGC CGCCGGTCGA GCCGGAGCCG CCGGTAGCGG AGACGAAGCC GCCGGTCGAG CCGGAACCGC CGGTAGCGGA GACGAAGCCG CCGGTCGAGC CGGAACCGCC GATCGCCGAA ACGAACCCGA AACTCGTCGT CGATCCGAAC GCGGCCAACC GTCCCGGTCT CGGCGCCGCG CCGAACGGCA CGCCGATCGT CGACATCAAT GCGGCGGACG CCTCCGGCAT TTCCTCGAAC CGCTTTCTCG ACTACAACGT CGGCCCGAAC GGTCTCGTGC TGAACAACAG CGTGAACCGC ACGAACACGC AGCTGGCGGG CAGCATCGAC GGCAATGCGC GGCTCGGCGG CCGTTCGGCG CAGGTCATCC TGAACCAGGT GACGGGCGGC AACGCATCGG TGCTGGCCGG GGCGACTGAA GTCGCGGGGC AGCAGGCGCG CGTGATCGTC GCGAACCCGA ACGGGATCAG CGTGAATGGC GCGAGCTTCA TCAACGCGAG CCGCGCGTCG CTCGTCGCGG GCACGACGGA ATTCGACGAG GACGGCCGTA TCGCACGATT CCGGACCGAG AACGGCCGCA TCGCGATCGA CGGCGCCGGG CTCGACGCGC GCAATATGGA TCAGCTCGAT CTCGTGTCGC GCGGCCTGAA GGTCGATGCG GCGGTGCGCG CTAAGAAGCT GGTCGCGATC GCGCATCAAG GCACGGCGGC AATCGAGCAA TCGAACCGGC AGACGCTCTC CGGCTCGGCA AGCGGCAAGG CGCCGGCCGC GGCGATCGAC GTGTCCGAAT CGGGCAGCCT GCGGGCGGAC GAGATCGCGC TGATCGGCGC GTCCGCGAAA ACCGGCGTCC GGATCGCCGG CACGGTCGAC GCCGATACCG TCGACGTGTC CGGTCGCCTC GACAACGACG GTTCCGTGCG CGGGCGCAGC GTCAGGATTG CCGGCGACGC GACCAATTCC GGCACGCTGC ACGCGGAAAA GTTGCTCGCG ATGGGGAACG TGACCAACGG CGGCACGATC GAGGCAGGTG ACGTTCAGGT CATGGGCAAT ACGACCAACA ACGGGACGCT GCGCGCGGAG AAGTTGCTCG CGATGGGGAA CGTGACCAAC GGCAGCACGA TCGAGGCAGG TGACGTTCAG GTCATGGGCA ATACGACCAA CAACGGCGCG CTGCGTGCGA AGAAGTTGTT TTCCACGTCG GGCAACATGA CCAACCGCGG CACGATCGAG GGCGGCGACG TCAAGGTCAT GGGCAACACG ACCAACGACG GCACGCTGCG CGCGGAGAAG TTGCTCTCCG CGATGGGCAA CGTGACCAAC CGCGGCACGA TCGAGGCCGG CGCCGTCAAG GTCATGGGCA AAGTGACCAA CGACGGCACG TTGCACGGCG ACGATTCGGT ATCGGTGATG GGCCGCCTGT ACAACGGGTT GTCGGGTGTC ACGAGCAGCT ACGGCAACGT GTTCGGCGCT GGGCGCGTGT CCGGTCCGGG GCGCATCGTC AGCTACATGG TGCGTCCGAC GCCTGCCGGC GACGTTCGAT AA
|
Protein sequence | MKHQQHHRPR AATPAARTTL AAALTLAMPI LAAQPAFAQP GLVVDPNAAN RPGITNGPNG TPIVGINAPD AAGISANRFT EYNVGPAGLV LNNSVNGTNT QLAGSIDGNA RLGGRSAQVI LNQVTGGNAS VLAGATEVAG QQARVIVANP NGIGVNGASF INASRVSLVA GTTEFDEDGR IARFRTENGR ITIDGAGLDA RNLDQLDLVS RSLKVNAALH AKKLVAVAQQ GTAAIENPQA MSFDASTSGE LPRVAIDVSR LGSVHGEDSI VMRGTSAGVG INISGKVEAL TGSVTLLSDG RVRISGGGSL RAGTLSAPSG LQYGGNWTDD DEAAKPPVEP EPPVAETKPP VEPEPPVAET KPPVEPEPPV AETKPPVEPE PPVAETKPPV EPEPPVAETK PPVEPEPPVA ETKPPVEPEP PVAETKPPVE PEPPVAETKP PVEPEPPIAE TNPKLVVDPN AANRPGLGAA PNGTPIVDIN AADASGISSN RFLDYNVGPN GLVLNNSVNR TNTQLAGSID GNARLGGRSA QVILNQVTGG NASVLAGATE VAGQQARVIV ANPNGISVNG ASFINASRAS LVAGTTEFDE DGRIARFRTE NGRIAIDGAG LDARNMDQLD LVSRGLKVDA AVRAKKLVAI AHQGTAAIEQ SNRQTLSGSA SGKAPAAAID VSESGSLRAD EIALIGASAK TGVRIAGTVD ADTVDVSGRL DNDGSVRGRS VRIAGDATNS GTLHAEKLLA MGNVTNGGTI EAGDVQVMGN TTNNGTLRAE KLLAMGNVTN GSTIEAGDVQ VMGNTTNNGA LRAKKLFSTS GNMTNRGTIE GGDVKVMGNT TNDGTLRAEK LLSAMGNVTN RGTIEAGAVK VMGKVTNDGT LHGDDSVSVM GRLYNGLSGV TSSYGNVFGA GRVSGPGRIV SYMVRPTPAG DVR
|
| |