Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_1536 |
Symbol | |
ID | 6177548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010551 |
Strand | + |
Start bp | 1704719 |
End bp | 1707733 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641681297 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001808239 |
Protein GI | 172060587 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00333772 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.683009 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAGA CCTATTCTTT GATATGGAAC GACGCGCAAG GCGCCTGGCA GGTCGCCTGC GAACGAGTAC GCAGGCGAGG CAAGTCCTCG ACGCGCGCAC GTGTCGTCGC GATGGTGGCA CTGCTCGCCG GCGGCGCGGC GACGTCTGCG AATGCGCTGC CGACAGGCGA GAAGATCATC TCGGGGAAAG GCGACATTCA CCGTTACGAC AACGGGCAGC AGATGTCGGT CAACCAGCAT AGCGACAAGC TGGTGACGGA CTGGCAAACG TTCAACATCG ACAAGGGGGA GCGTGTCACG TTCAACCAGC CGTCCGCGGC GTCGATCGCG CTGAACCGGG TCGTCGGCCA GGACGGCAGC GCAATCTACG GCAATCTCGA TGCGAACGGC CGCGTGTTCC TCGTCAACCC GAACGGCATC CTGTTCGGCA AGAACGCGCA GATCAATGTC GGCGGGCTGG TGGCGACGAC CCTGGATATC AGGAACGAAG ACTTCGACGC GGGCCGCTAC CGGTTTTCCG GCCAGTCGCC GAGCGAAGTG CAGAACTTCG GCAATCTCGT GGCGGCCGAA GGCGGCGCGA TCGCGCTGCT TGGCGCGAAA GTCGTCAACA AGGGGATCGT CCAGGCCCAG ATGGGCACCG TCGCGCTGGG TGCGGGCAGT GACGCCACGC TGAACTTCGA CGGCAGCAAG CTGCTCAGCG TCCAGATCGA CCAGGGGATC GTCAACGCGC TGGTCAGCAA CGAGCAGTTG CTGAGGGCCG ACGGCGGGCA GGTGCTGATG AGCGCGAAGA CCGCCGATAC GCTGCTGCGC ACGGTCGTCA ACAACCAGGG CGTCATCGAG GCGCGCACGC TCAGGAATGC AGCGGGCCGG ATCTCGCTGG ACGGTCACGA TGCCGGCACC GTCAACGTCG CGGGCGTGCT CAACGCGAGC GCGACGACGC CGGGCAACGG CGGCATGGTC GAAACCCGCG GCGCCGACGT GAAGGTGGCA CTCGGCGCGA TGGTCGACAC GCGCGCGACC AACGGGCGTG TCGGCACCTG GCGCATCGCG TCGTCCGACG TGTCGGTCAA AGCGGGCGCG AACCAGCCTG GCACGATCGT CGCCGACACG CTGTCGCGCA ATCTCGCGAC CACCGATGTC GAACTGGTCG CCGAGCGCGG ACAACTTTCG GTCGACGGGC CGGTGACGTG GGCCAGCGGC AACCGATTGA CGCTGGCCAG CCGGCAGGGC GACGTGTCGG TCAACGGCGC GCTGCGCGCG ACGGGCGCGA ATGCGCGGTT GGCCATCGAC GCGAAGCAGA ACGTGCGGAT CGCGGCCCCG ATCGCGCTGA CCGGCGCGAA CGCGCTGCTG ACACTCGACT ACGGTGTCGC GCACTCGCTC TCCGGCGGCG CGGCCGTGAC GCTGTCGGGG GCCGGCGCCG GTTTCGAATC GAACGGCTAT CGCTATACCG TGGTCCAGAA TCTGCAGCAG TTGCAGGCTG TCGATGCGAA TCTCGACGGG CTGTACGTGC TTGGCAACGA CATCGCCGGT TCCTATTACT ACGGCACCGC GTTCAAGACG ATCGGTTCGG GCGCGTCGTT CGCTGGCGTG TTCGACGGGC TCGGCAACAC GATCAGCAAC CTCACCATCG CGAGCAGCAA TGCGTATGCC GGGCTGTTCG GCCGCAACAC CGGCACGCTC GCCAACCTCA ACCTGAAGTC GTTGCGTGTC AGCGCTGCGT CGGGCGTCGG TCCGGTGGCC ATCGGCGGGC TGGTAGGCGA AAACGCCGGC AAGATCTCGA ACGTGACCGC GACGGGTATG CAGGTGAGTG CCGGCGCCAG CCGCAACAAT GCGCTGGGCG GTATGGTCGG CATCAATAGC GGCGAGATTT CCCGGACGTC GTATTCGGGA TCGGTAACGG GCAACAGCAT GTCGGACGCC GTCGGCGGCC TGGTCGGCGA GAACCGCCTG GATCAGCTTG CCGGCATCGT GTCCGACAGC GAGTCCAACG CGACCGTATC CGGCGGCGCA TCGAACCGCG CGTCGATCGG CGGTCTCGTC GGCGTGAACC GGGGCGGTGA AATTCTGCGC TCGACGAGTC GCGGCACCAC ATCCGGCACC ACATCCGGCT ACAACCTCGC CGGCGTGAAC GTCGGCGGGC TCGTGGGTGC GAACCTGCTC GGCAAGATCG TCGACGGATC GGCGCTGGGG CGCGTGACGG GCGGCTCGAG CGGCACGGCA GGCGGGTTCG TCGGAAACAA TACCGGCACG ATCACCGGTT CGAGTGCGAG CGGTGTCGTG GATGGCCGAT ACGCGCAGGC AATCGGCGGC TTCGTCGGGC TGAACCAGGG CAACGTGGCC GACAGCAAGG CGCTGGGCAC CGTCTCGAGC TCATCGACAG GTTCGACGGG CGGTTTCGTC GGCATCAATC TCGGCGACAA CGCGTTCATC GACATGGCGG AAGCGCATGG TGCGGTAACG GGTGGTCAGG GCGCCAATGG CGGTTTCGTC GGCAGTCATC TCGGCGGCCG GATTGCTCAC GCGGTCGCAC GCGGCAAGAC GACGGGCGGC AGTTACAGCA AGACGGGCGG CTTCGTCGGC AGCAACGAGG CGGAATTGGC CAACGTCGAC GCGAGCGGCG ACGTGTCGGC CGGTGCCGGC GCGTCGGTGG GCGCGTTCGC CGGCGCGAAC ACGCGGTTCG GCAAGATCGA GGCCGCGTCG GCAACCGGCA ACGTGACGGG CGGCTCGTCG AGCACGGTGG GCGGGTTCGC AGGCGAAAAC CTCGGCACGA TGCGCGACGT ATCGGCGTCG GGGACGGTCG GCGCGGGCTA CTACGGTGCG ACGCTCGGCG GCCTGGTCGG TGTGAACGCG GGGCTCGTCG AGCGCGCTGC CGCAAGCGGG CGGGTCAACG GCTCGTCGAG CCAGACCTTC GGCGGCCTCG TTGGCATCAA CCGGGGCATC TTCCGCAACA GCGTCACGTC CGGCGAGGCT GCGCTGCAGA AGATTGCCGG TCTCAATCTC GGTGTGATCG AGTAA
|
Protein sequence | MNKTYSLIWN DAQGAWQVAC ERVRRRGKSS TRARVVAMVA LLAGGAATSA NALPTGEKII SGKGDIHRYD NGQQMSVNQH SDKLVTDWQT FNIDKGERVT FNQPSAASIA LNRVVGQDGS AIYGNLDANG RVFLVNPNGI LFGKNAQINV GGLVATTLDI RNEDFDAGRY RFSGQSPSEV QNFGNLVAAE GGAIALLGAK VVNKGIVQAQ MGTVALGAGS DATLNFDGSK LLSVQIDQGI VNALVSNEQL LRADGGQVLM SAKTADTLLR TVVNNQGVIE ARTLRNAAGR ISLDGHDAGT VNVAGVLNAS ATTPGNGGMV ETRGADVKVA LGAMVDTRAT NGRVGTWRIA SSDVSVKAGA NQPGTIVADT LSRNLATTDV ELVAERGQLS VDGPVTWASG NRLTLASRQG DVSVNGALRA TGANARLAID AKQNVRIAAP IALTGANALL TLDYGVAHSL SGGAAVTLSG AGAGFESNGY RYTVVQNLQQ LQAVDANLDG LYVLGNDIAG SYYYGTAFKT IGSGASFAGV FDGLGNTISN LTIASSNAYA GLFGRNTGTL ANLNLKSLRV SAASGVGPVA IGGLVGENAG KISNVTATGM QVSAGASRNN ALGGMVGINS GEISRTSYSG SVTGNSMSDA VGGLVGENRL DQLAGIVSDS ESNATVSGGA SNRASIGGLV GVNRGGEILR STSRGTTSGT TSGYNLAGVN VGGLVGANLL GKIVDGSALG RVTGGSSGTA GGFVGNNTGT ITGSSASGVV DGRYAQAIGG FVGLNQGNVA DSKALGTVSS SSTGSTGGFV GINLGDNAFI DMAEAHGAVT GGQGANGGFV GSHLGGRIAH AVARGKTTGG SYSKTGGFVG SNEAELANVD ASGDVSAGAG ASVGAFAGAN TRFGKIEAAS ATGNVTGGSS STVGGFAGEN LGTMRDVSAS GTVGAGYYGA TLGGLVGVNA GLVERAAASG RVNGSSSQTF GGLVGINRGI FRNSVTSGEA ALQKIAGLNL GVIE
|
| |