Gene BamMC406_3717 details


Gene Information

Locus tag: BamMC406_3717
Symbol:
ID: 6179669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia ambifaria MC40-6
Kingdom: Bacteria
Replicon accession: NC_010552
Strand:
Start bp: 681295
End bp: 684066
Gene Length: 2772 bp
Protein Length: 923 aa
Translation table: 11
GC content: 68%
IMG OID: 641683487
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001810400
Protein GI: 172062749
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
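
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below shows one way to check them, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding its stop codon.

    Assumes the CDS is unspliced and ends in exactly one stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(681295, 684066)
print(length, protein_length(length))  # 2772 923
```

Both values agree with the Gene Length and Protein Length fields of the record.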


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.123962
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.562424
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATC AACAACACCA TCGCCCGCGT GCGGCGACGC CAGCCGCGCG CACTACGCTC 
GCCGCAGCGC TGACGCTCGC GATGCCGATC CTCGCGGCGC AGCCCGCGTT TGCGCAACCG
GGGCTCGTCG TCGATCCGAA CGCGGCGAAC CGTCCCGGTA TCACGAACGG GCCGAACGGC
ACGCCGATCG TCGGCATTAA CGCACCGGAT GCCGCCGGCA TTTCGGCCAA CCGCTTCACC
GAATACAACG TCGGGCCGGC AGGTCTCGTG CTGAACAACA GTGTGAACGG CACGAACACG
CAGCTGGCGG GCAGCATCGA CGGCAATGCG CGGCTCGGCG GGCGTTCGGC GCAGGTCATC
CTGAACCAGG TGACGGGCGG CAACGCATCG GTGCTCGCAG GGGCGACCGA AGTCGCGGGT
CAGCAGGCGC GCGTGATCGT CGCGAACCCG AACGGGATCG GCGTGAACGG CGCGAGCTTC
ATCAACGCGA GCCGCGTGTC GCTCGTCGCG GGCACGACGG AATTCGACGA GGACGGCCGT
ATCGCACGAT TCCGGACCGA GAACGGCCGC ATCACGATCG ACGGCGCCGG GCTCGACGCG
CGCAATCTGG ACCAGCTCGA TCTCGTGTCG CGCAGCCTGA AGGTGAACGC CGCGTTGCAC
GCGAAGAAAC TGGTCGCGGT CGCGCAACAA GGGACGGCGG CCATCGAGAA TCCTCAGGCG
ATGTCGTTCG ATGCGTCGAC GAGCGGGGAA CTGCCGCGCG TCGCGATCGA CGTGTCGCGA
CTCGGCAGCG TGCATGGAGA GGATTCGATC GTCATGCGCG GCACGTCGGC GGGCGTCGGC
ATCAATATCT CGGGCAAGGT CGAGGCGCTC ACGGGCTCGG TGACGTTGCT GTCGGATGGC
AGGGTCCGGA TTTCGGGCGG CGGATCGCTG CGCGCGGGGA CGCTCTCGGC TCCGTCCGGT
CTGCAGTACG GCGGGAACTG GACGGACGAT GACGAGGCGG CGAAGCCGCC GGTCGAGCCG
GAGCCGCCGG TAGCGGAGAC GAAGCCGCCG GTCGAGCCGG AGCCGCCGGT AGCCGAGACG
AAGCCGCCGG TCGAGCCGGA ACCGCCGGTA GCCGAGACGA AGCCGCCGGT CGAGCCGGAG
CCGCCGGTAG CGGAGACGAA GCCGCCGGTC GAGCCGGAAC CGCCGGTAGC CGAGACGAAG
CCGCCGGTCG AGCCGGAACC GCCGGTAGCG GAGACGAAGC CGCCGGTCGA GCCGGAGCCG
CCGGTAGCGG AGACGAAGCC GCCGGTCGAG CCGGAACCGC CGGTAGCGGA GACGAAGCCG
CCGGTCGAGC CGGAACCGCC GATCGCCGAA ACGAACCCGA AACTCGTCGT CGATCCGAAC
GCGGCCAACC GTCCCGGTCT CGGCGCCGCG CCGAACGGCA CGCCGATCGT CGACATCAAT
GCGGCGGACG CCTCCGGCAT TTCCTCGAAC CGCTTTCTCG ACTACAACGT CGGCCCGAAC
GGTCTCGTGC TGAACAACAG CGTGAACCGC ACGAACACGC AGCTGGCGGG CAGCATCGAC
GGCAATGCGC GGCTCGGCGG CCGTTCGGCG CAGGTCATCC TGAACCAGGT GACGGGCGGC
AACGCATCGG TGCTGGCCGG GGCGACTGAA GTCGCGGGGC AGCAGGCGCG CGTGATCGTC
GCGAACCCGA ACGGGATCAG CGTGAATGGC GCGAGCTTCA TCAACGCGAG CCGCGCGTCG
CTCGTCGCGG GCACGACGGA ATTCGACGAG GACGGCCGTA TCGCACGATT CCGGACCGAG
AACGGCCGCA TCGCGATCGA CGGCGCCGGG CTCGACGCGC GCAATATGGA TCAGCTCGAT
CTCGTGTCGC GCGGCCTGAA GGTCGATGCG GCGGTGCGCG CTAAGAAGCT GGTCGCGATC
GCGCATCAAG GCACGGCGGC AATCGAGCAA TCGAACCGGC AGACGCTCTC CGGCTCGGCA
AGCGGCAAGG CGCCGGCCGC GGCGATCGAC GTGTCCGAAT CGGGCAGCCT GCGGGCGGAC
GAGATCGCGC TGATCGGCGC GTCCGCGAAA ACCGGCGTCC GGATCGCCGG CACGGTCGAC
GCCGATACCG TCGACGTGTC CGGTCGCCTC GACAACGACG GTTCCGTGCG CGGGCGCAGC
GTCAGGATTG CCGGCGACGC GACCAATTCC GGCACGCTGC ACGCGGAAAA GTTGCTCGCG
ATGGGGAACG TGACCAACGG CGGCACGATC GAGGCAGGTG ACGTTCAGGT CATGGGCAAT
ACGACCAACA ACGGGACGCT GCGCGCGGAG AAGTTGCTCG CGATGGGGAA CGTGACCAAC
GGCAGCACGA TCGAGGCAGG TGACGTTCAG GTCATGGGCA ATACGACCAA CAACGGCGCG
CTGCGTGCGA AGAAGTTGTT TTCCACGTCG GGCAACATGA CCAACCGCGG CACGATCGAG
GGCGGCGACG TCAAGGTCAT GGGCAACACG ACCAACGACG GCACGCTGCG CGCGGAGAAG
TTGCTCTCCG CGATGGGCAA CGTGACCAAC CGCGGCACGA TCGAGGCCGG CGCCGTCAAG
GTCATGGGCA AAGTGACCAA CGACGGCACG TTGCACGGCG ACGATTCGGT ATCGGTGATG
GGCCGCCTGT ACAACGGGTT GTCGGGTGTC ACGAGCAGCT ACGGCAACGT GTTCGGCGCT
GGGCGCGTGT CCGGTCCGGG GCGCATCGTC AGCTACATGG TGCGTCCGAC GCCTGCCGGC
GACGTTCGAT AA
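
The GC content field of the record can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is supplied as printed here, in whitespace-separated blocks of ten bases. `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two ten-base blocks of the gene sequence, used only as sample input.
sample = "ATGAAACATC AACAACACCA"
print(f"{gc_percent(sample):.1f}% GC")
```

Passing the full gene sequence instead of `sample` gives a value that can be compared with the rounded figure in the record.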
 
Protein sequence
MKHQQHHRPR AATPAARTTL AAALTLAMPI LAAQPAFAQP GLVVDPNAAN RPGITNGPNG 
TPIVGINAPD AAGISANRFT EYNVGPAGLV LNNSVNGTNT QLAGSIDGNA RLGGRSAQVI
LNQVTGGNAS VLAGATEVAG QQARVIVANP NGIGVNGASF INASRVSLVA GTTEFDEDGR
IARFRTENGR ITIDGAGLDA RNLDQLDLVS RSLKVNAALH AKKLVAVAQQ GTAAIENPQA
MSFDASTSGE LPRVAIDVSR LGSVHGEDSI VMRGTSAGVG INISGKVEAL TGSVTLLSDG
RVRISGGGSL RAGTLSAPSG LQYGGNWTDD DEAAKPPVEP EPPVAETKPP VEPEPPVAET
KPPVEPEPPV AETKPPVEPE PPVAETKPPV EPEPPVAETK PPVEPEPPVA ETKPPVEPEP
PVAETKPPVE PEPPVAETKP PVEPEPPIAE TNPKLVVDPN AANRPGLGAA PNGTPIVDIN
AADASGISSN RFLDYNVGPN GLVLNNSVNR TNTQLAGSID GNARLGGRSA QVILNQVTGG
NASVLAGATE VAGQQARVIV ANPNGISVNG ASFINASRAS LVAGTTEFDE DGRIARFRTE
NGRIAIDGAG LDARNMDQLD LVSRGLKVDA AVRAKKLVAI AHQGTAAIEQ SNRQTLSGSA
SGKAPAAAID VSESGSLRAD EIALIGASAK TGVRIAGTVD ADTVDVSGRL DNDGSVRGRS
VRIAGDATNS GTLHAEKLLA MGNVTNGGTI EAGDVQVMGN TTNNGTLRAE KLLAMGNVTN
GSTIEAGDVQ VMGNTTNNGA LRAKKLFSTS GNMTNRGTIE GGDVKVMGNT TNDGTLRAEK
LLSAMGNVTN RGTIEAGAVK VMGKVTNDGT LHGDDSVSVM GRLYNGLSGV TSSYGNVFGA
GRVSGPGRIV SYMVRPTPAG DVR