Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2480 |
Symbol | |
ID | 4887868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2397897 |
End bp | 2400641 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132417 |
Product | hemagglutinin-like protein |
Protein accession | YP_001063474 |
Protein GI | 126444573 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.168613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGGA ATGAGGTCGT GAACAGGAAC GTGTTTCGTT TGGTGCTGAA CAGGGTGGCG GGCATGCCGG TGCCGATGCC GGCGGCGGAG GTGTCGCGCG GGCGCGGCAA GCTCGGCTGC GGCGGCGTGC GTGCGCAACG TCGCGGCGGT GCGGCGTGCG CGGCGCTGCT TGGGGTGGCC GGGCCGTCCT TGGCGTTCGC GGCGGTGGTG GCGGACCCGA ACGGGGGCGC GCAGCGGCCC GGCATGGCGA CGACGGCGAA CGGGACGGAC CTGGTCAATA TCGTCGCGCC GGACGCGACG GGGTTGTCGC ACAACAAGTT CAACGAGTTC AGCCCGGTTG GACGCGGCGT GGTGTTGAAC AACAGCGTGC GGCCCGGGGA ATCGCAGATC GGCGGCATGG CGGCGCAGAA CCCGAACTTG ATGCAACCGG CCACCCGGGC ATTGCTCGAG GTGACGCAGC AACGCAGCGT GCTGCAGGGC ACGCTGGAGG CGTTCGGCGG CAAGCTCGAC GTGCTGGTGG CGAACCAGCA TGGAGTGACG ATCAACGGCT TGACGACGCT GAACGTGGGC CGGCTCGGCG TGACGACGGG GCAGGTGCTG CCGCAAGCGG CCGGGCAGTT GCGTTTGGGC GTGACGCAAG GCGACGTGCT GATCGACCAT GGGGGCATCG ATACCCAGGG CCTGGACATG TTCGACGTGG TGAGCCGCAG CATCGCCGTG CGCGGGCCGA TCCACGATTC GAGCCGCGCC GCGGGCGCCG ACGTGCGCCT CGTGGCGGGC GCGACGGCCT ACGATCCGCA GACCGGTCAT TATGAGGCGA TCGCGGCGGA CGAATCGAAG GCGCCGGTGC AGGAGGGAAT CAGCGGCGAA CTGCTGGGAG CGATGCACGG CCGTCACATT GTGCTGGTGA GCACGGAATC GGGCGTGGGC GTGCGGCACG ACGGACCGAT CAAGTCGGCG AACGACATTC GGGTGAGCGC GAACGGCGAG GTGACGCTGG GCGGGCCGCA GCGGGCGGCC CAGGAGACGG TTGCAGGAGC GCAGGCGGTA GGCGGCGCCG GCATGCAGAA CGTGATCGCG GGCGGCACGG TGAGCGTCTG CGCGCGTGGG CACGTCGCGA TCCAGGGCGC GATGATCGCG GGACAGGATG TGGATCTGCA GGGGAAAAGC GTGAAGGCTG GCCGGATGAG CGCGCAGCGC GACGCGCTGG TGACGGCGGC GGATGGCGTG ACGCTCGATG GTCCGGTGGA CGCGAAGCGT CACGTGTGGA TCGGAGCCCA CGGTGATGTG GTGATCCGTG AAGCGGCGGC GGGGCAGAAC GTGGTGCTGC TGGGGCGCAG CGTAACGGCC GGCCGGTTGG ACGCGCAGCG CGACGTATTG GCGGCGGCCC GCGACGGCGT GACGATCCAT GAAGCGGCGG CCGCGGGGCA GGACGTGGTG CTGCAGGGAA GCAGCGCGAG GGTCGGCCAG ATGAGCGCGC AGCGCGATGT GCTGGTGATG GCGGCAGATG GCGTGACGCT CGATGGGCCG GTGAGCGCGC AGCGCGCCGT ATGGGTCGAG ACCCAAGGTG ACGTGACGGG CAGTGAGTGG ATCAAGGCCG GACGGGACGT GCAAATCGGC GCGGCGGCGG ATCTGGCGGG CGCGGTAACG GCCGAAGAGA TGCAGCAACT CAAGGCCCAT GGTGACGCGG CGAACAGGCG GCGCGTCAAA GCCGGACGGA ACGAGCCAGC CGGCACGGCG GCTGAACGTC CGGCCGCGGC GGAGCAGACG GTGGCCGTCG CTGACGCGAT GCGCGAGATC GGCGTGGGCG GCGATCGGCT GTCCGGATTG GATGCCGCGC CGGGTACGCC GGGTACGCCC TTCGGCGCAC ACCCGCAAGC GATGTTCGAC GATCCGGCGG CGCAGATTGC GCGATCGGCT CGATCCACGG CAACGGCGGG CGGACATGCG GGTTCGTTCA TGCGCGTCGG AGACGGTTAC ATCGCCAAAA TGACCACGTC CAGAGAGGCG GAGATATACG AGAATTATCG CTTGGCTCTT GCCGGCGTCA TCCCCGACAC CGTGCCGCCT GAACAGATGG ATTCGCTGGC CGGTGTCACG GCCAGGCAGA GGCAGGCCAT GGCGAGTTTC AAGGAGTGGG CGGAGGTGAA CGACCAGCGG GTTGTCGTCA TGCAGGCGCT GGGCGCGGAG ATCGCGCCGG AGGACAAGAT CGAGCTGGAC GTCAAGATCG GCGCCAGTAC GGTGTCGCGC ACCGAGTTGA TCGGCGCCGG CAGGACTCGC TGGCAGGCCT TGAGCAAGAA GGTGAGATTG ACGGCGGCGG ACCTGCTGCG GGGCTCGCGT TCGCTGGTGG GCGACGATCG CGGCTATACG CTCGCCGGCC GCACGAGCGG GGGGATTGCC CTGGACGCGA GGAATTCACG CAACTCCGTC GGCCGATCCA GCGAATCGCT GATTCGCGAG GCGCTGGATC GCTCGCCCGA TACGCGCTGG CGGAACGCGC AGCACTTGCT CGGGCAGTTG CAGACCATTC GAGAGAAGAT GCACGCGTTG CCGCTCACCT TCGTCGCCTC CAGCGTCCTC ATTGCAATCG ACAAACGGAA ACCGGAAAAC TCGGTCGCCC GGCTGATCGA TCTCGCGCAC CCGGTGCAGC CTTTCGAAAA CGAAGCGGAC TATGAGAAAG TCAATCACCG CTTCGAGGAT GGTCTTGACA AGCTGATCAG ACTCTTCCAG CAGGTGGAAA AATAG
|
Protein sequence | MQRNEVVNRN VFRLVLNRVA GMPVPMPAAE VSRGRGKLGC GGVRAQRRGG AACAALLGVA GPSLAFAAVV ADPNGGAQRP GMATTANGTD LVNIVAPDAT GLSHNKFNEF SPVGRGVVLN NSVRPGESQI GGMAAQNPNL MQPATRALLE VTQQRSVLQG TLEAFGGKLD VLVANQHGVT INGLTTLNVG RLGVTTGQVL PQAAGQLRLG VTQGDVLIDH GGIDTQGLDM FDVVSRSIAV RGPIHDSSRA AGADVRLVAG ATAYDPQTGH YEAIAADESK APVQEGISGE LLGAMHGRHI VLVSTESGVG VRHDGPIKSA NDIRVSANGE VTLGGPQRAA QETVAGAQAV GGAGMQNVIA GGTVSVCARG HVAIQGAMIA GQDVDLQGKS VKAGRMSAQR DALVTAADGV TLDGPVDAKR HVWIGAHGDV VIREAAAGQN VVLLGRSVTA GRLDAQRDVL AAARDGVTIH EAAAAGQDVV LQGSSARVGQ MSAQRDVLVM AADGVTLDGP VSAQRAVWVE TQGDVTGSEW IKAGRDVQIG AAADLAGAVT AEEMQQLKAH GDAANRRRVK AGRNEPAGTA AERPAAAEQT VAVADAMREI GVGGDRLSGL DAAPGTPGTP FGAHPQAMFD DPAAQIARSA RSTATAGGHA GSFMRVGDGY IAKMTTSREA EIYENYRLAL AGVIPDTVPP EQMDSLAGVT ARQRQAMASF KEWAEVNDQR VVVMQALGAE IAPEDKIELD VKIGASTVSR TELIGAGRTR WQALSKKVRL TAADLLRGSR SLVGDDRGYT LAGRTSGGIA LDARNSRNSV GRSSESLIRE ALDRSPDTRW RNAQHLLGQL QTIREKMHAL PLTFVASSVL IAIDKRKPEN SVARLIDLAH PVQPFENEAD YEKVNHRFED GLDKLIRLFQ QVEK
|
| |