Gene Information |
Locus tag | BURPS1106A_A2341 |
Symbol | |
ID | 4905635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2320691 |
End bp | 2323396 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640145446 |
Product | hemagglutinin-like protein |
Protein accession | YP_001076374 |
Protein GI | 126455972 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
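The coordinate and length fields above can be cross-checked against one another. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
start_bp = 2320691
end_bp = 2323396

# An inclusive interval spans (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is the stop
# codon (assumed here), which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 2706 901
```

Both values agree with the "Gene Length" and "Protein Length" rows of the record.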
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTGCTGA ACAGGGTGGC GGGCATGCCG GTGCCGATGC CGGCGGCGGA GGTGTCGCGC GGGCGCGGCA AGCTCGGCTG CGGCGGCGTG CGGGCGCAAC GTCGCGGCGG TGCGGCGTGC GCGGCGCTGC TTGGGGTGGC CGGGCCGTCC TTGGCGTTCG CGGCGGTGGT GGCGGACCCG AACGGGGGCG CGCAGCGGCC CGGCATGGCG ACGACGGCGA ACGGGACGGA CCTGGTCAAT ATCGTCGCGC CGGACGCGAC GGGGTTGTCG CACAACAAGT TCAACGAGTT CAGCCCGGTT GGACGCGGCG TGGTGTTGAA CAACAGCGTG CGGCCCGGGA AATCGCAGAT CGGCGGCATG GCGGCGCAGA ACCCGAACTT GATGCAACCG GCCACCCGGG CATTGCTCGA GGTGACGCAG CAACGCAGCG TGCTGCAGGG CACGCTGGAG GCGTTCGGCG GCAAGCTCGA CGTGCTGGTG GCGAACCAGC ATGGAGTGAC GATCAACGGC TTGACGACGC TGAACGTGGG CCGGCTCGGC GTGACGACGG GGCAGGTGCT GCCGCAAGCG GCCGGGCAGT TGCGTTTGGG CGTGACGCAA GGCGACGTGC TGATCGACCA TGGGGGCATC GATACCCAGG GCCTGGACAT GTTCGACGTG GTGAGCCGCA GCATCGCCGT GCGCGGGCCG ATCCACGATT CGAGCCGCGC CGCGGGCGCC GACGTGCGCC TCGTGGCGGG CGCGACGGCC TACGATCCGC AGACCGGTCA TTATGAGGCG ATCGCGGCGG ACGAATCGAA GGCGCCGGTG CAGGAGGGAA TCAGCGGCGA ACTGCTGGGA GCGATGCACG GCCGTCACAT TGTGCTGGTG AGCACGGAAT CGGGCGTGGG CGTGCGGCAC GACGGACCGA TCAAGTCGGC GAACGACATT CGGGTGAGCG CGAACGGCGA GGTGACGCTG GGCGGGCCGC AGCAGGCGGC TCAGGAGGCG GTTGCAGGAG CGCAGGCGGT AGGCGGCGCC GGCATGCAGA ACGTGATCGC GGGCGGCACG GTGAGCGTCT GCGCGCGTGG GCACGTCGCG ATCCGGGGCG CGGTGATCGC GGGACAGGAT GTGGATCTGC AGGGGAAAAG CGTGAAGGCC GGCCGGATGA GCGCGCAGCG CGACGCGCTG GTGACGGCGG CGGATGGCGT GACGCTCGAT GGTCCGGTGG ACGCGAAGCG TCACGTGTGG ATCGGAGCCC ACGATGATGT GGTGATCCGT GAAGCGGCGG CGGGGCAGAA CGTGGTGCTG CTGGGGCGCA GCGTAACGGC CGGCCGGTTG GACGCGCAGC GCGACGTATT GGCGGCGGCC CGCGACGGCG TGACGATCCA TGAAGCGGCG GCCGCGGGGC AGGATGTGGT GCTGCAGGGA AGCAGCGCGA GGGTCGGCCA GATGAGCGCG CAGCGCGATG TGCTGGTGAT GGCGGCAGAT GGCGTGACGC TCGATGGGCC GGTGAGCGCG CAGCGCGCCG TATGGGTCGA GACCCAAGGT GACGTGGCGG GCAGTGAGTG GATCAAGGCC GGACGGGACG TGCAAATCGG CGCGGCGGCG AATCTGGCGG GCGCGGTAAC GGCCGAAGAG ATGCAGCAAC TCAAGGCCCA TGGTGACGCG GCGAACAGGC GGCGCGTCAA AGCCGGGCGG AACGAGCCAG CCGGCACGGC GGCTGAACGT CCGGCCGCGG CGGAGCAGAC GGTGGCCGTC GCTGACGCGA TGCGCGAGAT CGGCGTAGGC GGCGATCGGC TGTCCGGATT GGATGCCGCG 
CCGGGTACGC CGGGTACGCC CTTCGGCGCA CACCCGCAAG CGATGTTCGA CGATCCGGCG GCGCAGATTG CGCGATCGGC TCGATCCACG GCAACGGCGG GCGGACATGC GGGTTCGTTC ATGCGCGTCG GAGACGGTCA CATCGCCAAA ATGACCACGT CCAGAGAGGC GGAGATATAC GAGAATTACC GCTTGGCTCT TGCCGGCGTC ATCCCCGACA CCGTGCCGCC TGAAGAGGTG GATTCGCGGG TCGGTGTCAC GGCCAGGCAG AGGCAGGCCA TGGCGACTTT CAAAGGGTGG GCGGAGATGA AAGGCCAGCG GGTTGTCGTC ATGCAGGCGC TGGGCGCGGA GATCGCGCCG GAGGACAAGA TCGAGCTGGA CGTCAAGATC GGCGCCAGTA CGGTGTCGCG CACCGAGTTG ATCGGCGCCG GCAGGACTCG CTGGCAGGCC TTGAGCAAGA AGGTGAGATT GACGGCGGCG GACCTGCTGC GGGGCTCGCG TTCGTTGGTG GGCGACGATC GCGGCTATAC GCTCGCCGGC CGCACGAGCG GGGGGATTGC CCTGGACGCG AGGAATTCAC GCAACTCCGT CGGCCGATCC AGCGAATCGC TGATTCGCGA GGCGCTGGAT CGCTCGCCCG ATACGCGCTG GCGGAACGCG CAGCACTTGC TCGGGCAGTT GCAGACCATT CGAGAGAAGA TGCACGCGTT GCCGCTCACC TTCGTCGCCT CCAGCGTCCT CATTGCAATC GACAAACGGA AACCGGAAAA CTCGGTCGCC CGGCTGATCG ATCTCGCGCA CCCGGTGCAG CCTTTCGAAA ACGAAGCGGA CTATGAGAAA GTCAATCACC GCTTCGAGGA TGGTCTTGAC AAGCTGATCA GACTCTTTCA GCAGGTGGAA AAATAG
|
Protein sequence | MVLNRVAGMP VPMPAAEVSR GRGKLGCGGV RAQRRGGAAC AALLGVAGPS LAFAAVVADP NGGAQRPGMA TTANGTDLVN IVAPDATGLS HNKFNEFSPV GRGVVLNNSV RPGKSQIGGM AAQNPNLMQP ATRALLEVTQ QRSVLQGTLE AFGGKLDVLV ANQHGVTING LTTLNVGRLG VTTGQVLPQA AGQLRLGVTQ GDVLIDHGGI DTQGLDMFDV VSRSIAVRGP IHDSSRAAGA DVRLVAGATA YDPQTGHYEA IAADESKAPV QEGISGELLG AMHGRHIVLV STESGVGVRH DGPIKSANDI RVSANGEVTL GGPQQAAQEA VAGAQAVGGA GMQNVIAGGT VSVCARGHVA IRGAVIAGQD VDLQGKSVKA GRMSAQRDAL VTAADGVTLD GPVDAKRHVW IGAHDDVVIR EAAAGQNVVL LGRSVTAGRL DAQRDVLAAA RDGVTIHEAA AAGQDVVLQG SSARVGQMSA QRDVLVMAAD GVTLDGPVSA QRAVWVETQG DVAGSEWIKA GRDVQIGAAA NLAGAVTAEE MQQLKAHGDA ANRRRVKAGR NEPAGTAAER PAAAEQTVAV ADAMREIGVG GDRLSGLDAA PGTPGTPFGA HPQAMFDDPA AQIARSARST ATAGGHAGSF MRVGDGHIAK MTTSREAEIY ENYRLALAGV IPDTVPPEEV DSRVGVTARQ RQAMATFKGW AEMKGQRVVV MQALGAEIAP EDKIELDVKI GASTVSRTEL IGAGRTRWQA LSKKVRLTAA DLLRGSRSLV GDDRGYTLAG RTSGGIALDA RNSRNSVGRS SESLIREALD RSPDTRWRNA QHLLGQLQTI REKMHALPLT FVASSVLIAI DKRKPENSVA RLIDLAHPVQ PFENEADYEK VNHRFEDGLD KLIRLFQQVE K
|
| |
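The relationship between the gene sequence, the protein sequence, and the "Translation table | 11" and "GC content" fields can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline. It assumes the codon assignments and alternative start codons of the bacterial genetic code (table 11), with any recognized start codon read as methionine at the first position. The names `translate` and `gc_fraction` are hypothetical helpers. The displayed gene sequence is treated as already being in the coding direction, which is consistent with its first codons matching the start of the protein sequence.

```python
# Codon table for translation table 11, built in TCAG base order.
# The amino acid assignments are the same as in the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons recognized under table 11 (assumption stated in the lead-in).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # The initiator codon is read as methionine, whichever start codon it is.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence shown above.
first_30 = "TTGGTGCTGA ACAGGGTGGC GGGCATGCCG"
print(translate(first_30))  # MVLNRVAGMP
```

The gene begins with TTG, which would encode leucine inside a reading frame but is read as methionine at the start position, so the output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the "GC content" row.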