Gene BURPS1710b_1750 details


Gene Information

Locus tag: BURPS1710b_1750
Symbol:
ID: 3689716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 1893703
End bp: 1896927
Gene Length: 3225 bp
Protein Length: 1074 aa
Translation table: 11
GC content: 70%
IMG OID: 637728206
Product: haemagluttinin family protein
Protein accession: YP_333151
Protein GI: 76809734
COG category:
COG ID:
TIGRFAM ID:
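
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1893703
END_BP = 1896927


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3225 1074
```

Under these assumptions the computed values agree with the Gene Length (3225 bp) and Protein Length (1074 aa) fields.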


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGCGC CGGAGACGGC GCGTCGCAAA GGGAAAGGAC ATTCGCTGAC GATCGTGTGC 
GCGATCGCCT CAGGCCTGCT GCTTGCGGCG CCTGCGTGGG CGGACACGGT GTCGCCCTCG
GGCACGGATA ACGTCTACGG CGTCGACGCG ACCGATCCCG GCGTGTCGAC GAACCAGGGC
AATACGGCCT ACGGGGCGCA GGCGGGCGCG AAGGTCACGG GTTCGTACAA CACCGCGATC
GGGTATCAAG CAGGGCAGAA CGTGAACGCC ATCGATACCG TATCGATCGG CAAGCAGGCC
ACCGCGAGCG CGAATGACGC GATCGCGATC GGCACGAACA CGAAGGCGAG CGGGCCGGCC
GACATCTACA TGGGGCTGAA CGCAGGCGCC GGCGCCGGCT CGACGACGAG CCCGGACGGC
ACCGTCACGC TCGGCATTCG CAACATGGGC CTCGGGGAAT CCGCGGGCTC GTACGTGACG
GGCCAGAACA ACACGGGGAT CGGCTATCAG TCGGGCATGA ACGTGACGGG CGACCAGAAC
GTCGGCCTCG GGCAGCAGGC GGGACAATTC GTGACCGGGA CCGGCAACTC GGCGATGGGG
CATCTGGCGG GGTCGACGGT GTCGGGCAGC TACAACGCCG CGTTCGGCGA GTATGCGGGG
ACCAACACGA GCGGCGGCGC CAATGCCGCG TTCGGCTTCT ATGCGGGGCG CTACATCAAC
GGCACGAACA ACACGGCGCT CGGTGCGTAC GATCTGCCGG TCGTCAATGG CACCTGGTAC
GGTTCGTACG TGACGGGCAG CAACAACCTC GGCGCCGGCC ATAATTCGGG CGCCTACGTG
AGCGGCGCGA GCAACGTCGG GCTCGGCGAC GGCGCGGGCA CGTTCGTGAC CGGCAGCAAC
AACGTCGCCA TCGGCACGGC AGCGGGCTCG GGCGCGTATA CCAGCGGTCC GAGCGGCGCG
ACGCTCAATG CGGCGCTCGT CGCGAGCAAC ACCGTGAGCA TCGGTACCCG CGCCACGGCG
AGCCAGAGCG ACGCGATCGC GATCGGCAAG GGCGCGACCG CGAGCGGCGC GCAATCGATC
AGCATCGGCA CCGGCAACGT CGTGAGCGGC AAGGGAAGCG GTGCGATCGG CGATCCGAGC
ACCGTCAGCG GCGCGGGGTC CTATTCGATC GGCAACAACA ATACCGTCGC GAACAGCAAC
ACGTTCGTGC TCGGCAACGG CGTGACGACG ACGCAGGACA ACAGCGTCGT GCTCGGCAAT
CAGAGCACCG ACCGCGCGGC CGTCGCGGTT TCGAGCGAAA CCATCAATGG CACGACGTAC
AACTACGCGG GCGTCGCGAG CCCGGCCAAC GGCGTCGTCA GCATCGGCGG CGTGGGCACG
GAGCGCCAGC TCATCAACGT GGCGGCGGGC CAGGTGAGCG CGACCAGCAC GGACGCGATC
AACGGCAGCC AGCTGTACGC GACGAACCAG GCGGTGATCG CCGAGGACGC GAAAGTGAAT
TCGCTCGGCG GCGGCGTGGC GAGCGCGCTC GGCGGCAACG CGGCGTACAA CGCGACGACC
GGCGCGATCA CCGCGCCGAG CTACGCGGTC TACGGGACCA CGCAAAACTC CGTGGGCGGC
GCGATCGATG CGCTGCAGGC CCTCGCGCCG CTGCAGTACA CGTCCGGCCC GGGCGTGACC
ACGCCGAACG CGCCGGGATC GGCGCCGACG AACACGGTGA CGCTCGTCGG CGCCGGCGGG
CCGGGAGCCA ATACCACGCC GGTGACGCTC ACGAACGTCG CGCCGGGCAA ACTCTCCGCG
ACCAGCACGG ACGCGGTCAA CGGCTCGCAG CTCTACGCGA CCAACCAGCA GGTCGCGAAC
CTCGTGAGCT CGGTGAACAA CGGCGGCGTC GGCCCGGTGC AGTACAGCGA TCCTAGCGCG
CCGACGACGC CCAACGGCGG CAAGCCCTCG CAGGACCTGA CGCTCGTCGG CGCGGCAAGC
GGCCCTGTCG CGCTGCATAA CGTCGCGCCG GGCACGGCGT CCACCGATGC GGTCAACGTC
GGGCAGCTCG GCGCGGTGAC GACCGGCCTG GGCGGCGGCG CGGCGATCGA TCCGAAGACG
GGCGCCGTGA CCGCGCCGTC GTACACGGTC TACAACGCCG ACGGCACGAC GTCGAACGTC
GGCAACGTCG GCGCGGCGAT CGATGCGATC AACTCGACCG GCATCAAGTA TTTCCACGCG
AACAGCACGA AGCCGGACAG CCAGGCGCTC GGCGCGGACA GCGTCGCGAT CGGCCCGAAC
GCCGTCGCGA ACAACGCGGG CGACGTCGCG CTCGGTTCGG GAGCGGTCAC GTCGCAAGCG
GGCGGCACGC TGAGCGAAAC GATCAACGGC GTGACCTACT CGTTCGCCGG CACGACGCCG
ATCGGCACGG TGAGCGTCGG CGCGCCGGGC GTCGAGCGCA CGATCACCAA CGTTGCCGCG
GGGCGCATCG GGCAGTCGAG CACGGACGCG ATCAACGGCT CGCAACTGTA CGGCACCAAC
CAGTCGATCG AGGCGTTGAC GGACAAGATG AACAGCCTCG GCAACACCGT GGCGAACACG
CTCGGCAGCG GCGCGTCGTA CAACCCGCAA ACAGGCGCGG TGAACGGCCC GGCCAACTCG
GGCGGCGTGG TCACGCCCAC GGTGATCCAG GAGGCGGCGA ACAAATGGGT GAGCGCCAAT
CCGTCGACCT ACGTGGCGCC CGTCGCGACG GGCACGAACG GCATGGCGGT CGGCAGCGGC
GCGGTTTCGA CGGGCCAGAA CTCGGTCGCG CTCGGCACGA ACGCGTCGGA CGGCGGCCGC
TCGAACGTCG TGAGCGTCGG GGCGCCGGGC GCGGAGCGCC AGGTGACGAA CGTGGCGGCC
GGCACGCAGG CGACCGATGC GGTCAACCTC GGGCAGATGA ACGGCGCGCT CGCGCAGCAA
ACCGACAGCT TCAACCAGCG GCTGGGCGCG GTTCAGCAGG ACGTCGACAA CGTCGCGCGC
GCCGCCTACG GCGGCATCGC GGCCGCGACC GCGCTCACGA TGATCCCCGA GGTCGACAAG
GACAAGACGA TCGCGGTGGG CATCGGCGGC GGCACGTATC GCGGCTACCA GGCGGTGGCG
CTCGGCGCGA CGGCGCGCAT CACCGAGAAC ATCAAGGTTC GTGCGGGCGT CGGCATGAGC
TCGGGCGGGA CGACGGCCGG CATCGGCGCA TCGATGCAGT GGTAA
 
Protein sequence
MVAPETARRK GKGHSLTIVC AIASGLLLAA PAWADTVSPS GTDNVYGVDA TDPGVSTNQG 
NTAYGAQAGA KVTGSYNTAI GYQAGQNVNA IDTVSIGKQA TASANDAIAI GTNTKASGPA
DIYMGLNAGA GAGSTTSPDG TVTLGIRNMG LGESAGSYVT GQNNTGIGYQ SGMNVTGDQN
VGLGQQAGQF VTGTGNSAMG HLAGSTVSGS YNAAFGEYAG TNTSGGANAA FGFYAGRYIN
GTNNTALGAY DLPVVNGTWY GSYVTGSNNL GAGHNSGAYV SGASNVGLGD GAGTFVTGSN
NVAIGTAAGS GAYTSGPSGA TLNAALVASN TVSIGTRATA SQSDAIAIGK GATASGAQSI
SIGTGNVVSG KGSGAIGDPS TVSGAGSYSI GNNNTVANSN TFVLGNGVTT TQDNSVVLGN
QSTDRAAVAV SSETINGTTY NYAGVASPAN GVVSIGGVGT ERQLINVAAG QVSATSTDAI
NGSQLYATNQ AVIAEDAKVN SLGGGVASAL GGNAAYNATT GAITAPSYAV YGTTQNSVGG
AIDALQALAP LQYTSGPGVT TPNAPGSAPT NTVTLVGAGG PGANTTPVTL TNVAPGKLSA
TSTDAVNGSQ LYATNQQVAN LVSSVNNGGV GPVQYSDPSA PTTPNGGKPS QDLTLVGAAS
GPVALHNVAP GTASTDAVNV GQLGAVTTGL GGGAAIDPKT GAVTAPSYTV YNADGTTSNV
GNVGAAIDAI NSTGIKYFHA NSTKPDSQAL GADSVAIGPN AVANNAGDVA LGSGAVTSQA
GGTLSETING VTYSFAGTTP IGTVSVGAPG VERTITNVAA GRIGQSSTDA INGSQLYGTN
QSIEALTDKM NSLGNTVANT LGSGASYNPQ TGAVNGPANS GGVVTPTVIQ EAANKWVSAN
PSTYVAPVAT GTNGMAVGSG AVSTGQNSVA LGTNASDGGR SNVVSVGAPG AERQVTNVAA
GTQATDAVNL GQMNGALAQQ TDSFNQRLGA VQQDVDNVAR AAYGGIAAAT ALTMIPEVDK
DKTIAVGIGG GTYRGYQAVA LGATARITEN IKVRAGVGMS SGGTTAGIGA SMQW
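
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first ten codons of the gene. The following is a minimal sketch, not part of the original record. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares; table 11 differs in the start codons it permits, and this gene begins with ATG, so no special start handling is included. The function names are hypothetical.

```python
# Codon table built from the standard assignments (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the record's 10-base blocks.
first_codons = "ATGGTGGCGC CGGAGACGGC GCGTCGCAAA"
print(translate(first_codons))  # MVAPETARRK
```

The output matches the first block of the protein sequence (MVAPETARRK). Passing the full gene sequence to the same function should give the complete protein if the two sequences in this record are consistent.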