Gene BURPS1106A_A2233 details


Gene Information

Locus tag: BURPS1106A_A2233
Symbol:
ID: 4903511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2219567
End bp: 2221987
Gene Length: 2421 bp
Protein Length: 806 aa
Translation table: 11
GC content: 67%
IMG OID: 640145338
Product: ricin-type beta-trefoil lectin domain/galactose oxidase domain-containing protein
Protein accession: YP_001076266
Protein GI: 126456133
COG category:
COG ID:
TIGRFAM ID:
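
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2219567
END_BP = 2221987
GENE_LENGTH_BP = 2421
PROTEIN_LENGTH_AA = 806


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2421
print(expected_protein_length(GENE_LENGTH_BP))  # 806
```

Under these assumptions both results agree with the listed gene length and protein length.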


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTTCT TTCGATTCGC GTGGCGGCAC GGCCGATATT GGCTTTGCCT GCTGATTGCG 
AGCGTATTGC TGCCGCAAGG CGCGGCGGCG GCCGTCACGG CGTCCTACAC GGACATCGTC
GGCGCGGACA GCGGGCTGTG CGTCTCGACG GCCGGCAACT CGAGCGCATC GGGCGCGGGC
GTCGTGCAGA CCTCGTGCGC GGGCCTGAGC AACACGACAT GGTCGTTCGT TCCCGTCGGC
AACCGCTATC ACATCGTGCT GCAAGGCAGC GGCCTGTGCC TGAACGTGCC CGGGGGCTCG
CTGAACAGCG GCACGCAATT GATCCAGTAC GCATGTCAGG GCAACGGCCA GACCAATGAC
CAATGGACGG TCGTCGCGGT CGGCTCGAGC TACCGGATCG TGTCCGCGTC GAGCGGCATG
TGCGTGAACG TGAGCGGCGC ATCGCACGCG AGCGGCGCGG CGCTGATCCA GTATCCGTGC
CAGGGCGCGG GCGCGCTCAA CGATCAATTC AATCTGTATC TGCCCGTCGT CGCGGCCACC
AACGTCACGG CGGCGAACAG CAACCTCTGC GTGAGCGTCA ATGGCGGATC GACGGCCGCC
GGCGCATCGA TCGTCCAGGG AACATGCTCG AATCAGGGCG CCACGGGCTG GTCGCTGCTG
CCCGCCGGCA GCGGCTATCA CGTCGTCTCG CAAGGCACCG GGCAATGCCT GAACGTATAT
GGCGGATACA CGACGAGCGG CGCGCCGATC ATCCAGTATC CGTGTCAGGG CGACGCGCAG
ACGAACGATC AATGGACGCT CGTGCCCGTC GGCTCGAAGT ACCGGCTGAT TTCGGTGTCG
AGCGGCATGT GCCTGAACGT GAGCGGCGGC TCGCTGTCGC CGGGCGCGCC GCTGATCCAG
TACCCGTGCC AGGGCGCGAA CGCGCTCAAC GACCAGTTCT CGCTCAGCCT GCCGCAGACT
TTCCCGGTCA CGCTGCCGTC CGCATGGAGC CCCGTGATTC CGCTGCCCGT CAATCCGATC
GGCATCGCGA ACGCGCCGAA CGGCAAGCTC GTGATGTGGT CCGCTGATCA GCAATTGAGC
TTCCAGAACG ACGTCGGCAG CAAGGCGACG CAGACGCAGA CCGCGGTGTT CGACCCGGCG
ACGAACACCG CGACGCAGTA TCTCGAAACC TCGGCGGGCT CGGACATGTT CTGCACCGGC
ACCGCGATGC TGCCCGACGG CCGCCTGCTC GTGAACGGCG GCGACAGCAG CCCGAAGACG
ACGCTGTACG ACTGGACGAC CAATACGTGG AGCGCCGCGG CGACGATGAA CATTGCGCGC
GGCTATCAGG GCGACACGCT GCTGTCGAAC GGCTCGGTGC TCACGCTCGG CGGCTCGTGG
AGCGGCGGTC AGGGCGGCAA GACCGCCGAA GTGTGGACGA ACGGCGGCGC GTGGACGCTG
CTGCCCGGCG TGCCCGAGAC GAACATCGTC GGCCCCGATC CGCAGGGTAT CTATCGCGGC
GACAATCATC TGTGGCTGTT CGCGCAAGGC AACGGCACGG TTTTCCATGC GGGGCCGAGC
TCGCAGATGA ACTGGATCTC GACGGCGGGC GGCGGCTCGA TCCAGTCGGC GGGCATGCGC
GGCGTCGATC CGTTCAGCAT CAACGGCACC GCGTCGCTGT ACGACGTCGG CAAGATCCTG
AAGGCGGGCG GCGCGAAATC GTACCAGCAG AACGGCAGCG TCACGACCTA TGCGTCGAAC
TCGGTGTACC AGATCGACAT CACGCGCGGG CCGAACCAGC CGGCATCGGT GCAGCGCCTG
AACGGCATGA CGTACCAGCG CGCGTTCGCC AACAGCGTGA TCCTGCCGAA CGGCAGCATC
GTGATGATCG GCGGCCAGAG CGTGCCGATG CCGTTTACCG ACACGACCGC GATCATGGTC
CCCGAAATCT GGGACCCGGC GACGCAACGC TTCAACCTGC TCAAGCCGAT GCAGACGCCG
CGCACCTATC ACAGCACGGC GATCCTGATG GCGGACGGCC GCGTGTTCGC GGGCGGCGGC
GGCCAGTGCG GCACCGGCTG CGCAATGAAC CATCTGAACG CGGAGATCCT CACGCCGCCT
TACCTGCTCA ACGCGGACGG CACGCCGGCG CCGCGGCCGG TGATCACGAA CGCGCCGGCC
ACGGCGAAGC TCGGCGCGAC GATCGCCGTG TCGACGCAAG GCCCCGTCGC GTCGTTCGTG
CTGATGCGCC TGTCGTCCGT CACGCATACG ACGAACAACG ATCAGCGGCG CATTCCGCTC
GCGATCGCGT CGTCCGGCGG CACGAGCTAC CAGCTCGCGA TTCCCGCCGA TCCCGGCGTG
GTCCTGCCCG GCTACTACAT GCTGTTCGCG CTCAACGCGC AAGGCGTGCC GAGCGTGTCG
GCATCGATCC GGATCTCATG A
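
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any calculation such as GC content. The following is a minimal sketch of that cleaning step and a GC percentage calculation. The helper names are hypothetical, and the demo input is only the last line of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence above, used only as a short demo input.
fragment = "GCATCGATCC GGATCTCATG A"
print(len(clean_sequence(fragment)))  # 21
print(f"{gc_percent(fragment):.1f}%")
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the Gene Information section.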
 
Protein sequence
MHFFRFAWRH GRYWLCLLIA SVLLPQGAAA AVTASYTDIV GADSGLCVST AGNSSASGAG 
VVQTSCAGLS NTTWSFVPVG NRYHIVLQGS GLCLNVPGGS LNSGTQLIQY ACQGNGQTND
QWTVVAVGSS YRIVSASSGM CVNVSGASHA SGAALIQYPC QGAGALNDQF NLYLPVVAAT
NVTAANSNLC VSVNGGSTAA GASIVQGTCS NQGATGWSLL PAGSGYHVVS QGTGQCLNVY
GGYTTSGAPI IQYPCQGDAQ TNDQWTLVPV GSKYRLISVS SGMCLNVSGG SLSPGAPLIQ
YPCQGANALN DQFSLSLPQT FPVTLPSAWS PVIPLPVNPI GIANAPNGKL VMWSADQQLS
FQNDVGSKAT QTQTAVFDPA TNTATQYLET SAGSDMFCTG TAMLPDGRLL VNGGDSSPKT
TLYDWTTNTW SAAATMNIAR GYQGDTLLSN GSVLTLGGSW SGGQGGKTAE VWTNGGAWTL
LPGVPETNIV GPDPQGIYRG DNHLWLFAQG NGTVFHAGPS SQMNWISTAG GGSIQSAGMR
GVDPFSINGT ASLYDVGKIL KAGGAKSYQQ NGSVTTYASN SVYQIDITRG PNQPASVQRL
NGMTYQRAFA NSVILPNGSI VMIGGQSVPM PFTDTTAIMV PEIWDPATQR FNLLKPMQTP
RTYHSTAILM ADGRVFAGGG GQCGTGCAMN HLNAEILTPP YLLNADGTPA PRPVITNAPA
TAKLGATIAV STQGPVASFV LMRLSSVTHT TNNDQRRIPL AIASSGGTSY QLAIPADPGV
VLPGYYMLFA LNAQGVPSVS ASIRIS
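
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, which this sketch ignores. The table construction and the `translate` function are illustrative assumptions, not the method used to produce this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGCATTTCT TTCGATTCGC GTGGCGGCAC"))  # MHFFRFAWRH
print(translate("GCATCGATCC GGATCTCATG A"))           # ASIRIS
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above. The final fragment is in frame because the 2400 bases before it are a multiple of three.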