Gene BURPS1106A_A0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0224 
Symbol 
ID4903932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp205454 
End bp207595 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content65% 
IMG OID640143331 
Producthypothetical protein 
Protein accessionYP_001074267 
Protein GI126456001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGCGGC AAAGCGGTGA AGCGGCGGAT CGGCGGGATA GCGAAACGAT TCGGCTGAAC 
AACCCGAACG CGGGACGCGG GATGGGGGAC GCCCGGCAGA CGCAACGCGG CGGCAGTGAC
GTGGCGACCA ATCAACGAGG GCGGCGACGA TTCGTTCGTC GTACTCACCT ATGTCATACC
CGGCGACTCA TACTGTCGAC GGGGCAGATC GGGAGAGCAT CACTCATGGT CAGGAAACGA
ATCCGTGCCG TGTTCGCCGC CGCGCTCGTG GGGCTGGCCG CGCACGCGGC GCACGCGCAA
TACACGACGG ACTGGATCGC GAACACTTAC GGGACCATCG CGTCGCACGT CGGCAACAAC
GCGCGATCGA TGTGGGTCTC GCCCGAAGGC GTGATCTATA CGGCATCCTT CTGGGACGAA
AACGCCGGCG GCGTCGCGAT CTATCAGAAC GGCAAGACCC TCGGCTCGAT CGGCACGCAC
GCCGAATTCC AGGGCGGCGC GATCACCGGC AACGCGACGT CGATCTTCGC GGCGATGCAG
TACAGCACCC CGCAGGGCAG CGGCACGGTC GGCCGCTACA ACCGCGCGAC GCTGCAGCGC
GATCTCACGA TCCCGGTCAG CGTGTGGAAC GCGGTCAGCC GAGCCGACGT GATCACCGGC
CTCGCGACCG CGGGCACGCT GCTGTACGCG AGCGACTACT TCGGCAACCG CGTGCGCGTG
TTCACCACCG ACGGCGTCTG GCAGCGCGAC ATCGGCATCG CGAACCCCGG CGCGCTCGCG
CTCGACGATG CCGGCAATCT GTGGGTCGCG CAGAAGAATG CGGCGAAGAT CGTCGAATTC
AGCCCCACCG GCGCGCTGAT GAACACGATC CAGATGGCGA GCGCGTCGCG GCCGGCTTCG
CTGTACTTCG ATGCGTCGAA GCGGCAACTG ATGATCGGCG ATCAAGGGCC CGACATGAAC
ATCAAGCTCT ACGCGATCGC GGGCATGCCG AAGCAGGTCG GCACGTTCGG CGTGCAGGGC
GGCTATCTCG ACACGACGAC GGGCATCAGG GGCCAAGTCG GCGACCGGCG CTTCACGCGC
GTGGTCGGCA TCGGCAAGGA TGCCGCCGGC ACGCTGTACG TGCTCAACAA TCCGTGGGGC
GGCGGCTGGG ACCTCGGGCG CAACGGCGCG ACCGACATTC ACGCGTACGA CGCGCTCGGC
AACGCGCTGT GGAAGCTGCA GGCGCTGAAC TTCGAGGCGA TCGCCGCGCC GGACCCGACG
ACCGACGGCG CGCTGTTCTA CAGCGGCATG AACGTCTATT CCGGCACCGC GGGCGGCGCC
TTCGTCGCGA ACACGGTCGA TCCGTTCACG TACCCGTCCG ATCCACGTCT CGACATGAAC
GATTATCAGC GCGGCCAGCA TTTCGGCCAG CTCGTGAGCG TCGGCGGCCA CAAGATTCTC
GTCGCGTCGG GGCAGAATCC GGGCAACTTC AATTTCTATC ACTTCAACGC GGCGAGCGGC
TACATCGCGA TTCCCGATGC GTCGCTGCCG GGCAAGGGGT TCAACACATC GCTGCAGGTG
ACGGCCGGCT TTTCGATCGA CAACAAGGGC GACGTGTGGG CGGGCCTCAA TGGAACGAAC
GCGATCTCGC ACTATCCGCT CGCGGGAATC GATGCGAGCG GCAAGCCATC ATGGGGCGCG
CCCACCTCGA TCCCGACGCC GGCGAGCGTC CAGCCGACCA CGCGCATTCT CTACCTGTCG
GACAGCGATA CGATGATCCT CGCGCAGGGC ATCGCGGGAA GCTGGGACTG GACCGCGATG
AACGGCCGGA TCGAGGTGTA TCACGGCTGG AGCGCGGGCA ACGTCACGCA GCCGAACCCG
GTGATCGCGC TCACGAGCGC CAATCCGAAA TCGATCGCGT CGGCCGGCAA CTATCTGTTC
GTCGGGTACG TGCATACGGT GCCGAACATC GACGTGTTCG ATCTCAACAC GGGCCAGCTC
GTCGCCACGC TGACCAATTC GAACACGGGC ATGATGGACG TGGGCAACGA CGTCGATTCG
ATGTACGGCC TGAGGGCGTA TCTGCGCTCG ACCGGCGAAT ACGTGATCAC GAAGGACAAC
TACAACGGAT CGAGCATCGT CGTCTATCGC TGGCGGCCGT GA
 
Protein sequence
MKRQSGEAAD RRDSETIRLN NPNAGRGMGD ARQTQRGGSD VATNQRGRRR FVRRTHLCHT 
RRLILSTGQI GRASLMVRKR IRAVFAAALV GLAAHAAHAQ YTTDWIANTY GTIASHVGNN
ARSMWVSPEG VIYTASFWDE NAGGVAIYQN GKTLGSIGTH AEFQGGAITG NATSIFAAMQ
YSTPQGSGTV GRYNRATLQR DLTIPVSVWN AVSRADVITG LATAGTLLYA SDYFGNRVRV
FTTDGVWQRD IGIANPGALA LDDAGNLWVA QKNAAKIVEF SPTGALMNTI QMASASRPAS
LYFDASKRQL MIGDQGPDMN IKLYAIAGMP KQVGTFGVQG GYLDTTTGIR GQVGDRRFTR
VVGIGKDAAG TLYVLNNPWG GGWDLGRNGA TDIHAYDALG NALWKLQALN FEAIAAPDPT
TDGALFYSGM NVYSGTAGGA FVANTVDPFT YPSDPRLDMN DYQRGQHFGQ LVSVGGHKIL
VASGQNPGNF NFYHFNAASG YIAIPDASLP GKGFNTSLQV TAGFSIDNKG DVWAGLNGTN
AISHYPLAGI DASGKPSWGA PTSIPTPASV QPTTRILYLS DSDTMILAQG IAGSWDWTAM
NGRIEVYHGW SAGNVTQPNP VIALTSANPK SIASAGNYLF VGYVHTVPNI DVFDLNTGQL
VATLTNSNTG MMDVGNDVDS MYGLRAYLRS TGEYVITKDN YNGSSIVVYR WRP