Gene BURPS1710b_A1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1957 
Symbol 
ID3692708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2390813 
End bp2393743 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content69% 
IMG OID637732211 
Producthypothetical protein 
Protein accessionYP_337108 
Protein GI76819107 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.52891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCGCAT GCGCCCGTCC TTTCGACGCC GCCGCGCACG TCGAAGAAAC GATCGCCGCG 
CCGACGCATC TCGGCGACTG GCTCGCCGCC CATCAGTCGA CGCCCGCCGT CGGAACGCCG
CCGTCCGGCG CGCCTTCGCC GTACCTCGGC GGCCTGAGCT GGCGCTCGAA CCGCGAAGTC
GCCGCACAGC AGGCGAGCAA GCGCCGCCTG CTCGCCGGCA TCGACGCGCT GCCCGCGCTC
ACGCCGGCCG CGCAGGCGGC GCGGGCACGC CTCTCGGCGA TGATCGCCGC GCGTGCCGCA
ACCGGCCGCG TGATCGTCGC GCGAAGCGAT GCGCGCTGGC TGCAGGCCAA TCCCGCCCAC
GATCCCTATC TCGAAGCCGG CGACGTCGTG ACGATTCCCG ATCGCCCGTC GAGCGTCGCC
GTCGTGCGCG CGGACGGCTC GATCTGCACG GTCGCTCACG TGCAGGACGT CGAAGCATTG
CCGTACGTGC TCGCGTGCGC CCCCGACGCG GCGCCCGATC TCGCGTGGAT CGCGCAACCC
GACGGCACGG TCAGCGAAAG CAAGGTGGCG ATGTGGAATC GCGACGTGCA GGACGCGCCG
GCGCCCGGCA GTTGGATCTG GGCGCCCGAT CGGGGCAGCC GATGGCCGCC GGCCCTGTCG
CGCGCCCTGG CGGAATTCAT GGCGACGCAG GGCGTATCCG GGCTCGCGGA CGACGGCTCG
CCGCTGCCCG CGCCTCCCAT TGCGCCCGTC CACCAGACCG CGTTTCCGAG CGGCGCGCCC
GGCCGGTCCG CAGCGTTCCC GGTAACGGGC GGCGACTGGG GCACGGCGGG CATTCTGCAA
ACGCCGACGG CGCGAATGAA CGACGCCGGC GAAGCATCGC TCAGCATGAG CCACGTGAGC
CCGTACACGC GCCTGAACTT CACGCTGCAG CCGCTCGATT GGCTCGAAAT CGGGTTCCGC
TACACCGACG TCAGCAATCA GCCGTACGGC CCCGTCTCGC TGAGCGGCAC CCAGTCGTAC
AAGGACAAGA GCATCGACGC GAAGCTCAGG CTGTGGCGCG AATCCGCCTA TCTGCCCGAC
GTGGCCGTCG GCTTTCGCGA CATCGCCGGC TCGGGCCTGT TCTCCGGCGA GTACCTGGTG
GCCAGCAAGC GAACCGGGCC GTTCGACTGG AGCGTCGGCC TCGGCTGGGG TTACGTGGGC
GCGCGCGGCA ATCTGCGCAA CCCGCTGGCG GTGGTCAGCC GGCGGTTCGA CGATCGCACG
AACAGCGCGA CACCGAACGG CGGCGAGCTC GGCTACAGCT CATGGTTTCG CGGCCGCGTC
TCGCCGTTCG GCGGCGTGCA GTACCAGACG CCGCATGAGC GCCTCATCCT GAAAGCCGAA
TACGACGGCA ACGACTATCG GCACGAACCG TTCGGTCAAG TGCTGAAGGC ACGATCGCCA
TTCAACTTCG GCGCCGTCTA TCGCGCGACG CGCAACATCG ACTTGAGCCT CGGCTTCGAG
CGAGGCGCGC GCGTGATGTT CGGCGTCTCG CTGCACGGCA ATCTGAAGCG TGCGTCGATG
CCCAAGCTCG GCAATCCGCC GGCTCCGCCG GTGACGCAAC CGGCCGCGAA CGCCGGGCCG
CCCCCGCCGG CCGCCGATCC GGCATCGGGC GACGCGCAAG CGGCGACCGC GCAGGCATCG
CGCATCGGAC GCGCGTCGCC GTCGCCGTTC GATCGCGACT GGTCCGGCAC CGTCGCGCAA
TTGCAGGCGC AAACGCATTG GCACGTGCGC AGCATCCGTG CGCTCGGCAT GGATCTCGTC
GTCGAGTTCG ACGACGTCGA CGCGTTCTAC CTGCAGGACC CGCTCGAGCG CATCGCGACG
ATCCTGAACC GTGACGCGCC GCTCAACGTG CGCACGTTCC ATGTCGTCGC GCTCGTGCAC
GGCGTGCCGG TTGCCGACTA TCAGGTGCAG CGCACGCAGT GGTTCGCGAG CCGCACCCGC
GCCCTCACGC CGAGCGAGGC TGCGCCCGAC ACGGCGCTCG GCCGGCCGCT CACGCGACAG
TCGATCGACA TGCTGCCCTC TCTATTCGAG CAGCGGCCCA AGGCCTTCGT GGCGTCGGTC
GGGCCGGGCT ACCGGCAAAC CCTTGGCGGT CCGAACGGTT TCCTGCTCTA CCAGATCTCC
GCCGATGCAT ACGGCGAGCT GAGACTGCCC GGCGGCGCAT GGCTCGGCGG CGAACTGAAC
GTGGGGCTCG TCGACAACTA CGGCAAGTTC ACCTACACGG CGGACAGCAA GCTGCCGCGC
GTGCGCACGT ATCTGCGCGA GTACCTGACG ACGTCGCGCG TCACGCTGCC GCTGCTGCAA
CTGACGAAGA TGGGACGCCT CGGCAACGAT CAGTTCTACA GCGTATACGG CGGGCTGCTC
GAAAGCATGT TCGCGGGCGT CGGGGCCGAA TGGCTGTATC GCCCGGCGGA TAGCCGCCTC
GCGATCGGCG TCGACGTGAA CGCGGTGCGG CAGCGCGGCT TCCGCCAGGA TTTCTCGATG
CGCGACTACC GGACGCTCAC CGGACACGTG ACGGCGTATT GGAACACCGG GTGGCAGGGC
ATCCAAATCA ATCTGAGCGT CGGCCAGTAT CTGGCGAAGG ACAAGGGCGC GACGCTCGAC
ATTTCGCGGC GCTTTCGCAA CGGCGTCGTG ATCGGCGCCT ATGCGACGAA GACGAACATA
TCGGCGGCCC AATTCGGCGA AGGCAGCTTC GACAAGGGCA TCTACCTGAC GATTCCGTTC
GACGCGATGA TGACGCGCTC GAGCGGCAGC GTGGCGAATC TGCGCTGGAA CCCCGTGACG
CGCGACGGCG GCGCGAAGCT GGATCGCAAA TATCCGCTGT ACGATCTCAC CGACATGGGC
GAGCGCCGCA GCTTGTGGTA CGCGCCGCCG GATGGCGCAT TGTCGCCGTG A
 
Protein sequence
MLACARPFDA AAHVEETIAA PTHLGDWLAA HQSTPAVGTP PSGAPSPYLG GLSWRSNREV 
AAQQASKRRL LAGIDALPAL TPAAQAARAR LSAMIAARAA TGRVIVARSD ARWLQANPAH
DPYLEAGDVV TIPDRPSSVA VVRADGSICT VAHVQDVEAL PYVLACAPDA APDLAWIAQP
DGTVSESKVA MWNRDVQDAP APGSWIWAPD RGSRWPPALS RALAEFMATQ GVSGLADDGS
PLPAPPIAPV HQTAFPSGAP GRSAAFPVTG GDWGTAGILQ TPTARMNDAG EASLSMSHVS
PYTRLNFTLQ PLDWLEIGFR YTDVSNQPYG PVSLSGTQSY KDKSIDAKLR LWRESAYLPD
VAVGFRDIAG SGLFSGEYLV ASKRTGPFDW SVGLGWGYVG ARGNLRNPLA VVSRRFDDRT
NSATPNGGEL GYSSWFRGRV SPFGGVQYQT PHERLILKAE YDGNDYRHEP FGQVLKARSP
FNFGAVYRAT RNIDLSLGFE RGARVMFGVS LHGNLKRASM PKLGNPPAPP VTQPAANAGP
PPPAADPASG DAQAATAQAS RIGRASPSPF DRDWSGTVAQ LQAQTHWHVR SIRALGMDLV
VEFDDVDAFY LQDPLERIAT ILNRDAPLNV RTFHVVALVH GVPVADYQVQ RTQWFASRTR
ALTPSEAAPD TALGRPLTRQ SIDMLPSLFE QRPKAFVASV GPGYRQTLGG PNGFLLYQIS
ADAYGELRLP GGAWLGGELN VGLVDNYGKF TYTADSKLPR VRTYLREYLT TSRVTLPLLQ
LTKMGRLGND QFYSVYGGLL ESMFAGVGAE WLYRPADSRL AIGVDVNAVR QRGFRQDFSM
RDYRTLTGHV TAYWNTGWQG IQINLSVGQY LAKDKGATLD ISRRFRNGVV IGAYATKTNI
SAAQFGEGSF DKGIYLTIPF DAMMTRSSGS VANLRWNPVT RDGGAKLDRK YPLYDLTDMG
ERRSLWYAPP DGALSP