Gene BURPS1106A_3756 details

Gene Information

Locus tag: BURPS1106A_3756
Symbol:
ID: 4899530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3667946
End bp: 3669178
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 67%
IMG OID: 640136982
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_001067986
Protein GI: 126452532
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
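
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal check, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon (neither assumption is stated in the record; the variable names are illustrative):

```python
# Values copied from the Gene Information fields.
start_bp = 3667946
end_bp = 3669178

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1233 410
```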


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGAAA TACCCGGCGG CGGCATCCCG CGCGAAACCC GCGACGCGCG CGTCTCGAAT 
GCGGGCGACG GGCAGGCGAT TCCCGTCGCC GCGCCGACCA CCGCAGCGCT CGAAGCGCAT
CTCGCGCCGT ACGCGGCGCA CGCGTCGCGC TCGCGCGGGC GGCGCCATCC GGAGCCGCCG
CCCGCGGCGC GCACCGAATT CCAGCGCGAT CGCGACCGCA TCGTGCACTC CACCGCATTC
AGGCGCCTCG AATACAAGAC GCAGGTCTTC GTGAATCATG AAGGCGACCT GTTCCGCACG
CGTCTCACGC ACAGCCTCGA GGTCGCGCAG ATCGCCCGGT CCGTCGCGCG CAACCTGCGC
CTGAACGAAG ACCTCGTCGA GGCGATCTCG CTCGCGCACG ACCTCGGCCA TACGCCGTTC
GGCCACGCCG GGCAGGACGC GCTCAACGCG TGCATGCGCG ACTACGGCGG CTTCGAGCAC
AATCTGCAGA GCCTCGCCGT CGTCGACGAG CTCGAAGAGC ATTACGGCGC GTTCAATGGG
CTGAACCTGT GCTTCGAGAC GCGCGAAGGC ATCCTCAAGC ACTGCTCGCG CGAGAACGCG
CGCAAGCTCG GCGAGCTCGG CGAGCGATTC CTGCAGAGCC GCCAGCCGTC ACTCGAAGCG
CAGCTCGCGA ACATCGCGGA CGAAATCGCG TACAACAATC ACGACGTCGA CGACGGCCTG
CGCTCGGGCC TCATCACGAT CGAGCAACTC GCCGAGGTCG AGCTGTGGCA GTGCCATTAC
GAAGCGGCGC TCGCCGAATA TCCGCATCTC GAGGGCCGCC GTCTCGTGCA CGAGACGGTG
CGCCGGATCA TCAACACGCT GATCGTCGAT CTGATCGACG CGACGACGCG CAATCTCGCG
CGCCACGGGC CGACCTCGCT CGACGACGTG CGCGCGGCGC CGCACCTCGT CGCGCACGGC
GAGCCGATCG CCACGCAGGC GGCGGCGCTC AAGCGTTTCC TGTACAAGAA CCTGTATCGC
CACTACCGCG TGATGCGCAT GGCGAGCAAG GCGCAGCGGG TCGTCACCGG CCTCTTCAAC
GCGTTCACGG GCGACCCGCG CCTCTTGCCG CCCGACTATC AGGCGGCCGA CGCCGCGCAT
CAGCCGCGGC TCGTCGCGCA TTACATCGCC GGCATGACCG ATCGTTTCGC ACTGAAAGAG
TATCAACGCT TGTTTGTCAT GGACGAAAAC TAA
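
The GC content in the Gene Information fields (67%) is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name `gc_content` is hypothetical, and the input is only the first two 10-base blocks of the sequence, so the printed value describes that fragment, not the whole gene:

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence, with the spacing as displayed.
fragment = "GTGAGCGAAA TACCCGGCGG"
print(round(100 * gc_content(fragment)))  # prints: 65
```

Passing the full sequence, with spaces and line breaks left in, works the same way because whitespace is stripped before counting.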
 
Protein sequence
MSEIPGGGIP RETRDARVSN AGDGQAIPVA APTTAALEAH LAPYAAHASR SRGRRHPEPP 
PAARTEFQRD RDRIVHSTAF RRLEYKTQVF VNHEGDLFRT RLTHSLEVAQ IARSVARNLR
LNEDLVEAIS LAHDLGHTPF GHAGQDALNA CMRDYGGFEH NLQSLAVVDE LEEHYGAFNG
LNLCFETREG ILKHCSRENA RKLGELGERF LQSRQPSLEA QLANIADEIA YNNHDVDDGL
RSGLITIEQL AEVELWQCHY EAALAEYPHL EGRRLVHETV RRIINTLIVD LIDATTRNLA
RHGPTSLDDV RAAPHLVAHG EPIATQAAAL KRFLYKNLYR HYRVMRMASK AQRVVTGLFN
AFTGDPRLLP PDYQAADAAH QPRLVAHYIA GMTDRFALKE YQRLFVMDEN
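
The protein sequence corresponds to the gene sequence translated with translation table 11, the bacterial, archaeal and plant plastid code. That table uses the standard codon assignments but allows alternative start codons such as GTG, which are read as methionine at the first position. A minimal sketch under those assumptions, using a hypothetical `translate` helper rather than any particular library, and only the first 30 bases as input:

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence (A/C/G/T only), stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("GTGAGCGAAA TACCCGGCGG CGGCATCCCG"))  # prints: MSEIPGGGIP
```

The output matches the first ten residues of the protein sequence. The leading GTG codon would be valine in an internal position, but as the start codon it is translated as M.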