Gene BURPS668_3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3698 
Symbol 
ID4882790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3620753 
End bp3621985 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content68% 
IMG OID640129626 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_001060702 
Protein GI126439862 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.150104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGAAA TACCCGGCGG CGGCATCCCG CGCGAAACCC GCGACGCGCG CGTCTCGAGT 
GCGGGCGACG GGCAGGCGAT TCCCGTCGCC GCGCCGACCA CCGCAGCGCT CGAAGCGCAT
CTCGCGCCGT ACGCGGCGCA CGCGTCGCGC TCGCGCGGGC GGCGCCATCC GGAGCCGCCG
CCCGCGGCGC GCACCGAATT CCAGCGCGAT CGCGACCGCA TCGTGCACTC CACCGCATTC
AGGCGCCTCG AATACAAGAC GCAAGTCTTC GTGAATCATG AAGGCGACCT GTTCCGCACG
CGTCTCACGC ACAGCCTCGA GGTCGCGCAG ATCGCCCGGT CCGTCGCGCG CAACCTGCGC
CTGAACGAAG ACCTCGTCGA GGCGATCTCG CTCGCGCACG ACCTCGGCCA TACGCCGTTC
GGCCACGCCG GGCAGGACGC GCTCAACGCG TGCATGCGCG ACTACGGCGG CTTCGAGCAC
AATCTGCAGA GCCTCGCCGT CGTCGACGAG CTCGAAGAGC ATTACGGCGC GTTCAATGGG
CTGAACCTGT GCTTCGAGAC GCGCGAAGGC ATCCTCAAGC ACTGCTCGCG CGAGAACGCG
CGCAAGCTCG GCGAGCTCGG CGAGCGATTC CTGCAGGGCC GCCAGCCGTC GCTCGAAGCG
CAGCTCGCGA ACATCGCGGA CGAAATCGCG TACAACAATC ACGACGTCGA CGACGGCCTG
CGCTCGGGCC TCATCACGAT CGAGCAGCTC GCCGAGGTCG AGCTGTGGCA GCGCCATTAC
GAAGCGGCGC TCGCCGAATA TCCGCATCTC GAGGGCCGCC GGCTCGTGCA CGAGACGGTG
CGCCGGATCA TCAACACGCT GATCGTCGAT CTGATCGACG CGACGACGCG CAATCTCGCG
CGCCACGGGC CGACCTCGCT CGACGACGTG CGCGCGGCGC CGCCCCTCGT CGCGCACGGC
GAGCCGATCG CCACGCAGGC GGCGGCGCTC AAGCGTTTCC TGTACAAGAA CCTGTATCGC
CACTACCGCG TGATGCGCAT GGCGAGCAAG GCGCAGCGGG TCGTCACCGG CCTCTTCAAC
GCGTTCACGG GCGACCCGCG CCTCTTGCCG CCCGACTATC AGGCGGCCGA CGCCGCGCAT
CAGCCGCGGC TCGTCGCGCA TTACATCGCC GGCATGACCG ATCGTTTCGC ACTGAAAGAG
TATCAACGCT TGTTTGTCAT GGACGAAAAC TAA
 
Protein sequence
MSEIPGGGIP RETRDARVSS AGDGQAIPVA APTTAALEAH LAPYAAHASR SRGRRHPEPP 
PAARTEFQRD RDRIVHSTAF RRLEYKTQVF VNHEGDLFRT RLTHSLEVAQ IARSVARNLR
LNEDLVEAIS LAHDLGHTPF GHAGQDALNA CMRDYGGFEH NLQSLAVVDE LEEHYGAFNG
LNLCFETREG ILKHCSRENA RKLGELGERF LQGRQPSLEA QLANIADEIA YNNHDVDDGL
RSGLITIEQL AEVELWQRHY EAALAEYPHL EGRRLVHETV RRIINTLIVD LIDATTRNLA
RHGPTSLDDV RAAPPLVAHG EPIATQAAAL KRFLYKNLYR HYRVMRMASK AQRVVTGLFN
AFTGDPRLLP PDYQAADAAH QPRLVAHYIA GMTDRFALKE YQRLFVMDEN