Gene BMASAVP1_A3208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A3208 
Symbol 
ID4681796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp3172786 
End bp3174018 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content68% 
IMG OID639847464 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_994491 
Protein GI121601283 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00852958 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGAAA TACCCGGCGG CGGCATCCCG CGCGAAACCC GCGACGCGCG CGTCTCGAAT 
GCGGGCGACG GGCAGGCGAT TCCCGTCGCC GCGCCGACCA CCGCAGCGCT CGAAGCGCAT
CTCGCGCCGT ACGCGGCGCA CGCGTCGCGC TCGCGCGGGC GGCGCCATCC GGAGCCGCCG
CCCGCGGCGC GCACCGAATT CCAGCGCGAT CGCGACCGCA TCGTGCACTC CACCGCATTC
AGGCGCCTCG AATACAAGAC GCAAGTCTTC GTGAATCATG AAGGCGACCT GTTCCGCACG
CGTCTCACGC ACAGCCTCGA GGTCGCGCAG ATCGCCCGGT CCGTCGCGCG CAACCTGCGC
CTGAACGAAG ACCTCGTCGA GGCGATCTCG CTCGCGCACG ACCTCGGCCA TACGCCGTTC
GGCCACGCCG GGCAGGACGC GCTCAACGCG TGCATGCGCG ACTACGGCGG CTTCGAGCAC
AATCTGCAGA GCCTCGCCGT CGTCGACGAG CTCGAAGAGC ATTACGGCGC GTTCAATGGG
CTGAACCTGT GCTTCGAGAC GCGCGAAGGC ATCCTCAAGC ACTGCTCGCG CGAGAACGCG
CGCAAGCTCG GCGAGCTCGG CGAGCGATTC CTGCAAGGCC GCCAGCCGTC GCTCGAAGCG
CAGCTCACGA ACATCGCGGA CGAAATCGCG TACAACAATC ACGACGTCGA CGACGGCCTG
CGCTCGGGCC TCATCACGAT CGAGCAGCTC GCCGAGGTCG AGCTGTGGCA GCGCCATTAC
GAAGCGGCGC TCGCCGAGTA TCCGCATCTC GAGGGCCGCC GGCTCGTGCA CGAGACGGTG
CGCCGGATCA TCAACACGCT GATCGTCGAT CTGATCGACG CGACGACGCG CAATCTCGCG
CGCCACGGGC CGACCTCGCT CGACGACGTG CGCGCGGCGC CGCCCCTCGT CGCGCACGGC
GAGCCGATCG CCACGCAGGC GGCGGCGCTC AAGCGTTTCC TGTACAAGAA CCTGTATCGC
CACTACCGCG TGATGCGCAT GGCGAGCAAG GCGCAGCGGG TCGTCACCGG CCTCTTCAAC
GCGTTCACGG GCGACCCGCG CCTCTTGCCG CCCGACTATC AGGCGGCCGA CGCCGCGCAT
CAGCCGCGGC TCGTCGCGCA TTACATCGCC GGCATGACCG ATCGTTTCGC ACTGAAAGAG
TATCAACGCT TGTTTGTCAT GGACGAAAAC TAA
 
Protein sequence
MSEIPGGGIP RETRDARVSN AGDGQAIPVA APTTAALEAH LAPYAAHASR SRGRRHPEPP 
PAARTEFQRD RDRIVHSTAF RRLEYKTQVF VNHEGDLFRT RLTHSLEVAQ IARSVARNLR
LNEDLVEAIS LAHDLGHTPF GHAGQDALNA CMRDYGGFEH NLQSLAVVDE LEEHYGAFNG
LNLCFETREG ILKHCSRENA RKLGELGERF LQGRQPSLEA QLTNIADEIA YNNHDVDDGL
RSGLITIEQL AEVELWQRHY EAALAEYPHL EGRRLVHETV RRIINTLIVD LIDATTRNLA
RHGPTSLDDV RAAPPLVAHG EPIATQAAAL KRFLYKNLYR HYRVMRMASK AQRVVTGLFN
AFTGDPRLLP PDYQAADAAH QPRLVAHYIA GMTDRFALKE YQRLFVMDEN