Gene BURPS668_A0412 details

Gene Information

Locus tag: BURPS668_A0412
Symbol:
ID: 4888728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 380788
End bp: 382449
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 68%
IMG OID: 640130353
Product: type I phosphodiesterase / nucleotide pyrophosphatase
Protein accession: YP_001061418
Protein GI: 126442340
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
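
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the source record.

```python
# Values copied from the record above.
start_bp = 380788
end_bp = 382449

# Inclusive coordinates: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1662 bp
print(protein_length, "aa")   # 553 aa
```

Both results match the Gene Length and Protein Length fields reported in the record.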


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.490012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCA ACAGTAAGCA GTTCGAAATC GGCGCATGGC TGGGCGCGTG CGCGCTCGCC 
TTCGCCGGCG CGGCTTCGGC CGCCGCCGTC CAGGATCGCG ATCACGACAG CCGCCCCGTC
GACGCGAAGC GCGTGCTGCT GGTGAGCATT GATGGTCTGC ACGAGCAGGA CCTCACGCGC
TGCATCGGCG CGAATACGTG CCCGAATCTC GCGCTGCTCG CGAAATCGGG GGTCACGTAC
ACGAACGCGC GCACGCCGGG GCTGTCGGAT TCGTTCCCGG GCCTGGCCGC GCTGGTGACG
GGCGGTTCGC CGAAGAGCGC GGGCCTCTTC TACGACGTGT CGTACGATCG CACGCTGTAC
GCACCGTCGG ATGCGACGTG CTCGGGCAAG CAAGGCTGGA ACGTCGTGTT CGACGAAACA
ACCGGCATCG ACGCGATGAA CGGCGGCGCG CTCACGCATC TCGACGGCGG CGGCGCGTTC
AACCCGCAGG CGATCCCGCA CGCGCGCGTG AACGGCCAGT GCGTGAGCGT CTACCCGCAC
GACTACGTGA AGACGAACAC GGTGTTCGAA GTCGTCAAGG AACATCTGCG CGGCTCGCAC
ACCGCATGGG CGGACAAGCA CGCGTGGGGC TACGACTGGG TGAACGGCCC ATCGGGCAAG
GGCGTCGACG ATCTCGCGCG CACCGAGATC AACTCGATCG ATCCGGCCAC GGGCACCCCC
TATACCGACA TCTATACGCA TACCGAAAAG TTCGACGACT ATCACGTGCA GGCGATCGTC
AACCAGATCG ACGGCAAGAA CTCGACGGGC ACCGCGGCCG CGCCCGTGCC GACCCTGTTC
GGCACGAACT TCCAGACGCT GTCGGTCGCG CAGAAGGCCA CCGTCGCGTC GGGCGGCGGC
TATCTCGACG CGAGCTTCAC GCCGGGGCCG GAAGTCGCGA ACGCGATCGC GTACGTCGAC
GGCGCGCTCG GCCGCATCGT CGCCGAGCTC AGGCAGCGCG GGCTGTACGA TTCGACGGTG
GTGATCGTCA CCGCGAAGCA CGGCCAGTCG CCGACCGACC ATACGAAGCT CGTGAAGCAC
GGCGACACGC TCACCGCGCT GCTCGAGGCG AACGGCTTCG TCGATCCGAA CGGCAACTTC
GGCCAGAACA ACACCGCGTC GGGCAACCCG AACGACGGCA CGGGCCTCGT CGGCACGGGC
TTCGTGCAGA CCGACGACGT CGGCCTCGTC TGGCTGCGCG ACCCGCGCCA GTTGAGCGCG
GCCGTCGCGA CACTGAAGGC GAATCTCGGC TGCAACGCGC CGGGGATCTG CGCGGACGGC
CCGCAGGCGT ACATCCTGTA TGGCCCGAGC GTCGCCGAGC GCTTCGGCGA TCCGGCGCTC
GGCCGCACGC CGGACATCGT CGTGCAGCCG AACCCGGGCG TGATCTACAC GTCGAGCAAG
AAGAAGGACG AAGAGCACGG CGGCAACGCG CCGGACGACA GCCACCTCGG CCTGCTCGTG
TCATACGCGG GCTTGCGCCA GGGCCGCACA ATCGACGCGC CGGTGCTGAC GACGCAGGTC
GCGCCGACGA TCCTGCGCTC GCTCGGCCTC GAGCCGCGCC TGCTGCACGC GGTCGCGCTC
GAAGGCACGC GCGTGCTGCC GGGCCTTGGC CTCGAGCGCT GA
 
Protein sequence
MKRNSKQFEI GAWLGACALA FAGAASAAAV QDRDHDSRPV DAKRVLLVSI DGLHEQDLTR 
CIGANTCPNL ALLAKSGVTY TNARTPGLSD SFPGLAALVT GGSPKSAGLF YDVSYDRTLY
APSDATCSGK QGWNVVFDET TGIDAMNGGA LTHLDGGGAF NPQAIPHARV NGQCVSVYPH
DYVKTNTVFE VVKEHLRGSH TAWADKHAWG YDWVNGPSGK GVDDLARTEI NSIDPATGTP
YTDIYTHTEK FDDYHVQAIV NQIDGKNSTG TAAAPVPTLF GTNFQTLSVA QKATVASGGG
YLDASFTPGP EVANAIAYVD GALGRIVAEL RQRGLYDSTV VIVTAKHGQS PTDHTKLVKH
GDTLTALLEA NGFVDPNGNF GQNNTASGNP NDGTGLVGTG FVQTDDVGLV WLRDPRQLSA
AVATLKANLG CNAPGICADG PQAYILYGPS VAERFGDPAL GRTPDIVVQP NPGVIYTSSK
KKDEEHGGNA PDDSHLGLLV SYAGLRQGRT IDAPVLTTQV APTILRSLGL EPRLLHAVAL
EGTRVLPGLG LER
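
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in permitted start codons, not in codon meanings). The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical and not part of the source database.

```python
# Codon table in the conventional TCAG ordering (standard code assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGAAACGCA ACAGTAAGCA GTTCGAAATC GGCGCATGGC TGGGCGCGTG CGCGCTCGCC"
print(translate(clean_sequence(first_row)))  # MKRNSKQFEIGAWLGACALA
```

The output matches the first twenty residues of the protein sequence. Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.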