Gene BURPS668_A1721 details

Gene Information

Locus tag: BURPS668_A1721
Symbol:
ID: 4887180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1670671
End bp: 1671855
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 62%
IMG OID: 640131659
Product: terpene synthase family protein
Protein accession: YP_001062716
Protein GI: 284159991
COG category:
COG ID:
TIGRFAM ID:
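
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced only for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length_bp(1670671, 1671855)
    print(length, protein_length_aa(length))
```

Under these assumptions the coordinates give 1185 bp and 394 aa, matching the Gene Length and Protein Length fields. `gc_percent` can be applied to the gene sequence from the Sequence section to compare against the reported GC content.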


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.24708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTGC CGCGACGCCA CCGATGCCCT TTTGAGAATA CCCCCGCAGA TGCCACGCAA 
GAGGAACATT CCGTGACCGC CCTCGTGAAC CAGTCCGTCG CGCCGATACT CTGTCCGTTT
CCATTACGGG AAATCCGGCC GGCCGACGCC CACATCGCCC GAACATCGGA ATGGTTGATT
CGATCGGGTC TCGTCGGCAG CGATGCCCAC GCGCGTGCGC TAGCCGGCGT GGGCGCACAT
TATGCAATGT GCTGCTATCC GGATATCGCC GCCGATCGGA TACCGGATCT GGCCGATTTC
GCCGCATGGA ATTGCCTGTT GGACGACTTC GCCGAGAACG GGCCGTTGAG CGGCGAGCTC
GCCGCACTCA CGCATTTCCT GAAGTCCGTC GAATACATTT GCGGCGCGTC GAACTACCGC
TGCCCGTCCG ATTTCGGATT CGATCACGGC TATCGCATCG CCGAGGCCCT TGTCGACGTG
AAGCGCCGAA TTTCCGCATG GGCCTCCTTC GCGCAAATCC GCAACCTGAT GAGCGCCACC
GGCCATTTCA TGTCGGGCCT CGCGTGGGAA GCCGCCTATG CGAGCCTGCG CCAGGTGCCG
GACCTCAACA CGTATTGCGC GATCAGAACG GCGAACTCCG GCATGTACAT GGCGAACGCG
CTGGCGGAAT GCGCGAACGA CGTCGAGCTG ACGCCGGCGC AACGCGCGTG CCCCAGGACA
GAGGCGCTGA CGCAATGCAT ACTGTTCGTC CTCGTGATCG ACAACGATCT CTACTCGCAT
CACAAGGAAA AAAACGGCCG CGCCGCGTTT GCGAGCATGA TCGACGTCCT CATGCATTCA
CGCGGCAGCG AAGACGCGCA CGCCGCGCTA TTGGAAGCAC TCGATCTGCG AAATCAGTGC
CTGCGCTGCT ATCTGGCGTT GAAAGCGAAA TGCCGGCTGA CCGCCGGCGA TCGGCTCGAC
CTTTACTTCA AGGGACTCGA AGACGTCATC AGCGGAAACC TGGTGTTCGG CAGCACGTGC
GCTCGATACG CGGCACCGGG AAGCCCTCAG TTCCTCGGCA CGACGAACGC ACGGCACCTC
AGGCCCGACA GCGTTCAGAT CCCCGTCGTC GAAGCGCTCG ACGTCCCCGC CTCGCCTCCT
CGCCACATTC CGTCGATCAC GTGGTGGTGG ACACTCGCCG ACTGA
 
Protein sequence
MTLPRRHRCP FENTPADATQ EEHSVTALVN QSVAPILCPF PLREIRPADA HIARTSEWLI 
RSGLVGSDAH ARALAGVGAH YAMCCYPDIA ADRIPDLADF AAWNCLLDDF AENGPLSGEL
AALTHFLKSV EYICGASNYR CPSDFGFDHG YRIAEALVDV KRRISAWASF AQIRNLMSAT
GHFMSGLAWE AAYASLRQVP DLNTYCAIRT ANSGMYMANA LAECANDVEL TPAQRACPRT
EALTQCILFV LVIDNDLYSH HKEKNGRAAF ASMIDVLMHS RGSEDAHAAL LEALDLRNQC
LRCYLALKAK CRLTAGDRLD LYFKGLEDVI SGNLVFGSTC ARYAAPGSPQ FLGTTNARHL
RPDSVQIPVV EALDVPASPP RHIPSITWWW TLAD