Gene BURPS668_A0995 details

Gene Information

Locus tag: BURPS668_A0995
Symbol:
ID: 4887500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 964248
End bp: 966119
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 68%
IMG OID: 640130934
Product: trehalase
Protein accession: YP_001061993
Protein GI: 284159983
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1626] Neutral trehalase
TIGRFAM ID:
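
The coordinate and length fields above are consistent with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes the stop codon. The following is a minimal sketch of that arithmetic under those two assumptions; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 964248
end_bp = 966119

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1872
print(protein_length_aa)  # 623
```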


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.570001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCAGC CCACGGTCAT TTGCGCGAAG TGTGGGAAGA ACCGCTTGCC TGAGCGCCGG 
TTTTCGTGCG AATCTGCCGC CGCGGCATAC GACAGACAAA CACGCCGCCA CGCGTCGGGC
CCGCGCGATC AGGAGACCGT ATTGCCGTTC ATCTCGAAAA CGGGAGGAGG AGATATGGTC
ACGCCGCGGC ATCGCCCGCT TCATGTTGAA AATCCGTTCT ATCGTCGCTT GCTGAACGCG
CCCGCGTGGG TCGCGCTCGT CGCCGCGGCC GGCATCGGCT GCACGAGCGC GACGCTCGCC
CGCGCCGATT CGTCGCCGGC GCACGCGTCC ACGGCCGCGG CCGTGGCGAG CGCCGCATCC
GCGCCGGGCG CCGCGTCGAT TCCGCCGCCG CCGAGCCAGC TCTACGGCGA TCTCTTTGTT
GCGGTGCAAA CCGCGCAGAT CTTCGCGGAT CAGAAGACCT TCGTCGATTC GACGCCGAAC
GCCGATCCCG CGACGATCGT CCAGCTGTAC CAGCAGCAGA AGGGCCAGCC GGGCTTCTCG
CTGAAGGCGT TCGTCGCGCA ATACTTCACG CCGCCGTCGG ACGAATCGGT GACGCCGCCG
CCGAACCAGA CGCTGCGCGA GCACATCGAC TGGCTATGGC CGAAGCTCAC GCGCACCACG
ACGACGGCGC CGCCGTACAG CTCGCTGATT GCGCTGCCCA AACCGTACGT GGTGCCGGGC
GGGCGCTTCC GCGAAGGCTA CTACTGGGAC ACGTACTTCA CGATGCTCGG CTTGCAGGAG
GCGGGCCGGG AAGATCTCGT CGACAACATG CTCGACAACT TCGCGTACCT GATCGACACC
GTCGGCCACG TGCCGAACGG CAACCGCAGC TATTACGTGA GCCGCTCGCA GCCGCCGTTC
TTCGCGTACA TGGTGACGCT CGCGGCGAAG GCCGAGGGCA ATCGCGTCTA TCAGAAGTAC
CTGCCCGCGC TGCGCAAGGA GTACGCGTAC TGGATGCAGG GCGAGCGCAC CACGCCGCGC
GGCCAGGCGA CGCGCAACGT CGTCGCGATG CCGGACGGCT CGGTGCTGAA CCGCTACTGG
GACGCGAGCG ACACGCCGCG CGACGAGTCG TATCTCGAAG ACGTGAAGAC CGCGCAGCAG
GCGAGCGGCC GGCCGGCGGC GGAAGTCTGG CGCGATCTGC GCGCGGCGGC GGAGAGCGGC
TGGGATTTCA GCTCGCGCTG GTTCGGCGAC AACCGCACGC TCGCGACGAT CCGCACGACC
GCGATCGTGC CCGTCGACTT GAACAGCCTG ATGTTCAACC TCGAGACGAC GATCGTGAAG
GGCTGCGCGG TGACGCGCGA TTTCGCGTGC GTCGCCGAGT TCGCCGGCCG CGCGGGCAAG
CGCGCGGTTG CGATCAACCG CTATCTGTGG AACCGCAACG GCTATTACGG CGACTACGAC
TGGAAACTCG GCAAGCCGCG CGACAACCTG TCGGCGGCGG CGTTGTATCC GCTGTTCGCG
GGCGTCGCGT GGCCGGAGCG CGCGAAGCAG ACCGCGAAGA ACGTGCAGAA AGCGCTGCTC
AAGCCGGGCG GGCTCGCGAC GACGACTTAC GACACCGCGC AGCAGTGGGA CGCACCTAAC
GGCTGGGCGC CGCTGCACTG GATCGCGCTC GTCGGGCTGC GGCACTATGG CGAGAAGTCG
CTCGCGGACG ATATCGGCAC GCGTTTTCTT GCCGACGTGA AGGGCGTGTA CGCGGCGCAG
GGCAAGCTCG TCGAGAAGTA CATCGTCGAA GGCGTGGGCA CGGGCGGCGG CGGCGGCGAG
TATCCGCTGC AGGACGGCTT CGGCTGGACC AACGGCGTGA CGCTCAAGCT GCTCGATCTG
TACGGCGGCT GA
 
Protein sequence
MPQPTVICAK CGKNRLPERR FSCESAAAAY DRQTRRHASG PRDQETVLPF ISKTGGGDMV 
TPRHRPLHVE NPFYRRLLNA PAWVALVAAA GIGCTSATLA RADSSPAHAS TAAAVASAAS
APGAASIPPP PSQLYGDLFV AVQTAQIFAD QKTFVDSTPN ADPATIVQLY QQQKGQPGFS
LKAFVAQYFT PPSDESVTPP PNQTLREHID WLWPKLTRTT TTAPPYSSLI ALPKPYVVPG
GRFREGYYWD TYFTMLGLQE AGREDLVDNM LDNFAYLIDT VGHVPNGNRS YYVSRSQPPF
FAYMVTLAAK AEGNRVYQKY LPALRKEYAY WMQGERTTPR GQATRNVVAM PDGSVLNRYW
DASDTPRDES YLEDVKTAQQ ASGRPAAEVW RDLRAAAESG WDFSSRWFGD NRTLATIRTT
AIVPVDLNSL MFNLETTIVK GCAVTRDFAC VAEFAGRAGK RAVAINRYLW NRNGYYGDYD
WKLGKPRDNL SAAALYPLFA GVAWPERAKQ TAKNVQKALL KPGGLATTTY DTAQQWDAPN
GWAPLHWIAL VGLRHYGEKS LADDIGTRFL ADVKGVYAAQ GKLVEKYIVE GVGTGGGGGE
YPLQDGFGWT NGVTLKLLDL YGG
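
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the DNA into protein. It is an illustration rather than the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. This gene begins with ATG, so alternative start codons are not handled here.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above, in its ten-base block layout.
first_row = "ATGCCGCAGC CCACGGTCAT TTGCGCGAAG TGTGGGAAGA ACCGCTTGCC TGAGCGCCGG"
print(translate(first_row))   # MPQPTVICAKCGKNRLPERR
print(f"{gc_fraction(first_row):.2f}")
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the full sequence can be compared against the reported GC content.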