Gene Information |
Locus tag | BURPS668_A0995 |
Symbol | |
ID | 4887500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 964248 |
End bp | 966119 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640130934 |
Product | trehalase |
Protein accession | YP_001061993 |
Protein GI | 284159983 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
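The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (which is consistent with the listed Gene Length) and that the final codon is a stop codon that is not counted in Protein Length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 964248
end_bp = 966119
gene_length_bp = 1872
protein_length_aa = 623


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Both comparisons hold for this record: the span covers 1872 bp, which is 624 codons, or 623 residues plus one stop codon.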
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.570001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAGC CCACGGTCAT TTGCGCGAAG TGTGGGAAGA ACCGCTTGCC TGAGCGCCGG TTTTCGTGCG AATCTGCCGC CGCGGCATAC GACAGACAAA CACGCCGCCA CGCGTCGGGC CCGCGCGATC AGGAGACCGT ATTGCCGTTC ATCTCGAAAA CGGGAGGAGG AGATATGGTC ACGCCGCGGC ATCGCCCGCT TCATGTTGAA AATCCGTTCT ATCGTCGCTT GCTGAACGCG CCCGCGTGGG TCGCGCTCGT CGCCGCGGCC GGCATCGGCT GCACGAGCGC GACGCTCGCC CGCGCCGATT CGTCGCCGGC GCACGCGTCC ACGGCCGCGG CCGTGGCGAG CGCCGCATCC GCGCCGGGCG CCGCGTCGAT TCCGCCGCCG CCGAGCCAGC TCTACGGCGA TCTCTTTGTT GCGGTGCAAA CCGCGCAGAT CTTCGCGGAT CAGAAGACCT TCGTCGATTC GACGCCGAAC GCCGATCCCG CGACGATCGT CCAGCTGTAC CAGCAGCAGA AGGGCCAGCC GGGCTTCTCG CTGAAGGCGT TCGTCGCGCA ATACTTCACG CCGCCGTCGG ACGAATCGGT GACGCCGCCG CCGAACCAGA CGCTGCGCGA GCACATCGAC TGGCTATGGC CGAAGCTCAC GCGCACCACG ACGACGGCGC CGCCGTACAG CTCGCTGATT GCGCTGCCCA AACCGTACGT GGTGCCGGGC GGGCGCTTCC GCGAAGGCTA CTACTGGGAC ACGTACTTCA CGATGCTCGG CTTGCAGGAG GCGGGCCGGG AAGATCTCGT CGACAACATG CTCGACAACT TCGCGTACCT GATCGACACC GTCGGCCACG TGCCGAACGG CAACCGCAGC TATTACGTGA GCCGCTCGCA GCCGCCGTTC TTCGCGTACA TGGTGACGCT CGCGGCGAAG GCCGAGGGCA ATCGCGTCTA TCAGAAGTAC CTGCCCGCGC TGCGCAAGGA GTACGCGTAC TGGATGCAGG GCGAGCGCAC CACGCCGCGC GGCCAGGCGA CGCGCAACGT CGTCGCGATG CCGGACGGCT CGGTGCTGAA CCGCTACTGG GACGCGAGCG ACACGCCGCG CGACGAGTCG TATCTCGAAG ACGTGAAGAC CGCGCAGCAG GCGAGCGGCC GGCCGGCGGC GGAAGTCTGG CGCGATCTGC GCGCGGCGGC GGAGAGCGGC TGGGATTTCA GCTCGCGCTG GTTCGGCGAC AACCGCACGC TCGCGACGAT CCGCACGACC GCGATCGTGC CCGTCGACTT GAACAGCCTG ATGTTCAACC TCGAGACGAC GATCGTGAAG GGCTGCGCGG TGACGCGCGA TTTCGCGTGC GTCGCCGAGT TCGCCGGCCG CGCGGGCAAG CGCGCGGTTG CGATCAACCG CTATCTGTGG AACCGCAACG GCTATTACGG CGACTACGAC TGGAAACTCG GCAAGCCGCG CGACAACCTG TCGGCGGCGG CGTTGTATCC GCTGTTCGCG GGCGTCGCGT GGCCGGAGCG CGCGAAGCAG ACCGCGAAGA ACGTGCAGAA AGCGCTGCTC AAGCCGGGCG GGCTCGCGAC GACGACTTAC GACACCGCGC AGCAGTGGGA CGCACCTAAC GGCTGGGCGC CGCTGCACTG GATCGCGCTC GTCGGGCTGC GGCACTATGG CGAGAAGTCG CTCGCGGACG ATATCGGCAC GCGTTTTCTT GCCGACGTGA AGGGCGTGTA CGCGGCGCAG GGCAAGCTCG TCGAGAAGTA CATCGTCGAA GGCGTGGGCA CGGGCGGCGG CGGCGGCGAG 
TATCCGCTGC AGGACGGCTT CGGCTGGACC AACGGCGTGA CGCTCAAGCT GCTCGATCTG TACGGCGGCT GA
|
Protein sequence | MPQPTVICAK CGKNRLPERR FSCESAAAAY DRQTRRHASG PRDQETVLPF ISKTGGGDMV TPRHRPLHVE NPFYRRLLNA PAWVALVAAA GIGCTSATLA RADSSPAHAS TAAAVASAAS APGAASIPPP PSQLYGDLFV AVQTAQIFAD QKTFVDSTPN ADPATIVQLY QQQKGQPGFS LKAFVAQYFT PPSDESVTPP PNQTLREHID WLWPKLTRTT TTAPPYSSLI ALPKPYVVPG GRFREGYYWD TYFTMLGLQE AGREDLVDNM LDNFAYLIDT VGHVPNGNRS YYVSRSQPPF FAYMVTLAAK AEGNRVYQKY LPALRKEYAY WMQGERTTPR GQATRNVVAM PDGSVLNRYW DASDTPRDES YLEDVKTAQQ ASGRPAAEVW RDLRAAAESG WDFSSRWFGD NRTLATIRTT AIVPVDLNSL MFNLETTIVK GCAVTRDFAC VAEFAGRAGK RAVAINRYLW NRNGYYGDYD WKLGKPRDNL SAAALYPLFA GVAWPERAKQ TAKNVQKALL KPGGLATTTY DTAQQWDAPN GWAPLHWIAL VGLRHYGEKS LADDIGTRFL ADVKGVYAAQ GKLVEKYIVE GVGTGGGGGE YPLQDGFGWT NGVTLKLLDL YGG
|
| |
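The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be normalized before any computation. The sketch below is one hedged way to do this and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first two blocks of the sequence are used here for illustration.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
prefix = normalize_sequence("ATGCCGCAGC CCACGGTCAT")
prefix_gc = gc_fraction(prefix)
print(prefix, prefix_gc)
```

Passing the full gene sequence text to `normalize_sequence` and then to `gc_fraction` gives a value that can be compared against the GC content listed in the Gene Information table.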