Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A1257 |
Symbol | |
ID | 3693824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1568977 |
End bp | 1570599 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637731511 |
Product | hypothetical protein |
Protein accession | YP_336414 |
Protein GI | 76817239 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGCTCA CCCTCGCGAG CTGCGGCGGT GACGATCTGA CGCCCGCCGC GCAGCGCTGG GCGATGCCCG GCACCGAACT GCCGCTCGGG CCGCAGGGCC TCGCGCAGAG CGTGTCGACG CAGACGCTCG CCGCAGGCGT CGCCTATTAC CAGATCAAGC GCGGCGCGGC GAGCGCGGCC GATTTCTGGA CCGTCAACCT CGGCTTCTAC GCGACGCAAG CCGCGGCGCA GGCCGATGCG GCGAATCTCG CGGCGGCCGG CTTCGCGACG CGCGTCGACG CGTCGGCGGG CACCGACCTG CAGGGCAAGG TGCTCGGCTA CTGGCTGTCG GCCGGCCGCT ACGCGACGCA GGCCGAGGCG ACGGCGGCCG CCGCGCGCAT CGCGCAGGCC ACGCAGAACC GCTACAAGCC GGGCACGCGG CATACGTCGC TCGCCGGCGC GCCGACGACG GGGCCGTGGA TCGTCAACGT GCTCGCGATC GACCCGTCGC GCGCCGGCGC GGCGCTGTCG CTCGCGCTGC CGGGCGGCGA CGATCTCGGT GCGGGCGGCG AGACGGTTTC GGCCGCGCGG GCGCGTGTGA ACGCGCTCGC CGGCGTCAAC GGCGGCTTTT TCACGAACAT CAATCCGTTC GGCGCGCCGC TGCCGCCGCG CTCGCCCGTC GGCGCGACGG TAGTCGACGG GCGGCTCGTC GCGGCAGCGA TCGGCAGGCG CCCCGGCCTG CTGCTCGCGC GCGACGCGAA CGGCCGCCAA CGCGCGACGG TCGTGCGCAA TCTCGCGACG TCGATCACGC TGACCGACGC GCAAGGCCGT GCGATCGCGG TCCAGACGCT GAACCGGCCG ATCCTCGGCA CGGTCGTCAA TTGCGGCGCG CAGGCGCGCA CGCCGACGAG CGAGCCGGCG CAGGACACGG TGTGCACGAA CTACGATGAC CTCGTGATGT ACGACTCGCT ATATCTGCGC GGCGGTGCGT CGAACACGCT CGTCGACGCC AGCTACCAGG GCGCGCGATA CGAACTCGTG GTCGACGCGA ACGGCGCCGT CGTCGCCGGC CATGCGACGC TCGGCGCGCC GCCGCCGCCG AACGGCTACG TGCTGCAGGG GCTCGGCGCG AGCGCCGCGT GGCTGCAGGC GCATGCGACG CCGGGCACGC GCCTCGCGGT ATCGCGCCGG CTGTCGGCCG ACGGCGCGGA TCTCGCGCTC GCGTCGGGCA TGTCGCTCGT CGAGGCGGGG CCGACGCTGT CCGTGCCGAA TCTCGCGCAA AGCGCCGCGC AAGAGGGCTT CGCGCCGACG GTGGGCGGCG TCGACGCGGG CGAAGGCGCC GCGGCGAACG GCAACTGGTA CAACGGCTGG TATGTCGCGC GCAATGGGCG CACCGCGGCG GGCGTCGCGG CGGACGGCAC GATCCTGCTC GTCGAGATCG ACGGCCGGCA GCCCGCGTTG AGCGTCGGCA CGAGCATTCC GGAGACGGCG GCGGTGATGG CATGGCTCGG TGCGACGTCG GCCGTCAATC TCGACGGCGG CGGCTCGAGC AACATGGTGG TCGGCGGCAA GATGGTCGGA CATCCGTCCG ACGCCGTGGG CGAGCGGGGC GTCGGCGATA CGCTGATGCT GCTGCCGGGC TGA
|
Protein sequence | MTLTLASCGG DDLTPAAQRW AMPGTELPLG PQGLAQSVST QTLAAGVAYY QIKRGAASAA DFWTVNLGFY ATQAAAQADA ANLAAAGFAT RVDASAGTDL QGKVLGYWLS AGRYATQAEA TAAAARIAQA TQNRYKPGTR HTSLAGAPTT GPWIVNVLAI DPSRAGAALS LALPGGDDLG AGGETVSAAR ARVNALAGVN GGFFTNINPF GAPLPPRSPV GATVVDGRLV AAAIGRRPGL LLARDANGRQ RATVVRNLAT SITLTDAQGR AIAVQTLNRP ILGTVVNCGA QARTPTSEPA QDTVCTNYDD LVMYDSLYLR GGASNTLVDA SYQGARYELV VDANGAVVAG HATLGAPPPP NGYVLQGLGA SAAWLQAHAT PGTRLAVSRR LSADGADLAL ASGMSLVEAG PTLSVPNLAQ SAAQEGFAPT VGGVDAGEGA AANGNWYNGW YVARNGRTAA GVAADGTILL VEIDGRQPAL SVGTSIPETA AVMAWLGATS AVNLDGGGSS NMVVGGKMVG HPSDAVGERG VGDTLMLLPG
|
| |