Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0696 |
Symbol | |
ID | 4882417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 673402 |
End bp | 674532 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640126624 |
Product | hypothetical protein |
Protein accession | YP_001057748 |
Protein GI | 126441267 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGCG ACGCCGCCGC CGAGCGCGCG CAACGCATCG GCCTCGTCCT TTCCGACATT TCCGATAGCC GAATCCGTCG GCTTGCCAAC CGTGCGAACC TTCTTCCTGC TCTTCGTCGC GCCGGGCTTG TCTGCCGGAC AACCGGCCTC CTGCACGCCC GCGGTGCGTA CCGCCTTCGG CGGGAGCGGC CGATGAGCGC CGTCGCCTGC GCGCCGGCGC GCCCGTCGCT CGATCTGCGC GCGATCGGCG TGATGATCCT GCTGTGCGCG ATCTGGGGCT TTCAACAAGT CGCGATCAAG AGCGCGACGC ATGCGATTCC GCCGATGCTC CAGGCCGGGC TGCGCTCGGC GATCGCGGCG GTCGGCGTGT GGGCGTGGGC GCGCGCGCGC GGCACGCCGA TCTTCCGCAC GGACGGCACG TTAGGCGCCG GCATCGTCGC CGGCACGCTG TTCGCGGGCG AGTTCGTCTG TCTGTTCTTC GGCCTCACGC TGACGAGCGC CGCGCGCATG GCGATCTTCC TGTACACCGC GCCGTGCTTC ACCGCGCTCG GCCTTCACCT GTTCGCGCCG GGCGAGACAA TGCGCCGCCA GCAATGGGCG GGCGTCGCGA TCGCGTTCGC GGGCATCGCG GTCGCGTTCG CCGACGGTTT CGCGCGACCG GCCGCCGGCG GCGCATCGGC GCTCGCCGGA CTCGCGGGCG ACGCGCTCGG CGTGCTCGGC GGCGTGATGT GGGCGGCGAC GACGGTCGTC GTGCGTTCGA CGTCGCTCGC GCACGCGAGT GCGAGCAAGA CGCTGTTCTA TCAGTTGACC GTGTCGTCGG CGGTATTGCT CGGCCTCGCG GTCGTCACCC GCCAGACGAC GTTCGCGAAC GTGACCCCGC TCGCCGTCGC AAGCCTGGCC TATCAGGGCG TGATCGTCGC GTTCGCGAGC TATCTCGCGT GGTATTGGCT GCTCACGCGC TACAGCGCGG CGCGGCTCTC GGTGTTCACG TTTCTCGCGC CGCTCTTCGG CGTGAGCTTC GGCGTGCTGC TGCTCGGCGA TGCGATCGGC CCGCGCTTCG TCGCGGCGGC CGCGCTCGTG CTCGCGGGCA TCGCGCTCGT CAACGCGCCG CCGCGCGGCG CTCGCAATTA G
|
Protein sequence | MRRDAAAERA QRIGLVLSDI SDSRIRRLAN RANLLPALRR AGLVCRTTGL LHARGAYRLR RERPMSAVAC APARPSLDLR AIGVMILLCA IWGFQQVAIK SATHAIPPML QAGLRSAIAA VGVWAWARAR GTPIFRTDGT LGAGIVAGTL FAGEFVCLFF GLTLTSAARM AIFLYTAPCF TALGLHLFAP GETMRRQQWA GVAIAFAGIA VAFADGFARP AAGGASALAG LAGDALGVLG GVMWAATTVV VRSTSLAHAS ASKTLFYQLT VSSAVLLGLA VVTRQTTFAN VTPLAVASLA YQGVIVAFAS YLAWYWLLTR YSAARLSVFT FLAPLFGVSF GVLLLGDAIG PRFVAAAALV LAGIALVNAP PRGARN
|
| |