Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2770 |
Symbol | |
ID | 4884276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2736908 |
End bp | 2738116 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640128698 |
Product | hypothetical protein |
Protein accession | YP_001059791 |
Protein GI | 126442200 |
COG category | [S] Function unknown |
COG ID | [COG4394] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.593896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCGT CCGCCCCGCT TCCCCCGCCC GCCGACACGG CGTCGCCCCT GCAAGCGGCA AGCCCGGTCG CGTGCGACAT CTTCTGCGCG GTCGTCGACA ACTTCGGCGA CATCGGCGTG TGCTGGCGTC TCGCGCGCCA GCTCGCGCTC GAGCACGGCT GGCAGGTGCG GATCTTCGTC GACGCGCTCG CGACGTTCGC GCGCCTGCAG CCGGCCGCGT TGCCCGACGC CGCGCGGCAG ACCGTCGACG GCATCGTCGT CGAGCACTGG CGCGCGCCCG CGCACGCGGG CGACACGCTC GAGATCGCCG ACATCGTGAT CGAGGCGTTC GCCTGCGAGC TGCCGGGCGC GTATGTCGCC GCGATGGCGC GCCGCGCGCG GCCGCCCGTC TGGATCAACC TCGAATACCT GAGCGCCGAG GACTGGGTCG GCGAATTCCA TCTGCGCCCG TCGCCGCATC CGCGCTATCC GCTCACGAAG ACGTTCTTCT TCCCTGGCCT CGGGCCCGGC ACGGGCGGCG TGCTGAAGGA GCGCGATCTC GACGCGCGCC GCGCCGCGTT CGAAACCGGC GACGATGCGC GCCGCACGTG GTGGCAAAAC GTCGCGGGCG CGCCGATGCC TGCTCCGGAC ACCACCGTCG TGTCGCTCTT CGCGTACGAG AATCCTGCGC TCGACGCGCT GCTCGAACAG TGGCGCGACG GCCGCGAGCC GGTCGCGCTG CTCGTGCCCG AAGGCAGGAT CTCGGCGCGC GTCGCGCGCT TCTTCGGGGC CGGCGCGTTC GGCGCCGGCG CGCACGCGGC GCGCGGCAGC CTCGTCGCAC ACGGTCTCGC CTTCGTCGCG CAGCCCGACT ACGACCGGCT GCTGTGGGCG AGCGACGTGA ACTTCGTGCG CGGCGAGGAT TCGTTCGTCC GCGCGCAATG GGCGCGCCGG CCGTTCGTCT GGCAGATCTA TCCGCAGGCC GACGACGCGC ATCTGCCGAA GCTCGACGCG GCGCTCGCGC ACGTCACCGC ACGCGTCGAT CACGCGACGC GCGCGGCGAC CGAGCGCTTC TGGCACGCCT GGAACGGCGC GGGCACGCCC GATTGGACCG ATTTCTGGCG GCACCGCGCG GCGCTCGCCG CGCGCGCCGC GAGTTGGGCG GACGAGCTCG CGGCCGTCGG CGACCTCGCC GGAAATCTGG CGAATTTTGC AAAAACTCAG TTAAAATAA
|
Protein sequence | MTSSAPLPPP ADTASPLQAA SPVACDIFCA VVDNFGDIGV CWRLARQLAL EHGWQVRIFV DALATFARLQ PAALPDAARQ TVDGIVVEHW RAPAHAGDTL EIADIVIEAF ACELPGAYVA AMARRARPPV WINLEYLSAE DWVGEFHLRP SPHPRYPLTK TFFFPGLGPG TGGVLKERDL DARRAAFETG DDARRTWWQN VAGAPMPAPD TTVVSLFAYE NPALDALLEQ WRDGREPVAL LVPEGRISAR VARFFGAGAF GAGAHAARGS LVAHGLAFVA QPDYDRLLWA SDVNFVRGED SFVRAQWARR PFVWQIYPQA DDAHLPKLDA ALAHVTARVD HATRAATERF WHAWNGAGTP DWTDFWRHRA ALAARAASWA DELAAVGDLA GNLANFAKTQ LK
|
| |