Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2444 |
Symbol | |
ID | 4885528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2410198 |
End bp | 2411469 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640128373 |
Product | hypothetical protein |
Protein accession | YP_001059477 |
Protein GI | 126439009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.145417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAGT TGACACTCGG ATTGATCGGC GCGGGCGCCG TCGTGGTGGG CGGCGTCGTG GTCTACAACG CGTGGCAGGG CGCGAAGGTG CGCCGCAGGA TGCCGCGCCC GATGCCGAGC GAGGCGGAGG CCGCCGCACG GCATGAGCGC GAAGACGACG CGCCTTTCAT CGAGCCGGTG CGCCAGCCGG CGCGCCGCGA CGCGGCGGCG GGCGCCGCGG CGGCGGGCGC GGCGGGCGGC GATGCGGTGC GCGTCGAGCC GACGTTCGGC GGCGGCGCGG CGCCCGCCGA TACGCCGGCC GATCTTCAGG CCGAGGCGAG CGTCGCGAAC GGCGCGCTCG AGCCGGCCGC CGAGGCCGCG CATGGCGGCG AAGCGGCCGT CGCGGCCGAT GCCGCGCGCG ACGAGCCGCT CGAGCCGGTG CTGCCCGCCG CGACGACGAT TTCCGCGGCG CCGCCGGCGG TCGTCGATCG CCGGATCGAC TGCATCGTGC CGATCCGCCT CGCGAGCCCG CTCGCGGGCG ACAAGATCCT GCCCGCCGCG CAGCGGCTGC GCCGCGCGGG CAGCAAGCCG GTGCACATCG AGGGCAAGCC CGACGGCGGC GATGCATGGG AGCTGCTGCA AAACGGCGTG CGCTACGAGG AGCTGCGCGC GGCCGCGCAG CTCGCGAACC GCAGCGGCCC GCTCAACGAG CTCGAGTTCT CCGAGTTCGT GACGGGCGTG CAGCAGTTCG CGGACGCGAT CGACGGCGCG CCGGAGTTCC CGGACATGAT GGAAACGGTA TCGATGGCGC GCGAGCTCGA CGGCTTCGCC GCCCAATGCG ACGCGCAACT GTCGATCAAC GTGATGTCGG ACGGCGCGCC GTGGTCGGCC AATTACGTGC AGGCGGTCGC GTCGCAGGAC GGGCTGCTGC TGTCGCGCGA CGGCACGCGC TTCGTGAAGC TCGACGCGAA GCAGAACCCC GTCTTCATGC TGCAGTTCGG CGATACGAAC TTCCTGCGCG ACGATCTCAC GTACAAGGGC GGCAACCTGA TCACGCTCGT GCTCGACGTG CCGGTGGCCG ACGAGGACAT CCTGCCGTTC CGGCTGATGT GCGATTACGC GAAATCGCTG TCCGAGCGCA TCGGCGCACG CGTCGTCGAC GATCAGCGCC GGCCGCTGCC CGAATCGACG CTGCTCGCGA TCGAGCAGCA ACTGATGAAG CTGTACGCGC GGCTCGAGGA GGCCGGCATT CCGGCCGGCT CGCCCGTCAC GCGGCGGCTG TTCAGTCAGT AG
|
Protein sequence | MDELTLGLIG AGAVVVGGVV VYNAWQGAKV RRRMPRPMPS EAEAAARHER EDDAPFIEPV RQPARRDAAA GAAAAGAAGG DAVRVEPTFG GGAAPADTPA DLQAEASVAN GALEPAAEAA HGGEAAVAAD AARDEPLEPV LPAATTISAA PPAVVDRRID CIVPIRLASP LAGDKILPAA QRLRRAGSKP VHIEGKPDGG DAWELLQNGV RYEELRAAAQ LANRSGPLNE LEFSEFVTGV QQFADAIDGA PEFPDMMETV SMARELDGFA AQCDAQLSIN VMSDGAPWSA NYVQAVASQD GLLLSRDGTR FVKLDAKQNP VFMLQFGDTN FLRDDLTYKG GNLITLVLDV PVADEDILPF RLMCDYAKSL SERIGARVVD DQRRPLPEST LLAIEQQLMK LYARLEEAGI PAGSPVTRRL FSQ
|
| |