Gene Information |
Locus tag | BMASAVP1_0732 |
Symbol | |
ID | 4678476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | - |
Start bp | 741746 |
End bp | 743404 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639843256 |
Product | hypothetical protein |
Protein accession | YP_990337 |
Protein GI | 121597986 |
COG category | [S] Function unknown |
COG ID | [COG3455] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03349] type IV / VI secretion system protein, DotU family |
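The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the record above.
START_BP = 741746
END_BP = 743404


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not code for an amino acid.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1659 552
```

Both results agree with the Gene Length (1659 bp) and Protein Length (552 aa) fields listed in the record.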
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0245739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG CTGTCGAGCG CGGCGATGAC GAGCTCCGCC GTATCGACCA CCGGCACGAC GAACACGGTG GCGTCGGGCG GCGCGGCCGC GGGCGCGCCG AGCTTCGCGC CGCCGGCCGC CGACGCGTTT CCCCAAGCCG ACGGCGCGCC CCGCAACCCC GCGGTGCTGC AATTCCCGGT GCCGGGCGGC GCGCCCGCGC CCGCCGACGC GCGGGTGGCC GCGCCCGTCG TCTATAGCGC GCAGGGCGAG CAGGCCGCGA TCATGAAAGC AGGCCTGCAG CAGGCGAGCT GGAACAACCC GTTCGTGTCG CACGCGCTGC CCGCGGTGCT GCAACTGCAG CGCCACCTCG CGGCCGGCCC GCTCAATCAG GCCGCGATCC GCACGCAGCT CGGCCTCGAG GTGCGGCTCT ACCGCGAGCG GCTCGCCGCC TCCGGCTGCG AATGGGAGCA GATCCGCGAC GCATCGTACC TGCTCTGCAC GTATCTCGAC GAAACCGTCA ACGACGCGGC GCGCGAGCAC GCGCAAGTCG TCTACGACGG CGAGCGCAGC CTGCTCGTCG AATTCCACGA CGACGCGTGG GGCGGCGAGG ACGCGTTCGC CGACCTGTCG CGCTGGATGA AGACCGAGCC GCCGCCGATT CCGCTTCTGT CGTTCTACGA ACTGATCCTG TCGCTCGGCT GGCAGGGCCG CTACCGCGTG CTCGACCGCG TCGACGTGCT GCTGCAGGAT CTGCGCTCGC AACTGCACGC GCTGATCTGG CATCACGTGC CGCCCGAGCC GCTCGGCACC GAGCTCGTCG CGCCCGCGAA GCGGCGCCGC TCGTGGTGGA CGGCCGGGCG CGCGGCGGCC GTCGCGCTCG GCGTGCTGGT GCTCGCGTAC GGCGCGATCA GCTTCTGGCT CGATTCGCAG GGCCGCCCGA TCCGCAACGC GCTCGCCGCG TGGATGCCGC CCACGCGCAC GATCAACATC GCCGAGACGC TGCCGCCGCC GCTGCCGCAG ATTCTCACCG AAGGGTGGCT CACCGCGTAC AAGCATCCGC AAGGATGGCT GCTCGTGTTC AAGAGCGACG GCGCGTTCGA CGTTGGCAAG GCGAACGTGC GGGCGGACTT CATGCACAAC ATCGAGCGGC TCGGCCTCGC GTTCGCGCCG TGGCCGGGCG ACCTCGAGGT GATCGGCCAC ACCGATTCGC GGCCGATCCG CACGAGCGAG TTCCCGGACA ACCAGGCGCT GTCCGAAGCG CGGGCGCGCA ACGTCGCCGA CGAACTGCGC AAGACCGCGC TGCCGGGCGG CGCGCGCGCG CCGGAGAACG CGGTGCAGCG CAACATCGAG TACTCGGGGC GCGGCGACGC GCAGCCGATC GACACCGCGA AGACGGCCGC CGCGTACGAG CGCAACCGCC GCGTCGACGT GCTGTGGAAG GTGATTCCCG ACGGCGCGCA GCAATCGGGC CGCAGCCTGA ACCTGCAGCA GCCGGAGAAG CCCGGGCAGG TGCCGATGCG TCCGGCGATG CCGGAGGGCG TGGAGATCGC GCCTGACGGG CAACTGCCGT ATGCGACGTC AACCACGATG CCAGCAACGC GACCGACCAC GGAGGGCCGT CAGCCATGA
|
Protein sequence | MLDRAAANPD LVTRVPTLSS LSSAAMTSSA VSTTGTTNTV ASGGAAAGAP SFAPPAADAF PQADGAPRNP AVLQFPVPGG APAPADARVA APVVYSAQGE QAAIMKAGLQ QASWNNPFVS HALPAVLQLQ RHLAAGPLNQ AAIRTQLGLE VRLYRERLAA SGCEWEQIRD ASYLLCTYLD ETVNDAAREH AQVVYDGERS LLVEFHDDAW GGEDAFADLS RWMKTEPPPI PLLSFYELIL SLGWQGRYRV LDRVDVLLQD LRSQLHALIW HHVPPEPLGT ELVAPAKRRR SWWTAGRAAA VALGVLVLAY GAISFWLDSQ GRPIRNALAA WMPPTRTINI AETLPPPLPQ ILTEGWLTAY KHPQGWLLVF KSDGAFDVGK ANVRADFMHN IERLGLAFAP WPGDLEVIGH TDSRPIRTSE FPDNQALSEA RARNVADELR KTALPGGARA PENAVQRNIE YSGRGDAQPI DTAKTAAAYE RNRRVDVLWK VIPDGAQQSG RSLNLQQPEK PGQVPMRPAM PEGVEIAPDG QLPYATSTTM PATRPTTEGR QP
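The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the grouping, compute a GC percentage, and run a basic sanity check on a coding sequence. The helper names are hypothetical, and the check accepts only ATG as a start codon, which is a simplification (translation table 11 also permits alternative start codons). Only a short excerpt of the gene sequence is used here; the full sequence from the record can be passed in the same way.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon.

    Simplification (assumption): alternative start codons are not accepted.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First three display blocks of the gene sequence, copied from the record.
excerpt = clean_sequence("ATGCTCGATC GCGCGGCGGC GAATCCGGAT")
print(len(excerpt), round(gc_percent(excerpt), 1))
```

Applied to the full gene sequence, `gc_percent` can be compared against the GC content field of the record, and `looks_like_complete_cds` against the stated gene length and the start and end of the listed sequence.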