Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2956 |
Symbol | |
ID | 4886523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2808897 |
End bp | 2810768 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640132892 |
Product | hypothetical protein |
Protein accession | YP_001063947 |
Protein GI | 126442987 |
COG category | [S] Function unknown |
COG ID | [COG3519] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03359] type VI secretion protein, VC_A0110 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACGC GCCTGCTCGA CTACTACAAC CGCGAGCTCG CGTATCTGCG CGAGTTGGGC GGCGAGTTCG CGCAGCAGTT TCCGAAAGTG GCCGCGCGCC TGCGGATGCA CGAATCGGGG CCGCCCGATC CGTACGTCGA GCGGCTGCTC GAAGGCTTCA GCTTTCTCAC CGCGCGCGTG CAACTGAAGA TGGACGCGGA GTTTCCGCGC TTCACGCAGG CGCTGCTCGA CGCGGTGTAT CCGGGTTACG TCGCGCCGCT TCCGTCGATG GCGATCGTGC AGTTCACGCC GATGATGAAC GAAGGCAGCC TCGCGCAGGG CTACCGGCTG CCGGCGGGCA CCGCGCTGCG CGCGCGGCCC GCCGCGGCCG AACAGACCGC GTGCGAGTTT CGCACCGCGC ACGATCTGAC GCTGTGGCCG CTGGAGCTCG CGGGCGCTTC GGTGACGGGC GCGCCCGCGT ATCTGCCGCG TTCGGCGACG GCCGCGCGCC GCGACGTGCG CGGCGCGCTG CGCATCCGGC TGAAGGCGCG CGGCGGCGCG GGCCTCGCGC AACTGCCGAT CGATCGGCTG ATGTTCCACC TGGCGGGCCC CGAGCGCGAC GCGCTGCATC TGCTCGAACT GATCGCCGGG CATACGATCG GCGTCGTCTG CCACGACGCG GCGCAGCCGC CGCGCTGGCT GCACGCGCTC GGCGCGCACG CGCTCGCGCA TCAGGGCTTC GACGCCGATC AGGCGCTGCT GCCCGACGAA GGCCGCAGCT TCCACGGCTA CCGGCTGCTG CGCGAGTACT TCGCGTTTCC CGCGCGCTTC CTGTTCTTCA GCATCGAAGG ATTGCGGCCC GCGCTCGCGC GCGCGACGGG CGACACGTTC GAGCTGACGC TGCTGCTCGA TCGGCACGAC GCGGCGCTCG AGAACAGCGT CGATGGGCGG CGCCTCGCGT TGAACTGCAC GCCGGCCGTC AACCTGTTCG CGCGGCGCGC GGACCGCATT CCGGTCCATC CGGGCGCCCG CGAGCATCAT GTCGTCGTCG ATCGCAGCCG GCCGCTCGAC TACGAGGTCT ACGCGGTGCG GCGGCTCGCG GGCGAGCAGC GCGACGACGG GCGGACGCGC GCGTTCCGGC CGTTCCATGC GTCGTTCGCG GGCGACGGCG GCAATTACGG CGCGTACTAC ACGGTGCGCC GCGAGCCGCG CCTCGTGTCC GCGCAGGCGC GCGCGAACGG CACGCGCACC GGCTACGTCG GCAGCGAGAC GTTCGTGTCG CTCGTCGATA GCGCGTGCGC GCCGTATGAC GAATCGATCC GCTATCTGTC CGTCGACACG CTGTGCACGA ACCGCGATCT CGTCCTGCTG TTGCCGGCGG GCGACGCGAA CGCGTTCACG CTGCGCGTGT CGGCGCCCGT CGAGCGGATC GCCATGATCC GCGGGCCGTC GCGGCCGCGC CCGCCGCTCG CCGACGCGCA GAGCGCGTGG CGGCTCGTGA GCCATCTCGG GCTCGCGCGC CACACGCTGA CCGATGTCGA CGACGAAGAA GGCGCGCGCG TGCTGCGCGA ATTGCTCGGT CTGCACGCGG ACCCGGCCGA TGCGGCGATG CGCCGGCAGA TCGACGGCGT GCATCGTGTC GCGTTCGCGC CGGTGTTTCG CCGGCTGCCC GCCGCCGGGC CGCTGATGTT CGGGCGCGGC GTGCAGGTGG ACGTGACCGT CGACGATCAT GCGTTTTCCG GCGACAGCCC CTATTTGCTC GGCGCGGTGC TCGAGCAGTT TTTCGCGCGG CACGTGTCGA TCAACTCGTT CGCCGAATGC GTGCTGAGCA GCGCGCAGCG CGGCAGGCTC GCGCAATGGC CGGCGCGCGT CGGCAGGCGG CCCGCGATAT GA
|
Protein sequence | MDTRLLDYYN RELAYLRELG GEFAQQFPKV AARLRMHESG PPDPYVERLL EGFSFLTARV QLKMDAEFPR FTQALLDAVY PGYVAPLPSM AIVQFTPMMN EGSLAQGYRL PAGTALRARP AAAEQTACEF RTAHDLTLWP LELAGASVTG APAYLPRSAT AARRDVRGAL RIRLKARGGA GLAQLPIDRL MFHLAGPERD ALHLLELIAG HTIGVVCHDA AQPPRWLHAL GAHALAHQGF DADQALLPDE GRSFHGYRLL REYFAFPARF LFFSIEGLRP ALARATGDTF ELTLLLDRHD AALENSVDGR RLALNCTPAV NLFARRADRI PVHPGAREHH VVVDRSRPLD YEVYAVRRLA GEQRDDGRTR AFRPFHASFA GDGGNYGAYY TVRREPRLVS AQARANGTRT GYVGSETFVS LVDSACAPYD ESIRYLSVDT LCTNRDLVLL LPAGDANAFT LRVSAPVERI AMIRGPSRPR PPLADAQSAW RLVSHLGLAR HTLTDVDDEE GARVLRELLG LHADPADAAM RRQIDGVHRV AFAPVFRRLP AAGPLMFGRG VQVDVTVDDH AFSGDSPYLL GAVLEQFFAR HVSINSFAEC VLSSAQRGRL AQWPARVGRR PAI
|
| |