Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2544 |
Symbol | |
ID | 4887937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2454523 |
End bp | 2456472 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132480 |
Product | hypothetical protein |
Protein accession | YP_001063536 |
Protein GI | 126444793 |
COG category | [S] Function unknown |
COG ID | [COG4655] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.7801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCACT GGCGCGATAC GCGCGCAGCC GGCGACATCG CGCGATGTCG GCCGGCCGGA TGCGCCGCGG ACGACCACAC AGGCCCGATG ACGCCGCTCG ACCGCGCATC GCGTCGACAA CGACACGCAA CGCCGACGAA GCACTACGAG GTTGGAATGA ACGAGCCGAC ACGCCAGACC ACCGGGGACC TTTCCGAGCG TCGCGGGCCG AATCGACGGC GCCGCGCGCC GCGCTCGCCG GCGCGCCAGC GCGGCTCGCT CGCCATCATC GCGGCAATCG CGATCGGCGT CGTGATCGCC GCGCTCGGCG CGGTCGACCT CGGCAATCTG TTCTATCAGC GCCGCGCGCT GCAAAGCATC GCCGACCTCG CCGCGCTCGC GGCCGCGCAG ACGATGGACG ACGGCTGCGC GAAGCCGGCC GCCACCGCGC AATCGGCCGC GCTCGGCAAC GGCTTCGACA GCACCGCGTC GGGACAATCG ATGACGGTCG TCTGCGGCAG ATGGGACGTG AAGGACAACG TCGGCCCGAG CTTCTTCGCG GGTTCGGCAT CGGGCGCCGC GGCCGGCAGC GACGCGCAGC TCAACGCGGT TCAAGTGACG GTCACGCGCG CCGTGCCTTA CTACTTCCTC GGCGCGCAGC GCACGATCGC GGCGACCAGC ACCGCAGAGG CGACCAATGT CGGCGCCTAC TCGATCGGCA CGACGCTCGC GCAACTGCAA GGCGGCGTCG TGAACGCGCT GCTCAACGGG CTGCTCGGCA CGAATCTGAA TCTGTCGGTG TTGTCGTATC AAGGCCTTGC CAATGCGCGA ATCAGGATCA AGGACCTGAT GGCCGCCGCG AACGTCGGCA CCGTGAGCGC GCTGCTGAGC ACGCAGACGA CCGTCCCGCA GCTCGCGAAC TGGATGCTGA GCGCGCTGTC GCAGACCTCG GTCGCGAATG CCGACTTGCA GACGAGCATC GGCGCGCTAC AGACGATCGT CAGCGCGAAC ATTCCGGGCG GCCGGACTTT CACGATCGGC AACACCGCGA ATTCGGCGGG CATCTTCTCG ATCGGCCTGT CCAATCCGCA GGCCGCGCTC GACGCGACAT TCAGCCCGTT CGACGCACTT CTCGTCGCGG CCGAGATCGC GACCGGGCAA ACGGCGTTCT CGCTCGCGAA CGGGCTGAAC ATCGGCGGGT TGAACGCGAA TCTGCAAGTG CAGATCATCC AGCCGCCCGT GCTCGGCATC GGCGAAGCGG GCATCGACCC CGTCACGAAA ACGTGGCGCA CGATCGCACG CACCGCGCAG GTGCGACTCT ATCTGAACAT CGGACTCGGC ACGGCGAACC TGCCGCTCGG GCTGCTCGGC GCGCTCCTGC CGGTGCAGGT GAATCTGCCG CTATCGCTGC AGATCGCGCC GGGCCAGGCG TGGCTGCAAT CGGCGAGCTG CACGGCGTCG CCGTCGACTT GCGCCTCGGC CATCGGCGTG CAGACGGGCC TCACGAATCT GTGCATCGGC GACACGCCGG CCAACATGTC CGCGTCGCTG CCGTTCACCT GCTCGACGCC CGCGACGCTC GTCAATGTCG CGAACCTCGT GACGATCAAG TCGCTCGTGT CGTTCCCGGC CGACGTGCCC GCGAGCCAGA CGCCGACGCT CACGTTCTAC GGCACGACGG GCGGCTATCA GAGCACGAAC TCGAACGGCG TCGGCAGCGT GCTCGGCAAT GCGCTGTCCG GCCTCGGCGC ATCGCTGCAG CAGACGCAGA TCTCGCTGTT CGGCATCAGC CTGCCGCTCG GCCCGATCCA GACCGCGCTC AATGCGTTCC TGGGCGGCGT GCTACCGCCG CTGCTGTCGG GGCTCGACGC CGCGATCGTG CCGCTGCTGC AACTGCTAGG CGTGCAGGTC GGCGAAAGCA CGATTCACGA CATGTCGCTG ACTTGCGGGG TGTCGCAGCT CGTCTATTGA
|
Protein sequence | MMHWRDTRAA GDIARCRPAG CAADDHTGPM TPLDRASRRQ RHATPTKHYE VGMNEPTRQT TGDLSERRGP NRRRRAPRSP ARQRGSLAII AAIAIGVVIA ALGAVDLGNL FYQRRALQSI ADLAALAAAQ TMDDGCAKPA ATAQSAALGN GFDSTASGQS MTVVCGRWDV KDNVGPSFFA GSASGAAAGS DAQLNAVQVT VTRAVPYYFL GAQRTIAATS TAEATNVGAY SIGTTLAQLQ GGVVNALLNG LLGTNLNLSV LSYQGLANAR IRIKDLMAAA NVGTVSALLS TQTTVPQLAN WMLSALSQTS VANADLQTSI GALQTIVSAN IPGGRTFTIG NTANSAGIFS IGLSNPQAAL DATFSPFDAL LVAAEIATGQ TAFSLANGLN IGGLNANLQV QIIQPPVLGI GEAGIDPVTK TWRTIARTAQ VRLYLNIGLG TANLPLGLLG ALLPVQVNLP LSLQIAPGQA WLQSASCTAS PSTCASAIGV QTGLTNLCIG DTPANMSASL PFTCSTPATL VNVANLVTIK SLVSFPADVP ASQTPTLTFY GTTGGYQSTN SNGVGSVLGN ALSGLGASLQ QTQISLFGIS LPLGPIQTAL NAFLGGVLPP LLSGLDAAIV PLLQLLGVQV GESTIHDMSL TCGVSQLVY
|
| |