Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0710 |
Symbol | |
ID | 4904127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 697847 |
End bp | 700063 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640143816 |
Product | hypothetical protein |
Protein accession | YP_001074746 |
Protein GI | 126456216 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACATA TCAAACCACA AGCGGCCCTC GTGGCCACGA CCAACACGCA GATCGGCGCG CAGCCGATGC TCGGGATCAG CGTCGGGATC GGGTTCCGGC TCGATCAGCC GTCGATTCTC GTGCACGAAG CCGCCGTCTG GGAGGCGTTG AAGGCGGCCG CGCCGTCACT GCCGCTGTAT GAAGCCGCCT TGCCGAAGCA GCGCGCCGAA TGGCTGCTCG CCGGCCACTC CGTGCATGCG GTGGGCGCCG GCGCCCGCGC GCGGGACGTC GACTGGACGG CGTGGGTCGA ACTCGACGGC GTGCGCAAGG TCGTTTCGTG CGCGACGTCA CTGGGCGACG AACAGGCGCA GAGCGGTTAC GCGCGCATCG CGGTCGATCA CCGGCACGCG GCGGCGGGCG GCGCGCGGGA GAACCCGTTC GGCGTGGCGT CCGGCACGCC GCCGCTGCAG CAACTGCGCA CGTTCGGTGT CGGCCCCGCG CCGCTTGCGG CGATGGGCGC GATCAACCCC GACTGGCCGG AGCGCGCGCA GTGGATGCCG ACGCGGCCCG GCACGCTCGA CGCGATGGCG CAGGACGGCA CCCACATGGG CTGGCCCGCG GAAGTCGACC TGCGCTTCTT CCAGCAGGCC GCGCCCGACC AATGGGCCCG CGGCGAATGC TGGACGCCCG GCGCGCGTTT CGAGCTGAGC GGCTTCGGGC CGCGGGGCGA GGGCTTCGCG GGCGAACTGC CGCGTCTCGC GCCGGTCGCG CTCGTGACGC GCAACGGCCG CCCGGGTATC GAGCGGCTGT CGTTCAAGCA GCAGACGGCG TGGTTCCTGC CCGATCGCGG CATCGGCGTG CTGTGGTGGA ACGGCGCGGT CGCGCTCGAT TTCCTGCTCG ACGACAGCCC GACGATGCTC GTCACCGCAT TCAAGGACGA AGCCGAGCGG ATCGACGTCG ACGCGCTGAT GAAGTTCGCC GATCAGCGTG CCGACCTGAA CTGCACCGAT CCGCTGCAGC AGGCGGATCA CGAACTGATG CCCGCGATTA CGAGGGGCTG GACCTGGGAG ATGATCCTCG ACACGGAAGA CCACCCGCGT TTCGCTCCGG CGCCGCGCGG CTACGAAGAA GTCCGTGCGC GGGTCGAGCA GAATCGTCGC GAGTTGGTCG AGGCGCGCGA TGCGAGCGAG CGGCTGTCGG CGTTCGAGGA AGCGAACCGC AACGCGAAGC TGCCGGGCGC GCCGCGCGGC GGAGAGAACT GGCGCACGCG GCTGCGTCAG GCGAAGACGC CCGAGCTCGC GAACGTGACG ATTCGCGACG CCGATCTGTC GTCGCTGCGC TTTGACGGCT GGAAGTTCGA CGACGTGCGC TTCGAGCGCT GCACGCTCGA TCGCAGCGAA TGGACGAACT GCCGGCTCAA TCAGGTGCAT GCGGTCGACT GCTCGTTCGC CGACGTCAAG ATGAGTGACG GCTGGTGGAA GGGCGGCAAG ATCCAGCGCT GCAATCTCGA ACGCAGCGCG TGGTTGAACG TCGAGATCGA GCGGATCTCG CTCGACGAAT GCCGGCTCGA CGATCTGAAG GTGGCGGGCG GATCGTGGTC GATGCTGTCG GTGCAGGGGC GCGGCGGCGT GCGCGGCGAC GTTCAGGACG TCCAATGGAA TTCGGTGTCG TGGTCCGAGG TGAGCGCGCC CGGCTGGACC TGGACCCGCG TGCGCGCCGA CGATCTCGCG ATCGTCGAAT GCGCAATGGC GGGCCTCGCG GTATCGCAGT GCACGCTCGC GAAGCCGAGC ATCCTGCTCA CCGACCTGTC CGCGAGCGTC TGGCAGCGCA GCATGCTGAC GTTCGCGGTG CTGTCGCACG GCACGTCGAT CAACGGCGCG CGGCTCACCG ATTGCGTGTT CAAGTCGTCG AGCCTGCAGG AGCTGCGTGC GGATCGGGTT CAGGTCGATC ACTGCTCGTT CATGCAATTG AACGCGCAGC ATCTGCACGC GCAGCAGTCG CATTGGAGCC GCACGGTGCT CGACGGCGCG AACGTGATGC ATGCGCAACT GACGGGCACG TCGTTCGACC GCTGCTCGCT GAAGGAGGCG ATGTTCTATG GCGCCGACAT GCGGCAGACG CGCATGCGCG ACTGCAATCT CGTCAGGGTC CGCACGTCGT GGATCCATCC GCCGGAAGCG GGCGCGTGGC GCGGCAATCT GAGCGCCGGC CAGCTCGACG TGCCGAGGAG GGTGTGA
|
Protein sequence | MRHIKPQAAL VATTNTQIGA QPMLGISVGI GFRLDQPSIL VHEAAVWEAL KAAAPSLPLY EAALPKQRAE WLLAGHSVHA VGAGARARDV DWTAWVELDG VRKVVSCATS LGDEQAQSGY ARIAVDHRHA AAGGARENPF GVASGTPPLQ QLRTFGVGPA PLAAMGAINP DWPERAQWMP TRPGTLDAMA QDGTHMGWPA EVDLRFFQQA APDQWARGEC WTPGARFELS GFGPRGEGFA GELPRLAPVA LVTRNGRPGI ERLSFKQQTA WFLPDRGIGV LWWNGAVALD FLLDDSPTML VTAFKDEAER IDVDALMKFA DQRADLNCTD PLQQADHELM PAITRGWTWE MILDTEDHPR FAPAPRGYEE VRARVEQNRR ELVEARDASE RLSAFEEANR NAKLPGAPRG GENWRTRLRQ AKTPELANVT IRDADLSSLR FDGWKFDDVR FERCTLDRSE WTNCRLNQVH AVDCSFADVK MSDGWWKGGK IQRCNLERSA WLNVEIERIS LDECRLDDLK VAGGSWSMLS VQGRGGVRGD VQDVQWNSVS WSEVSAPGWT WTRVRADDLA IVECAMAGLA VSQCTLAKPS ILLTDLSASV WQRSMLTFAV LSHGTSINGA RLTDCVFKSS SLQELRADRV QVDHCSFMQL NAQHLHAQQS HWSRTVLDGA NVMHAQLTGT SFDRCSLKEA MFYGADMRQT RMRDCNLVRV RTSWIHPPEA GAWRGNLSAG QLDVPRRV
|
| |