Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3887 |
Symbol | |
ID | 4900326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 3791874 |
End bp | 3793514 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640137113 |
Product | hypothetical protein |
Protein accession | YP_001068108 |
Protein GI | 126454822 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.333396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCAAGG TCGATTTCCT GCATGCACAG CGGCATCTGT CGGACTCTGG TGGCGGAGCT CGATCCGAAG ATTTGTCGCG GTGCTTGAGT CGCTTCTATG AAAGAAACCT CGAGAAACAC GAGGAAGACT ATGACGCCCA ACGCGCACTC TATCAGTCGG AGGCACTGCT CAACGACCAT CTCGAACGTG TTTTCGGCCC GACACTCGAG CGGCTCTCGC ATCTCGGCTA TCCAGGCCTC AACAATCCCA AGCTGTTGAT CAAGACCGCA CTGAATCCTG CAACTATTTT GAGCAGCCAT GATGGTGCTC GGGTGCATTA CGCCGTTGGA CCAGAAGTAG GCATAGACAC GCAGACGACA CTGCCGGATC GCTACAACGG CCTGGGGTTC AAGAACCTTA TCTATATGGT GGTAGAGCTT CTAGACGCGC ATGCCAGATG GCTAGATATA GAAGAGAACC GTCCGCCTTT GCATCTGATC TTCATCGAAG AACCCGAAGC ACATCTGCAC GCTCAGTTGC AGCAGGTATT CATTCGGAAG GTTCTGGACA TTCTCGATAT TCCCGATGAA GATGCACCGC ACTGCACCAC TCAGATTGTT GTTTCGACAC ATTCCCCGCA CGTGCTCTTC GAGCGCGGCT TCCAACCGAT TCGCTATTTC CGCCGCGTTC GCGAGGCCGG AGTTCAGCGA TCGGAAGTTC TCAGTATGGC TTCGTTCTAT GAAACAGCTA ACGACCCGAA CGATCCTACT GACCGTACCC GGGATTTCCT CGAGCGCTAT CTAAGACTCA CGCATTGCGA TCTATTTTTC GCCGATGCTG CCATTCTGGT AGAAGGTAAC GTTGAACGCT TGCTGATGCC GCAGATGATT GCGAAGGTTG CGCCCGGCCT TCTGTCCACC TACGTGAGCA TTCTAGAAGT CGGAGGAGCA TTCGGCCATC GATTCAAAGG TCTTATCGAG TTCCTGGGGC TCACTACCCT GATCGTCACC GACATCGATA GTGTTATGGC GCCTGCGGCA GCCGATCCAG TAGCCCCCGA CGATGAGGAT GATCCCGATG CCGATCCCGA CCTTGAAGAA GCAGCCGAAG ATCGAGCCGC AGTTCGAGGC GGACGCAAAG CATGCATGGC CAATGAGCCC GGAGCATTGA CGTCAAATCA AACGCTAATC CAATGGCTTC CCGGGCGCAG CACCATCGCC GATTTGCTCA ACGCGACTGT CGAACAGCGC ATTCAAGTCC GCACTGTTGC TAGCGATGCG TTGATCCGCG TCTCATATCA GACATCGGTT AACGCAAGTT GGCGTGGGAC GACAGCTGCC ATGGTCGGTC GAACATTGGA GGAAGCCTTC GCGCTCGAGA ATCTGGCCTG GTGTCAGGAT GCGGCACGCG CCGAAATCCG CCTGCGCGTT CGAGGCTGCA ACAAATTGTC CCTTGAACAG ATCGCACAAA GTCTGCATCG CAAAATCAAG AGCGCGAATT TCCGCAAGAC CGACTTTGCA TTGGCGCTTC TCTCACAGGA CCCATCCGCG TGGACGGTGC CAGCCTACAT CTCTGAAGGT CTTCTCTGGC TCGAGCATGA GGTGGTTCCC CATCCTCCCG AAATCGCGGC TCCTCCGATC AACGCAGGAG CAGTAGCATG A
|
Protein sequence | MIKVDFLHAQ RHLSDSGGGA RSEDLSRCLS RFYERNLEKH EEDYDAQRAL YQSEALLNDH LERVFGPTLE RLSHLGYPGL NNPKLLIKTA LNPATILSSH DGARVHYAVG PEVGIDTQTT LPDRYNGLGF KNLIYMVVEL LDAHARWLDI EENRPPLHLI FIEEPEAHLH AQLQQVFIRK VLDILDIPDE DAPHCTTQIV VSTHSPHVLF ERGFQPIRYF RRVREAGVQR SEVLSMASFY ETANDPNDPT DRTRDFLERY LRLTHCDLFF ADAAILVEGN VERLLMPQMI AKVAPGLLST YVSILEVGGA FGHRFKGLIE FLGLTTLIVT DIDSVMAPAA ADPVAPDDED DPDADPDLEE AAEDRAAVRG GRKACMANEP GALTSNQTLI QWLPGRSTIA DLLNATVEQR IQVRTVASDA LIRVSYQTSV NASWRGTTAA MVGRTLEEAF ALENLAWCQD AARAEIRLRV RGCNKLSLEQ IAQSLHRKIK SANFRKTDFA LALLSQDPSA WTVPAYISEG LLWLEHEVVP HPPEIAAPPI NAGAVA
|
| |