Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2389 |
Symbol | |
ID | 4888851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2310841 |
End bp | 2312088 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640132327 |
Product | integrase |
Protein accession | YP_001063384 |
Protein GI | 126444968 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.557961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCGC GTCAGATGAA TCGATTGACC GCGCTCGGCA TCGGCAAGCT CGTTGATCCG GGATATTACG CTGACGGCGG CGGCCTGTAC TTGCAGATCA GCGCGAGCGG ATCGCGGTCA TGGATCTACC GCTTCTCGCT CGCCGGCCGC GCGCGGGAGA TGGGCCTCGG CTCGCTGTCG GTGTTGCCGC TCGCCGCGGC GCGCAAGGTA GCGGCAGACT GCCGCGCGAG CGTGAAGCAT GGCATCGATC CGATCGCTGC GCGGCGGCGC GCGCAGGTTA TGCGAGCCGC CGAGGGAGCG CCCGGCGTGA CGTTCAGGCA GGCGGCCGAG GCATTCATCG CCGATCGCGC GTCGGGCTGG CGCAACACGA AACATGCGAA GCAGTGGACA TCCACCCTGG AAGCCTACGC CTATCCCGTG ATCGGCGATA TCGACGTGCG CGACATCGAC ACGGAAATGA TCGTGCGCAT CCTGCAGCCG ATCTGGATGA AGAAGGGCGA GACGGCGCGG CGCGTGCGCG GGCGCGTGAA AGCGATCCTC GATGCCGAGA CAGTGCTCGG CCACCGGACA GGCGACAACC CGGCGCGCTA CGTCGACCAC CTCGATCGCG TGCTGCCGCG GGTGAAGAAG CGCAACAGCG TGAAGCATCA CCCGGCGCTG TCGTGGGAGG AGATGCCCGC GTTTTTCGCG GCGCTGCGCC AGCGCCCCAA GCGCGCCGCG CAGGCGCTGC GTCTGCTGAT CCTCACGGCG ACGCGCACGA ATGAAGTATT GTTCGCGCGG CCTGAGGAGT TCGACCTCGA TGCGCGCGTC TGGACAATTC CGGGTGACCG GATGAAAGCA GAGCAGGAGC TGCGCGTGCC CCTGTGCGAC GAAGCCGTCG AGCTCGTGCG CATGCAGATC GCGACAAAGG CAAAGTGGGG ATGGCTGTTT CCGGGGTACA AGGAGGGGCG CCCGCTGTCG AATATGGCTA TGCTCCTGTT GCTGCGCCGC ATGGACCGCA GCGACATCAC AGTGCACGGG TTCCGTTCGA CGTTCCGGGA TTGGATTGCG GACTGCACAG ACTATCCCGA TTCACTCGCC GAGCAGGCGC TCGCGCACAC GATCTCGTCG ACGACCGTTT CCGCATACCG GCGCCGAGAT ATGCTCGAGC GCCGGCGCGG GATGATGGAG GACTGGGCGC GGTACTGCGC GGGGCAGACC GCGACCGTTG TGCCGTTCAC GCACCCTGCT GCGCACACAG CTGCGTAA
|
Protein sequence | MASRQMNRLT ALGIGKLVDP GYYADGGGLY LQISASGSRS WIYRFSLAGR AREMGLGSLS VLPLAAARKV AADCRASVKH GIDPIAARRR AQVMRAAEGA PGVTFRQAAE AFIADRASGW RNTKHAKQWT STLEAYAYPV IGDIDVRDID TEMIVRILQP IWMKKGETAR RVRGRVKAIL DAETVLGHRT GDNPARYVDH LDRVLPRVKK RNSVKHHPAL SWEEMPAFFA ALRQRPKRAA QALRLLILTA TRTNEVLFAR PEEFDLDARV WTIPGDRMKA EQELRVPLCD EAVELVRMQI ATKAKWGWLF PGYKEGRPLS NMAMLLLLRR MDRSDITVHG FRSTFRDWIA DCTDYPDSLA EQALAHTISS TTVSAYRRRD MLERRRGMME DWARYCAGQT ATVVPFTHPA AHTAA
|
| |