Gene Information |
Locus tag | Bcep18194_A4976 |
Symbol | |
ID | 3750184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 2004236 |
End bp | 2005831 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637763272 |
Product | transposase IS66 |
Protein accession | YP_369214 |
Protein GI | 78066445 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000420547 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.71393 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAATC TCGACGATCT GCCCGACGAT GTCGCTGCAT TGAAGGCCAT GCTGGCTGAG GCTCGCGCGT CGGCAATCGA AAGAGAACTC GAAATCGAGC AACTGCGCCG CGAGATTGCC GAGAGCGATC TTGAGATTGC ACGGCTGAAA CTCCTGATCG ACAAGCTCAA GCGAATGCAG TTCGGCCGCA AGTCGGAGCA GCTCGCCCGA GAGATCGAGA GGCTTGAATT GCGGCTCGAG GATCTGAGCG CCGGGAGCAG CGTCGCCGAC GTGCAGCATG CGAAGGTCCG GCGAGAAAAG CCAGCGACAG GTGGCGAGTC GTCGGCCCGG GAGCCGCTGC CGCCACATCT GCCGCGTGAA GACCGCGTGC TCAAACCGGA TTCGATCTGC CCCAAGTGCG ATGGCACCAT GCAGAGCCTC GGCGAGGACG TCTCCGAACA GCTCGCCCGC GTCGCGGCGA TGTTCAAGGT GATTCGCACG ATCCGGCACA AGATGGTCTG TCCCAGCTGT GGCCACATCG AGCAGCCGTC GATGCCGGGG TTGCCGATCG AGCGTAGCAT CGCCCATCCG AGTCTGCTTG CCGACATCCT GGTATCGAAG TACGCAGATC ATGCGCCGCT GTACCGCCAG TCGGAGATCG CCGCGCGCGA CGGTGTGACG CTTGATCGCG CCAGCATGGG CCGCTGGGTC GGGCAATGCG AGGCGCTCTG CCGCCCGCTG ACCGACGCAC TGCGCCGGTA CACGATGGCG GGCACGAAGC TGCACGCGGA CGACACGCCG ATCCCCGTGC TTGCGCCGGG CAACAAGAAA ACGAAGACCG GACGACTCTG GGTGTACGTG CGCGACGACA GCCGTTCGGG CTCGACGGAG CCGGCCGCGG TGTGGTTCGC GTACTCGCCC GATCGCAAAG GCATCCATCC CCAGACCCAT CTCGCTGGAT TCGAAGGCAT CCTGCAAGCC GATGCATATG GCGGCTTCGA CGAGCTGTAC GTGAACGGCA AAATCTGCGA GGCTGCGTGT TGGGACCACG CGCGGAGAAA ATACTACGAG GTCCACGCTA GCACGCCGAC GGACGAAACC AAGAGCTTGC TCGAAATGAT CGGCGAGCTC TACAGCATCG AAGCCGACAT CCGTGGCAAG CCGCCCGATG AACGAAAGCG CGTGCGGCAT GAGAAAAGCA AGCCACTGCT GGAGGCCTTC GAAGCGAGGA TTCGGGGCAA GCTCGCAACG CTGTCGCGCA AGTCGGAGCT GGCCGGCGCG ATCCAGTACT CGTTGAATCA CTGGAATGCT CTGACGTTGT TCTGCGAGGC CGGGCAAGCC GAGATCAGCA ACGCGCTGGC CGAGAACGCT CTGCGCTGCG TGAGCCTTGG GAGGAAAAAC TTCCTGTTTG CCGGCTCCGA CAGCGGAGGC GAGCGGGCCG CCGCAATGTA CAGTCTGCTT GGGACATGCA AGCTCAGCGG TATCAACCCG CGCGCCTATC TGGAATACGT CCTGACCTAC ATTGCTGATC ACGTCGCCAA TCGCGTTGAC GAACTGTTGC CCTGGAACGT GGCTGACAAA CTGAGGCTAG CCACTCCGCC CAAGGCGAAC ATCTGA
|
Protein sequence | MPNLDDLPDD VAALKAMLAE ARASAIEREL EIEQLRREIA ESDLEIARLK LLIDKLKRMQ FGRKSEQLAR EIERLELRLE DLSAGSSVAD VQHAKVRREK PATGGESSAR EPLPPHLPRE DRVLKPDSIC PKCDGTMQSL GEDVSEQLAR VAAMFKVIRT IRHKMVCPSC GHIEQPSMPG LPIERSIAHP SLLADILVSK YADHAPLYRQ SEIAARDGVT LDRASMGRWV GQCEALCRPL TDALRRYTMA GTKLHADDTP IPVLAPGNKK TKTGRLWVYV RDDSRSGSTE PAAVWFAYSP DRKGIHPQTH LAGFEGILQA DAYGGFDELY VNGKICEAAC WDHARRKYYE VHASTPTDET KSLLEMIGEL YSIEADIRGK PPDERKRVRH EKSKPLLEAF EARIRGKLAT LSRKSELAGA IQYSLNHWNA LTLFCEAGQA EISNALAENA LRCVSLGRKN FLFAGSDSGG ERAAAMYSLL GTCKLSGINP RAYLEYVLTY IADHVANRVD ELLPWNVADK LRLATPPKAN I
|
| |
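
The coordinate, length, and GC content fields in this record are related by simple arithmetic. The following is a minimal Python sketch, not part of the original record, of how they could be cross-checked. The function names are hypothetical, and the sketch assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon, which is consistent with the values listed here (1596 bp, 531 aa).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2004236, 2005831)
print(length, protein_length(length))  # prints "1596 531"
```

Passing the full text of the Gene sequence field to `gc_percent` should give a value that can be compared against the reported GC content; the space-separated 10-base blocks do not need to be joined first.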