Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2850 |
Symbol | |
ID | 3849516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 3268223 |
End bp | 3270922 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637842518 |
Product | exodeoxyribonuclease V, alpha subunit |
Protein accession | YP_443362 |
Protein GI | 83719901 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member |
TIGRFAM ID | [TIGR01447] exodeoxyribonuclease V, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.934016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTGG TCGACGAACG TTTCGCATGG ACGGGCGGTC TGGCGGACCG CCTGCCCGAG CCGGCCGATT TCGGGCTCGC GCTTGCCGAA GGTTTCGCGC GCCGGATCGG GATGCTGTCG CGCCGCGTCG GCGCGAGCGC CGCGGCCGCG CGCTGGGCGG CGCGCGCGGC GTTCGCCGCG AGCCGCGCGA CGGCGGCGGG GCACGTCTGC GTGTCGCTGC GCGCGCTCGC GCAACGCTAC GACGAGCCGT TCGCCGACGT GCGCGACGCG CTCGCCGCGA GCGGCGTGAC CGCGTTCGAC AGCATCGCGC GCGGCGCCGA GCGTCCGCTC GTCGTCGACC GCGACGGCCG GCTGTATCTC GCGCGCTACT TCGAATACGA GACGCGGCTT GCGAGCGCGC TCGTCGCGCG CGCCCGGTGC GGCGGCGCGC AAGCGGGCGA TGCCGGCGAA TTGATGCCCG AGGCGCTCGG CGAACGGCTC GTCCGCTATT TCGGGCCGCA GAAGGAGCGG GGCGTCGACT GGCAGCGCGT CGCGGCGCTC GTCGCGCTCA CGGGCCGGGT GACGATCGTG AGCGGCGGGC CGGGCACCGG CAAGACGACG ACGGTCGTCG GCGTGATCGC GTGTCTGCTC GACGCGCACC CCGACTTGCG GATCGCGCTC GCCGCGCCGA CCGGCAAGGC GGCGCAGCGG ATGCAGGAGG CGCTGCACGC ACGCGCCGGC AGCCTGCCGG CCGAGCTCGC GGCGCGTCTG CCGCGCACGT CCTATACGCT GCACCGGCTG CTGGGCGGCG GTCCGGGCGG GCGCTTCGCG CATCATCGCG ACAACCCGTT GCCATACGAT CTCGTCGTCG TCGACGAGGC GTCGATGATC GACGTCGCGC TCGCCGCGCA CTTGCTCGAT GCGCTCGCGC CGAACGCGCG CCTCGTGCTG CTCGGCGACA AGGATCAGCT CGCGGCCGTC GAGGCGGGTG CGGTGTTTGC CGAGCTGAGC GCGCGTCCCG CGTTCAGCTC GGCGACGTGC GCGACGATTG CGCGCGCGCT CGGCGTCGGA GAGGCCGAAT TCGTGGCGGC GCTGCCGCAG GGGGGCGTTC TGCATGCGGC CGATGCGGCG AACGCGGGCG CGAGCGCGGA CGCACGCGCG ACTTCGGTCG ACACGGGCGC GGCGGCGCTG GCCGCACGAG CGACGGGTGC CGTCGCCGCG GCGAAGGTCG CGCCGGAAGC CCCGGCTCAG CGCGCGCCGG CGCCGCGCGC GGGCCGGCGC GGCGGCGGGC GCGAATCGAA ACGGGGTGTC GACGACGCGC AGGGATGGCT GTTCGCGTTC GAAGACGAAG ATGCGTTCAG CGCGGAGGCG TCAGGTGCCG GTGCGTCAGG CGCCGATGAG TCGGGCGCGG AGGCGTTGGG CGTCGGTACG TCGGGTGCCG ATGCGCAGGA TGTCGGCACG TCGAGCATCG ATCGCCGCGA CGATCGTGCA TCGCTTGATA CCCCGCGCCG CCGGCGTGAC GCGCTCGCCA CGCATGTTCG AGCGGATTCG CCGGGCGCGC GCGACATTGA AAACGCCGCG TACGTGCACG CACAGGCCGA TGAAGCCGGC GTCGCCGATT TGCCGTGGGA TGCGTCTTTC GCCGAGCCCG CCGCGTGGAT CGAGGCTGGC GAACTCGAGT GGCTCGACAA CGCGGACTTT TCGATATCCG AAGGCGCTTC GGCGAATGCC GGCTCCGATG AGCGGCGAAC CGCGACGGCA ACGGATGCAA GCGCCGGCGG CGCGCGCGCA CAGGCCGGCC GGGTGCCCGC CGCCCACGCC GGGCCGGATA TCGAACCGCA AGCGTCGGCA CCGGCGAACC CGGCCGGGGC CCCGTCACCG GACGCCGATG CCGCCGCCGC GCCGTTGACC GACTGCGTCA TCTGGCTCGA ACGCAATTAC CGCTTCGGCC TCGACTCGCC GATCGGCAGG CTGTCGCTCG CGATCCGCCG CGGCGCCGTG CAGGACGCGC TCGATGCGTT GTCGACGGCC GACGACGCGG CCGCGCGCTG GTGCGACGAC GGCGGCGCGA CGCTGTCGGC CGCGACGGTC GGGCAACTCG CGCGGGGCTT CGCCGGCTAC GCGGCCGCGT TGCGCGACGC GCTCGCGACG CCCGATCCCG ATCCGCTGCC GCTCTTCGAC GTGCTCAACC GCTTTCGCGT GCTCTGCGCG ACGCGAACGG GCGCGCGCGG CGCGGACGAG GTCAACGCGC TCGTCGCGGC CGAGGTGCGG CGAGCGGTGC GCGTGCCGCT CGCGCTCGGC GCGCACTGGT TCGCGGGGCG GCCCGTGATG GTGACGCGCA ACGATTACGC GCTCGGACTC TTCAATGGCG ACATCGGCAT TGCGCTGCCC GGCGCGCGTG GCGCGTTGCG CGTCTGGTTC CGCGGCGCGG ACGGCCGCGC GCGCGCGGTG TCGCCCGCCG CGCTGCCGCC GCACGATACG GCGTTCGCGC TGACCGTCCA CAAATCGCAG GGCTCGGAGT TCGACGACGC GGCGCTGGTC CTGCCCGCGT CGTTCAACCG TGTGCTGTCG CGCGAGCTTG TCTACACCGC GATCACGCGC GCGCGCTCGC GCGTGCGCGT GATCGGCGCG CGCGCGGTGC TCGCGCTCGC GATCGCGACG CGCACGGCGC GCGATTCCGG TCTCGCCGCA CGGATCGCCG ATGCGCTGCG CACACGATAG
|
Protein sequence | MSVVDERFAW TGGLADRLPE PADFGLALAE GFARRIGMLS RRVGASAAAA RWAARAAFAA SRATAAGHVC VSLRALAQRY DEPFADVRDA LAASGVTAFD SIARGAERPL VVDRDGRLYL ARYFEYETRL ASALVARARC GGAQAGDAGE LMPEALGERL VRYFGPQKER GVDWQRVAAL VALTGRVTIV SGGPGTGKTT TVVGVIACLL DAHPDLRIAL AAPTGKAAQR MQEALHARAG SLPAELAARL PRTSYTLHRL LGGGPGGRFA HHRDNPLPYD LVVVDEASMI DVALAAHLLD ALAPNARLVL LGDKDQLAAV EAGAVFAELS ARPAFSSATC ATIARALGVG EAEFVAALPQ GGVLHAADAA NAGASADARA TSVDTGAAAL AARATGAVAA AKVAPEAPAQ RAPAPRAGRR GGGRESKRGV DDAQGWLFAF EDEDAFSAEA SGAGASGADE SGAEALGVGT SGADAQDVGT SSIDRRDDRA SLDTPRRRRD ALATHVRADS PGARDIENAA YVHAQADEAG VADLPWDASF AEPAAWIEAG ELEWLDNADF SISEGASANA GSDERRTATA TDASAGGARA QAGRVPAAHA GPDIEPQASA PANPAGAPSP DADAAAAPLT DCVIWLERNY RFGLDSPIGR LSLAIRRGAV QDALDALSTA DDAAARWCDD GGATLSAATV GQLARGFAGY AAALRDALAT PDPDPLPLFD VLNRFRVLCA TRTGARGADE VNALVAAEVR RAVRVPLALG AHWFAGRPVM VTRNDYALGL FNGDIGIALP GARGALRVWF RGADGRARAV SPAALPPHDT AFALTVHKSQ GSEFDDAALV LPASFNRVLS RELVYTAITR ARSRVRVIGA RAVLALAIAT RTARDSGLAA RIADALRTR
|
| |