Gene BTH_I2003 details


Gene Information

Locus tag: BTH_I2003
Symbol:
ID: 3848868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 2266852
End bp: 2268153
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 74%
IMG OID: 637841672
Product: phage SPO1 DNA polymerase domain-containing protein
Protein accession: YP_442527
Protein GI: 83721489
COG category: [L] Replication, recombination and repair
COG ID: [COG1573] Uracil-DNA glycosylase
TIGRFAM ID: [TIGR00758] uracil-DNA glycosylase, family 4
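
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2266852
end_bp = 2268153

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1302 433
```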


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.262884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTGG CTGAAGCGGC GCTCGAAGAG CTGGGACTCG CGCCCATGTG GGTGCGGCGC 
GGCGCGGCGC GCGCCGGCGG CGTGAACGAG GATGCGACGG GCGCGGCGCG GGAGACCGAC
GTCGCGGCGG TTGCGCTAGG GGCGCAACCG GTGTCGGATG CGGCGCGCCG GATGTCGCAT
GACGGCGCGC AAGGCAGGGG GCGCGGTGGT GCGGGGGCGC CGGCCGCGTC GACGTCTGCG
GACGACGCAT CGGCCGACGG CGCCGCGCAC GCGGAATCGA TCGGGACCGC CGCGCTCGCC
GACGGTCGCG CGGCGCCTGC GCGGCAGGCT GCCGACGCGC GCGGCCAAGC GCGGCAAGCG
GCGGCCGAAT CCGGCATGCG GGCGGCGGAT GCGCCTGCGG CGTTCGAGTC GGGCGCGCGA
AATTCGCGGG ACTCGACGAT CGCGCGCGCG GCTTCGACGG TCGAGCCGGT CGCCGCGGGC
GGCGCGCGTC GCGTGCCGCC GCACGCCGCC GTCGCCGCCG CTGCCGCGGA ATCGGCGGCC
TTCGAGCAGG ATGCGGCGCA GCCGGCACGT TCGTCGGCGG CGTCCGCGCC GGCCGCACGA
ACCGGGGGCG ACGCCGGCGC GGCGGCAGCC GACGAAGACA TGTCGTGGTT CGATCTCGAG
CCGGGGGTCG AGCCCGCGCC GCCTGACGTC GCCGCGGAGC CCGCCGCTCG CGCGCCGTCC
GTCGCCGAGC TCGGCTGGGA CGAATTGCGC GCGCGCGTGG CCGACTGCGA GCGCTGCCGT
CTTTGCGAGA AGCGCACGAA CACGGTGTTC GGCGTCGGCG ACGAGCGCGC GGACTGGATG
CTCGTCGGCG AGGCGCCGGG CGAGAACGAG GACAAGCAGG GCGAGCCGTT CGTCGGCCAG
GCGGGCAAGC TGCTCGACAA CATGCTGCGC GCGCTCGCGC TCAAGCGCGG CGAGAACGTC
TATATCGCGA ACGTGATCAA GTGCCGGCCG CCCGGCAACC GCAATCCCGA GCCCGACGAG
GTCGCGCGCT GCGAGCCGTA TCTGCAGCGA CAAGTCGCGC TCGTGAAGCC GAAGCTGATC
GTCGCGCTCG GCCGCTTCGC CGCGCAGACG CTTCTCAAGA CGGACGGAAG CATCGCCTCG
ATGCGCGGGC GCGTGCACGA GTACGAAGGC GTGCCCGTGA TCGTCACGTA CCATCCGGCG
TATCTGCTGC GCAGCCTGCA GGACAAGGCG AAAGCCTGGT CCGATCTGTG CCTCGCGAAC
GATACCTACC GGAGTGCCGC GCCGGCCGCC GATCCGCAAT GA
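
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any statistic such as GC content can be recomputed. The following is a minimal sketch; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGCATTGG CTGAAGCGGC GCTCGAAGAG CTGGGACTCG CGCCCATGTG GGTGCGGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full sequence block into `clean_sequence` and passing the result to `gc_percent` gives a value that can be compared with the GC content reported in the record.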
 
Protein sequence
MALAEAALEE LGLAPMWVRR GAARAGGVNE DATGAARETD VAAVALGAQP VSDAARRMSH 
DGAQGRGRGG AGAPAASTSA DDASADGAAH AESIGTAALA DGRAAPARQA ADARGQARQA
AAESGMRAAD APAAFESGAR NSRDSTIARA ASTVEPVAAG GARRVPPHAA VAAAAAESAA
FEQDAAQPAR SSAASAPAAR TGGDAGAAAA DEDMSWFDLE PGVEPAPPDV AAEPAARAPS
VAELGWDELR ARVADCERCR LCEKRTNTVF GVGDERADWM LVGEAPGENE DKQGEPFVGQ
AGKLLDNMLR ALALKRGENV YIANVIKCRP PGNRNPEPDE VARCEPYLQR QVALVKPKLI
VALGRFAAQT LLKTDGSIAS MRGRVHEYEG VPVIVTYHPA YLLRSLQDKA KAWSDLCLAN
DTYRSAAPAA DPQ
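
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that in plain Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(seq):
    """Translate a cleaned, uppercase CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, with the spaces removed.
print(translate("ATGGCATTGGCTGAAGCGGCGCTCGAAGAG"))  # prints: MALAEAALEE
```

The ten residues printed match the start of the protein sequence above. Translating the full 1302 bp sequence the same way should yield a 433-residue protein if the record is internally consistent.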