Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A6507 |
Symbol | |
ID | 3751742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 3679660 |
End bp | 3680619 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637764830 |
Product | DNA-3-methyladenine glycosylase II |
Protein accession | YP_370745 |
Protein GI | 78067976 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000337261 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0896284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCATCG CAAAAACAAC GATTACGTCC ATCGTGAACG GCGGCCAGGT GCCGCCGTCC GCGCAAATGG AGCTGCGCTT CAATGCGCCT TACGACTGGG CGCGCGTGCT GCGCTTCTTC AGCGGGCGCG CGATTCCGGG CGTCGAGCAG GTGGCGGACG GCGTGTATCG ACGCATCGTC GACCTGCACG GCGACGCCGG CCGGCTCACG GTCGCGAAAC ATCCGCGCAA GCATTGCCTG GTCGCGACGG TCGAAGGCGC GGTCGCGCGT CATGTCGATG ACGCGTTCGC CGCGCGTGTC GCGACGATGT TCGACCTCGG CGCCGATCCG GCCGCCATCG GCAGCGGGCT CGCGCGCGAT CCGTGGTTCG CGCCGCTCGT CGAGGCCGCG CCGGGGCTGC GCGTGCCGGG CGCATGGTCG GGGTTCGAAC TGGCCGTGCG CGCGATCGTC GGCCAGCAGG TGAGCGTGAA GGCCGCGACG ACGATCGTCG GCCGGCTCGT CGAGCGGGCC GGAGAGCGAG TGGTTCATGA AGACGACGGC GCGCCTGCGT GGCGCTTTCC GACGCCCGAT GCGCTGGCCG CCTGTGACCT CGACAAGATC GGGATGCCCG GCAAGCGGGT GGCGGCACTG ACGGGCGTGG CGCGCGCGGT GGCGGCCGGC GACGTGCCGG TCGATCGCGA GCACGCCGAT CTCGCGACGC TGCGCAGTGC GTGGCTCGAT CTGCCCGGCA TCGGCCCGTG GACCGTCGAA TACATCGCGA TGCGCGCATG GCGCGATCCG GACGCATGGC CGGCCTCCGA TCTCGTGCTG ATGCAGTCGA TTACCGCGCG CGATCCCACG CTCGACCGGC TCGCGACCCA GAAGCACCGC ACTGAGGGCT GGCGGCCGTG GCGCGCGTAT GCGGCGCTGC ATTTGTGGAA TGAAGTGGCC GACCGGGCGG GCGGTGCGCG CGGCGGGTGA
|
Protein sequence | MSIAKTTITS IVNGGQVPPS AQMELRFNAP YDWARVLRFF SGRAIPGVEQ VADGVYRRIV DLHGDAGRLT VAKHPRKHCL VATVEGAVAR HVDDAFAARV ATMFDLGADP AAIGSGLARD PWFAPLVEAA PGLRVPGAWS GFELAVRAIV GQQVSVKAAT TIVGRLVERA GERVVHEDDG APAWRFPTPD ALAACDLDKI GMPGKRVAAL TGVARAVAAG DVPVDREHAD LATLRSAWLD LPGIGPWTVE YIAMRAWRDP DAWPASDLVL MQSITARDPT LDRLATQKHR TEGWRPWRAY AALHLWNEVA DRAGGARGG
|
| |