Gene Information |
Locus tag | BTH_I2020 |
Symbol | |
ID | 3849130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2288594 |
End bp | 2289859 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637841689 |
Product | hypothetical protein |
Protein accession | YP_442544 |
Protein GI | 83721011 |
COG category | |
COG ID | |
TIGRFAM ID | |
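The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2288594, 2289859)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1266 421
```

Both values agree with the Gene Length and Protein Length rows of this record.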
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAGT TGACACTCGG GTTGATCGGC GCGGGCGCCG TCGTGGTGGG CGGCGTCGTG GTCTACAACG CGTGGCAGGG CGCGAAGGTG CGCCGCAAGA TGCCGCGCCC GATGCCGACC GAGGCGGCCG AGGCCGCCGC GCGGCACGAG CGCGATGACG ATGCGCCCTT CATCGAGCCC GTGCGGCAGC CGGCGCGCCG CGAGGCCGCA GCGGGCGGCG CGTCGGACGC GCGAAGCGAA GACGCGGTGC GCGTCGAGCC GACGTTCGGC GGCGCGGCGC CCGCCGATAT GCCGGCTGAC TTGCAGGCCG AGGCGACCAT CGCGAACGGC GCGGTTGTTG CGCCCGCCGC CGAGGAGACG GACGGCGAAG CGGCCGCGCC GGCTCCCGCG CACGACGAGC CGGTCGAACC GGTGCTGCCC GCCGCGACGA CGATTTCCGC GGCGCCGCCC GCTATCGTCG ATCGCCGCAT CGACTGCATC GTGCCGATCC GCCTCGCGAG CCCGCTTGCG GGCGACAAGA TCCTGCCCGC CGCGCAGCGG CTGCGCCGCG CGGGCAGCAA GCCGGTTCAC ATCGAGGGCA AGCCGGACGG CGGCGACGCA TGGGAGCTGC TGCAAAACGG CGTGCGCTAC GAAGAGCTGC GCGCGGCCGC GCAGCTCGCG AATCGCAGCG GTCCGCTCAA CGAACTCGAG TTCTCCGAAT TCGTGACGGG CGTCCAGCAG TTCGCGGACG CGATCGACGG CGCGCCGGAA TTCCCGGACA TGATGGAAAC GGTGTCGATG GCGCGCGAGC TCGACGGCTT CGCCGCGCAA TGCGACGCGC AGTTGTCGAT CAACGTGATG TCGGACGGCG CGCCGTGGTC GGCGAACTAC GTGCAGGCGG TCGCGTCGCA GGACGGGCTG CTGCTGTCGC GCGACGGCAC GCGCTTCGTG AAGCTCGACG CCAAGCAGAA CCCCGTCTTC ATGCTGCAGT TCGGCGACAC GAACTTCCTG CGGGACGATC TCACGTACAA GGGCGGCAAT CTGATCACGC TCGTGCTCGA CGTGCCCGTC GCCGACGAGG ACATTCTGCC GTTCAGACTG ATGTGCGACT ATGCGAAATC GCTGTCCGAG CGAATCGGCG CGCGCGTCGT CGACGATCAG CGCCGGCCGC TGCCCGAATC GACGCTGCTC GCGATCGAGC AGCAGTTGAT GAAGCTGTAC GCGCGGCTCG AGGAGGCGGG GATTCCGGCC GGCTCGCCCG TCACGCGGCG GCTGTTCAGC CAGTAA
|
Protein sequence | MDELTLGLIG AGAVVVGGVV VYNAWQGAKV RRKMPRPMPT EAAEAAARHE RDDDAPFIEP VRQPARREAA AGGASDARSE DAVRVEPTFG GAAPADMPAD LQAEATIANG AVVAPAAEET DGEAAAPAPA HDEPVEPVLP AATTISAAPP AIVDRRIDCI VPIRLASPLA GDKILPAAQR LRRAGSKPVH IEGKPDGGDA WELLQNGVRY EELRAAAQLA NRSGPLNELE FSEFVTGVQQ FADAIDGAPE FPDMMETVSM ARELDGFAAQ CDAQLSINVM SDGAPWSANY VQAVASQDGL LLSRDGTRFV KLDAKQNPVF MLQFGDTNFL RDDLTYKGGN LITLVLDVPV ADEDILPFRL MCDYAKSLSE RIGARVVDDQ RRPLPESTLL AIEQQLMKLY ARLEEAGIPA GSPVTRRLFS Q
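The relationship between the gene sequence, the protein sequence, and the GC content can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares; table 11 differs mainly in which start codons are permitted), it stops at the first stop codon, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGACGAGT TGACACTCGG GTTGATCGGC"))  # MDELTLGLIG
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the Protein sequence and GC content rows of this record.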