Gene Information |
Locus tag | BTH_I2054 |
Symbol | |
ID | 3848389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 2327068 |
End bp | 2328288 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637841723 |
Product | hypothetical protein |
Protein accession | YP_442578 |
Protein GI | 83720526 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
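The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon while the protein length does not. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon that
    counts toward the gene length but not toward the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(2327068, 2328288)
print(gene_len, protein_len)  # 1221 406
```

Under these assumptions the result agrees with the listed gene length of 1221 bp and protein length of 406 aa.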
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.594988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTCGC TGATCACGGC CGCGGCGCGC GCGCTCGCGG CGGGTGATCC GCTCGGCGCG CTGAACCGCG TCGCGCTGCG CGACGATGCG CCGGCGCTCG CGCTGCGCGG CGTCGCGATG GCGCAGCTCG GCGATTTCGC GCGGGCGAAG GCGCTCGTGC GGCGCGCGGC CCGCGCCTTC GGCCCGAACG AGGCGCTCGC CCGTGCGCGA TGCGTCGTCG CAGAAGCCGA GATCGCGCTC GCGGCGCGCG AGCTCGGCTG GCCCGAGCAA GCGCTCGACG CGGCGCGCGC GACGCTCGAC GCGCACGGCG ACCGCGTCAA CGCCGCGCAT GCGCGGTATC TCGCGGTCCG TCGCCTGTTG CTGATCGGCC GCGTCGGCGA AGCCGAGCGC AGGCTCGCCA CGCTCGATTT CGCCGCCTGC CCGCCCGCGC TGCGGGCCGC GCACGAGTTG ATCGTCGCGG GCATCGCGCT GCGTCGCATC GAGACGAAGC CCGCGCGCGC GGCGCTCGCC CGCGCCGAAC GCGCGGCGCG CGACGCCGGC ATTCCCGCGC TCGCCGCCGA AGTCGAGCAT GCGTTTCGCG CCCTCGACGC GCCGGCCGCG CGCCTCGTCG CGTGCGGCGA AACACGCGCG CTGCGGCTCG ACGAAGTCGA GGCGTGGCGC GCATCGGCAT CGCTCGTCGT CGACGCGTGC CGCCACGTCG TGCACGATGC GCGCACGACG GTCTCGCTCG CGAAACGCCC CGTGCTGTTC GCGCTCGCGC GCGCGCTCGG CGAAGCGTGG CCTGGCGACG TCCCGCGCGA GGCGCTCGTC GCCCGTGCGT TCCGCGCGAA GCACATGGAC GAATCGCACC GCGCGCGGCT GCGCGTCGAG ATCGGACGGC TGCGCGCGCT GCTCGGCGAA CTGGCCGACA TTCGCGCGAC GAAGCGAGGA TTCGCGCTGA CGCCGCGCGA AGCGCGCGAT GTGGCCGTGC TCACGCACCT CGTCGAAGAC GCGCACGCGG CGGTGCTCGC CCTCCTCGCC GACGGCGAGT CGTGGTCGAG TTCCGCGCTT GCGCTCGCGC TCGGCGCAAG CCAGCGCACC GTGCAACGCT CGCTCGACGC GCTCGCGGCG GCCGGCAACG CGCAGTCGTT CGGCCGCGGC CGCGCACGCC GCTGGACGAC CCCGCCCGCG CCGGGATTCG CGACGATCTT GTTACTCCCG GCCCCGCTGC CGGGCGATTA G
|
Protein sequence | MDSLITAAAR ALAAGDPLGA LNRVALRDDA PALALRGVAM AQLGDFARAK ALVRRAARAF GPNEALARAR CVVAEAEIAL AARELGWPEQ ALDAARATLD AHGDRVNAAH ARYLAVRRLL LIGRVGEAER RLATLDFAAC PPALRAAHEL IVAGIALRRI ETKPARAALA RAERAARDAG IPALAAEVEH AFRALDAPAA RLVACGETRA LRLDEVEAWR ASASLVVDAC RHVVHDARTT VSLAKRPVLF ALARALGEAW PGDVPREALV ARAFRAKHMD ESHRARLRVE IGRLRALLGE LADIRATKRG FALTPREARD VAVLTHLVED AHAAVLALLA DGESWSSSAL ALALGASQRT VQRSLDALAA AGNAQSFGRG RARRWTTPPA PGFATILLLP APLPGD
|
| |
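The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly even though the gene lies on the minus strand. The sketch below, which assumes nothing beyond the Python standard library, strips the 10-base block spacing, computes GC content, and translates codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGACTCGC TGATCACGGC CGCGGCGCGC"))  # MDSLITAAAR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.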