Gene Information |
Locus tag | BTH_I1095 |
Symbol | |
ID | 3848638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1235758 |
End bp | 1236975 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637840767 |
Product | hypothetical protein |
Protein accession | YP_441648 |
Protein GI | 83719508 |
COG category | |
COG ID | |
TIGRFAM ID | |
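The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record above.
length = gene_length(1235758, 1236975)
print(length, protein_length(length))  # prints "1218 405"
```

Under these assumptions the computed values match the listed Gene Length (1218 bp) and Protein Length (405 aa).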
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000000793055 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCAGC GAACCGCTAA ACGCCTCCCT CCCGATGCCG ACAAACTGGT CGGTCTGTCG CTCGCGCTGT TTGCGTCCGG CAGCCGCGTC GAAGATCGCT TCTGGGAAGC CAAGCTCGAC GCCTTGCTCG CAAAGATCGT CCGCAATGGC AACCAGACCA CGCTCGACGC CGCGCTCGAT CATCTCCAGC AGAATCACCC GGACGCCTAC GGCGCCCTCG CCGACATGGC CGAGACGCAC AGCGAATCCC TGTCTGTCGA GCATGACGGC AAGCCGTACG AAGCGCTCCT CGTCGCGATC CCCGTACTCG CGTGGACCCG GTACATGATT CCGTCCGGCG CGCTCAAGAC CGAGATCGCC GACGTGTTGC GTACCCACTT GCAAGCGCAC GTGCTCGCGC AAGGCACGCT CGTCGCGATG GCGCCCTTTC TCTACAGCAT CGATCAACTG CCGCGCCATC ATGTCGAAAC TTACCGCCTC GCGCAGCAGC TCGCGCATGC GGCGCTCGGC AACCATTCGG TCAAGCTCAA TTACGGCGAC CTGCCCGAGA CGTCGCCGAT TCTGGCCGAC CCGCGCTTCC TGCTCGCCGT CGTCGCCGCA CCCGCCGGCA CCCCGCTCTT TCGCTGGCAA GAGGAGGAGC ACGGCTCACG GATCGAGCGC GGCCAATGCC TTGAGCAGTG GGCCGCACAG GGCGGCGTCA ACCTGTCCGC CGCGCTCCCC GGCTGCGAAT TCGAGTGCCT GCTGCCCGAC GCGTATTACT CCGCGTGCCG CGATGCCGAC GAGCGAATCC GTCCGCACAC CGTACGCACC GCGATCCGCT ATCTGTTCGA CACGATCGGC GCCGCGCCGC AAGAACTGCG GGCCGTGATC GCGGGCTTCG GCGAGCACCG GATCGACGAA TATCGCGTCG CATTCACTCG CCGCGGAAGC AACGACGTCA TCTATGGCGT CGTCTGGCCG CTCTACGGCC GCGAGAATGG CGAGGCATCA GTCGACGAGG CGACGCTCGA AGCCGAGGCG CCCGCCGACG GACCGCTCGA AGAAATCGCC TCGCTGCTGA AGGAGGCGGG CGTGACCGAC ATCCGTCGCC ACGCCGGCCG ATTCGAGCCC GAATATTGCG ACGATTGCGG CGTTCCGTTG TACGCCGATC CGCTCGGCGA GATCGTCCAT GCGGAGATGC CGGAGGACGC GACGCCCGCG CAACCGCACT TCCATTAA
|
Protein sequence | MRQRTAKRLP PDADKLVGLS LALFASGSRV EDRFWEAKLD ALLAKIVRNG NQTTLDAALD HLQQNHPDAY GALADMAETH SESLSVEHDG KPYEALLVAI PVLAWTRYMI PSGALKTEIA DVLRTHLQAH VLAQGTLVAM APFLYSIDQL PRHHVETYRL AQQLAHAALG NHSVKLNYGD LPETSPILAD PRFLLAVVAA PAGTPLFRWQ EEEHGSRIER GQCLEQWAAQ GGVNLSAALP GCEFECLLPD AYYSACRDAD ERIRPHTVRT AIRYLFDTIG AAPQELRAVI AGFGEHRIDE YRVAFTRRGS NDVIYGVVWP LYGRENGEAS VDEATLEAEA PADGPLEEIA SLLKEAGVTD IRRHAGRFEP EYCDDCGVPL YADPLGEIVH AEMPEDATPA QPHFH
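The relationship between the two sequences above can be sketched as a codon-by-codon translation. This minimal example assumes the gene sequence is already given on the coding strand (it starts with ATG even though the record lists the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons. The helper name `translate` and the compact table layout are illustrative choices, not the database's own method. Alternative start codons permitted by table 11 are not handled, and ambiguous bases such as N raise an error.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG ordering of first, second, third base.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCCAGC GAACCGCTAA ACGCCTCCCT"))  # prints "MRQRTAKRLP"
```

The printed residues match the first block of the listed protein sequence. Applying the same function to the final 18 bases (`CAACCGCACT TCCATTAA`) yields `QPHFH`, which matches the end of the protein sequence.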