Gene Information |
Locus tag | BTH_I1034 |
Symbol | |
ID | 3846778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1175761 |
End bp | 1177671 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637840706 |
Product | hypothetical protein |
Protein accession | YP_441588 |
Protein GI | 83719081 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
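The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for BTH_I1034.
length_bp = gene_length(1175761, 1177671)
print(length_bp)                  # 1911
print(protein_length(length_bp))  # 636
```

Both results match the Gene Length (1911 bp) and Protein Length (636 aa) fields of the record.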
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.509069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGAA ATGCCTCGCC CGGCGGCCGG CGCACGCCCG CTTCGGCGCA TCGCGCCGGC TCGCCCGACA ATCACTCGCA CTCGCCGCCC GCCGTGGCGC TCGCGGCCGC CGCGCCCGGC GCCGCGGGAA CGGCCGGCGA GCGCGCCGCC GGATCGGGCG GCGCGGATGC AATCCGCACG CCGCTCGCCG GATTGCGCGT GTGGCTCGTC GCCGCGGCCG TGCTTTGCGC GTACCTGTTG CCGGGCATCC TCGGCCACGA TCCGTGGAAG CAGGATGAAA CCTACACGTT CGGCATCATC CAACACATGC TCGAAAGCGG CGACTTCGTC GTGCCGACCA ACGCGGGGCA GCCGTTCCTC GAAAAACCAC CGCTGTACGA CTGGGTCGCC GCCGGCCTCG CGTGGCTCTT TTCCCGCTAC CTGCCGCTGC ACGACGCCGC ACGGCTCGCG AGCGCCCTCT TCGCCGCGCT CGCGTTCGGC TTCACCGCGC GCGCCGCGCG CATCGCGACC GGCGCCGCGC GCTGGCTCGA ACTGCCGGTG ATCGGCACCG TCGCGCTGTG CGCGGGCTCG CTCGTCGTTA TCAAGCATTC TCACGACCTG ATGACCGACG TCGCGCTGAT GGCGGGCACC GCGATGGGCT TTTGCGGGCT GCTCGAACTC GTGATCCGGC ACGCCGGCGG CGCGAGCCGC GCCGCCCCCG GCCGGCAGCC CGCGAGCCGC TGCGCGGCCC CCCTGTTCGG GCTGGGCGTC GGCGTCGCGC TGATGTCGAA GGGCCTGTTC GTGCCGCTCG TGTTCGGCGC GACGCTCGCC GCAACGCTCG TTCTCTACCC GACCTGCCGC AGCCGCGCGT TCTTCCGCTC GCTCGCGATC GCCGCGCTCG TGTGCGCGCC GTTCGCGCTG ATCTGGCCGA CCGCGCTGTT CCTGCGCTCC GAATCGCTGT TCCTCGTCTG GTTCTGGGAA AACAACGTCG GCCGCTTCTT CGGTTTCTCG GTGCCGACGC TCGGCGCCGA AAACGACAAG CCGCTCTTCA TCTGGCGCGC GCTGCTGACG CTCGGCTTTC CGGTCGCCCC GCTCGCGCTC GTCGCGCTCG CGCGCAGCCT CTGGCGCGAC TGGCGCGCGC CGCACGTCGC GCTGCCGCTC GCGTTCGCGG GCGTCGGGAT GGTCGTGCTG CACATCTCAG CGACGTCGCG CCAGTTGTAC ATCCTGCCGT TCATCGCGCC GCTCGCGCTC GTCGCCGCGC AAGCGATCCC GCGCCTGCCG CAGCGACTGC ATACCGCGTG GGACCATGCG AGCCGGCTGC TGTTCGGCAC GGCCGCGGCG CTCGTGTGGA TCGTCTGGTC GCTGATGTCC GATCGCAACG GTCCGCGCGT CGGCTTGCAA TGGCTCGGCC GCTGGCTGCC GCTCGACTGG ACGATGCCGA TCGAGCCCGC GCTCGTGCTG TCCGCGCTCG CGATCACGAT CGGCTGGGTT GGCCTGATGC CGTCGCTGCG GCTTGCGGGC AAGTGGCGCG GCGCGCTGTC GTGGGCGATG GGCGCGCTCG TCGCGTGGGG GCTCGTCTAC ACGCTGCTGC TGCCGTGGCT CGACGTCGCG AAGAGCTATC GTTCGGTGTT CGAAGATTTG AATCGCCGGC TCGCGCTCGA ATGGAACGAC GGCGACTGCA TGGCGAGCGT CAATCTCGGC GAATCGGAAG CGCCGATGCT CTACTACTTC TCCGGCGTGC TGCACCAGCC CGTCGTCCGG CCGAACGCGA GCGCCTGCAC GTGGCTCATC GTGCAGGGCA CGCGTGCGAA CCCGCCCGCG 
CTCGACGTCG AATGGAAGCC CTTCTGGGCA GGCGCCCGGC CGGGCGACGA TCAGGAAATG CTGCGCGTCT ACGTGCGCAC GCCGGCCGCG GCCGCCATCG CCCGTCCTTG A
|
Protein sequence | MQGNASPGGR RTPASAHRAG SPDNHSHSPP AVALAAAAPG AAGTAGERAA GSGGADAIRT PLAGLRVWLV AAAVLCAYLL PGILGHDPWK QDETYTFGII QHMLESGDFV VPTNAGQPFL EKPPLYDWVA AGLAWLFSRY LPLHDAARLA SALFAALAFG FTARAARIAT GAARWLELPV IGTVALCAGS LVVIKHSHDL MTDVALMAGT AMGFCGLLEL VIRHAGGASR AAPGRQPASR CAAPLFGLGV GVALMSKGLF VPLVFGATLA ATLVLYPTCR SRAFFRSLAI AALVCAPFAL IWPTALFLRS ESLFLVWFWE NNVGRFFGFS VPTLGAENDK PLFIWRALLT LGFPVAPLAL VALARSLWRD WRAPHVALPL AFAGVGMVVL HISATSRQLY ILPFIAPLAL VAAQAIPRLP QRLHTAWDHA SRLLFGTAAA LVWIVWSLMS DRNGPRVGLQ WLGRWLPLDW TMPIEPALVL SALAITIGWV GLMPSLRLAG KWRGALSWAM GALVAWGLVY TLLLPWLDVA KSYRSVFEDL NRRLALEWND GDCMASVNLG ESEAPMLYYF SGVLHQPVVR PNASACTWLI VQGTRANPPA LDVEWKPFWA GARPGDDQEM LRVYVRTPAA AAIARP
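The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch rather than a reference implementation: it assumes the gene sequence is given 5' to 3' on the coding strand (as listed here, starting with ATG), strips the spaces that group the bases in tens, and uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCAGGGAA ATGCCTCGCC CGGCGGCCGG"))  # MQGNASPGGR
print(translate("CGTCCTTG A"))                        # RP
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_content` on the full sequence can be compared with the GC content field of the record.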
|
| |