Gene BTH_I1034 details


Gene Information

Locus tag: BTH_I1034
Symbol:
ID: 3846778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007651
Strand:
Start bp: 1175761
End bp: 1177671
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 71%
IMG OID: 637840706
Product: hypothetical protein
Protein accession: YP_441588
Protein GI: 83719081
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
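
The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop encodes one residue.
    return length_bp // 3 - 1


length = gene_length_bp(1175761, 1177671)
print(length, protein_length_aa(length))  # prints: 1911 636
```

Under these assumptions the computed values match the Gene Length (1911 bp) and Protein Length (636 aa) fields.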


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.509069
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGGAA ATGCCTCGCC CGGCGGCCGG CGCACGCCCG CTTCGGCGCA TCGCGCCGGC 
TCGCCCGACA ATCACTCGCA CTCGCCGCCC GCCGTGGCGC TCGCGGCCGC CGCGCCCGGC
GCCGCGGGAA CGGCCGGCGA GCGCGCCGCC GGATCGGGCG GCGCGGATGC AATCCGCACG
CCGCTCGCCG GATTGCGCGT GTGGCTCGTC GCCGCGGCCG TGCTTTGCGC GTACCTGTTG
CCGGGCATCC TCGGCCACGA TCCGTGGAAG CAGGATGAAA CCTACACGTT CGGCATCATC
CAACACATGC TCGAAAGCGG CGACTTCGTC GTGCCGACCA ACGCGGGGCA GCCGTTCCTC
GAAAAACCAC CGCTGTACGA CTGGGTCGCC GCCGGCCTCG CGTGGCTCTT TTCCCGCTAC
CTGCCGCTGC ACGACGCCGC ACGGCTCGCG AGCGCCCTCT TCGCCGCGCT CGCGTTCGGC
TTCACCGCGC GCGCCGCGCG CATCGCGACC GGCGCCGCGC GCTGGCTCGA ACTGCCGGTG
ATCGGCACCG TCGCGCTGTG CGCGGGCTCG CTCGTCGTTA TCAAGCATTC TCACGACCTG
ATGACCGACG TCGCGCTGAT GGCGGGCACC GCGATGGGCT TTTGCGGGCT GCTCGAACTC
GTGATCCGGC ACGCCGGCGG CGCGAGCCGC GCCGCCCCCG GCCGGCAGCC CGCGAGCCGC
TGCGCGGCCC CCCTGTTCGG GCTGGGCGTC GGCGTCGCGC TGATGTCGAA GGGCCTGTTC
GTGCCGCTCG TGTTCGGCGC GACGCTCGCC GCAACGCTCG TTCTCTACCC GACCTGCCGC
AGCCGCGCGT TCTTCCGCTC GCTCGCGATC GCCGCGCTCG TGTGCGCGCC GTTCGCGCTG
ATCTGGCCGA CCGCGCTGTT CCTGCGCTCC GAATCGCTGT TCCTCGTCTG GTTCTGGGAA
AACAACGTCG GCCGCTTCTT CGGTTTCTCG GTGCCGACGC TCGGCGCCGA AAACGACAAG
CCGCTCTTCA TCTGGCGCGC GCTGCTGACG CTCGGCTTTC CGGTCGCCCC GCTCGCGCTC
GTCGCGCTCG CGCGCAGCCT CTGGCGCGAC TGGCGCGCGC CGCACGTCGC GCTGCCGCTC
GCGTTCGCGG GCGTCGGGAT GGTCGTGCTG CACATCTCAG CGACGTCGCG CCAGTTGTAC
ATCCTGCCGT TCATCGCGCC GCTCGCGCTC GTCGCCGCGC AAGCGATCCC GCGCCTGCCG
CAGCGACTGC ATACCGCGTG GGACCATGCG AGCCGGCTGC TGTTCGGCAC GGCCGCGGCG
CTCGTGTGGA TCGTCTGGTC GCTGATGTCC GATCGCAACG GTCCGCGCGT CGGCTTGCAA
TGGCTCGGCC GCTGGCTGCC GCTCGACTGG ACGATGCCGA TCGAGCCCGC GCTCGTGCTG
TCCGCGCTCG CGATCACGAT CGGCTGGGTT GGCCTGATGC CGTCGCTGCG GCTTGCGGGC
AAGTGGCGCG GCGCGCTGTC GTGGGCGATG GGCGCGCTCG TCGCGTGGGG GCTCGTCTAC
ACGCTGCTGC TGCCGTGGCT CGACGTCGCG AAGAGCTATC GTTCGGTGTT CGAAGATTTG
AATCGCCGGC TCGCGCTCGA ATGGAACGAC GGCGACTGCA TGGCGAGCGT CAATCTCGGC
GAATCGGAAG CGCCGATGCT CTACTACTTC TCCGGCGTGC TGCACCAGCC CGTCGTCCGG
CCGAACGCGA GCGCCTGCAC GTGGCTCATC GTGCAGGGCA CGCGTGCGAA CCCGCCCGCG
CTCGACGTCG AATGGAAGCC CTTCTGGGCA GGCGCCCGGC CGGGCGACGA TCAGGAAATG
CTGCGCGTCT ACGTGCGCAC GCCGGCCGCG GCCGCCATCG CCCGTCCTTG A
 
Protein sequence
MQGNASPGGR RTPASAHRAG SPDNHSHSPP AVALAAAAPG AAGTAGERAA GSGGADAIRT 
PLAGLRVWLV AAAVLCAYLL PGILGHDPWK QDETYTFGII QHMLESGDFV VPTNAGQPFL
EKPPLYDWVA AGLAWLFSRY LPLHDAARLA SALFAALAFG FTARAARIAT GAARWLELPV
IGTVALCAGS LVVIKHSHDL MTDVALMAGT AMGFCGLLEL VIRHAGGASR AAPGRQPASR
CAAPLFGLGV GVALMSKGLF VPLVFGATLA ATLVLYPTCR SRAFFRSLAI AALVCAPFAL
IWPTALFLRS ESLFLVWFWE NNVGRFFGFS VPTLGAENDK PLFIWRALLT LGFPVAPLAL
VALARSLWRD WRAPHVALPL AFAGVGMVVL HISATSRQLY ILPFIAPLAL VAAQAIPRLP
QRLHTAWDHA SRLLFGTAAA LVWIVWSLMS DRNGPRVGLQ WLGRWLPLDW TMPIEPALVL
SALAITIGWV GLMPSLRLAG KWRGALSWAM GALVAWGLVY TLLLPWLDVA KSYRSVFEDL
NRRLALEWND GDCMASVNLG ESEAPMLYYF SGVLHQPVVR PNASACTWLI VQGTRANPPA
LDVEWKPFWA GARPGDDQEM LRVYVRTPAA AAIARP
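
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. This is an illustrative example only, assuming the codon assignments of NCBI translation table 11 (which are the same as the standard code; table 11 differs in its permitted start codons) and that the 10-character blocks in the listing are purely cosmetic. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base blocks and line breaks) is removed first.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence shown above.
first_line = "ATGCAGGGAA ATGCCTCGCC CGGCGGCCGG CGCACGCCCG CTTCGGCGCA TCGCGCCGGC"
print(translate_cds(first_line))  # prints: MQGNASPGGRRTPASAHRAG
```

The 20 residues produced from the first 60 bases match the start of the protein sequence (MQGNASPGGR RTPASAHRAG). Passing the full gene sequence to the same function would allow the whole protein listing to be compared.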