Gene BTH_I1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I1039 
Symbol 
ID3847384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp1181288 
End bp1182322 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID637840711 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_441593 
Protein GI83718848 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.147415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA TGATCGTCAC CGATGCGTGG GAGCCGCAAG TCAACGGCGT CGTGCGCACG 
CTCAAGAGCA CCGCGCGCGA GCTCACCGCG CTCGGCCACC GCGTCGAGCT CGTCACGCCG
CTCGAATTCC GCACGGTGCC CTGCCCGACC TATCCCGAAA TCCGTCTGTC GATCCTGCCA
TACCGGCGGC TGCGCGAGCG CCTGAACGCG TTCGAGCCGG ACGCGCTGCA CATCGCGACG
GAAGGCCCGC TCGGCCTCGC CGCGCGCCGC TACGCGCGCG CGCGCAAGCT GCCGTTCACG
ACCGCGTACC ACACGCGCTT TCCCGAATAC GTGCAGGCGC GCTTCGGCGT GCCGCTCGCG
GCGACCTATC GCTTCCTGCG GTGGTTCCAC GGCGCGTCGC TCGCGGTGAT GGCGCCGACG
CCCGTCGTCA AGGACGACCT CGAGCAATTC GGCTTCGACA ACGTCGTGCT GTGGACGCGC
GGCGTCGATC TCGACATCTT CCGGCCGATG GAGTCGAAGG TGCTCAACAC CGCGCGGCCG
ATCTTCCTGT ATGTCGGCCG CGTCGCGATC GAGAAGAACG TCGAGGCGTT CCTGAAGCTC
GACCTGCCAG GCTCGAAATG GGTCGCGGGC GAAGGGCCTG CGCTCGCCGA GCTCAAATCG
CGCTATCCTG AGGCGAATTA CCTCGGCGTG CTGACGCAGG CGGAGCTCGC CAAGGTATAC
GCGGCGGCCG ACGTGTTCGT GTTCCCGAGC CGCACCGACA CGTTCGGTCT CGTGCTGCTC
GAGGCGCTCG CGTGCGGCAC GCCCGTCGCC GCCTATCCGG TGACGGGGCC CGTCGACGTG
CTCGGGAACG GCGGCGCCGG CGCGATGAAC GAGGACTTGC GCGAAGCGTG CCTCGAGGCG
CTGAAGATCG ATCGGCGGCA CGCGCGCGAG TGGGCCGAGC GTTTCTCGTG GCGCGCGGCG
TCCGAGCAGT TCGCGTCGCA CCTGAAGCCG CTGCCGAAAT CCGCCAGCCC ACATACCGAA
GGCGCAGCCG TTTGA
 
Protein sequence
MKIMIVTDAW EPQVNGVVRT LKSTARELTA LGHRVELVTP LEFRTVPCPT YPEIRLSILP 
YRRLRERLNA FEPDALHIAT EGPLGLAARR YARARKLPFT TAYHTRFPEY VQARFGVPLA
ATYRFLRWFH GASLAVMAPT PVVKDDLEQF GFDNVVLWTR GVDLDIFRPM ESKVLNTARP
IFLYVGRVAI EKNVEAFLKL DLPGSKWVAG EGPALAELKS RYPEANYLGV LTQAELAKVY
AAADVFVFPS RTDTFGLVLL EALACGTPVA AYPVTGPVDV LGNGGAGAMN EDLREACLEA
LKIDRRHARE WAERFSWRAA SEQFASHLKP LPKSASPHTE GAAV