Gene BURPS1106A_A2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2139 
Symbol 
ID4904729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2090483 
End bp2092039 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content69% 
IMG OID640145244 
Producthypothetical protein 
Protein accessionYP_001076172 
Protein GI126457868 
COG category 
COG ID 
TIGRFAM ID[TIGR03368] cellulose synthase operon protein YhjU 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTCT GGAATCTGTA TTTCGTTCTG AAGCTCTATC TGTTCGCGGC GGGCCACTTG 
AAGCCGTTGT GGATCGCGAA TCTCGGTTTC GCGCTGGCGC TCGCGCTGAG CGCGCCGGCG
AGGCGGCGCA GCGTGCAGCT GCTGCGCCAC GCGCTCGCGC TGGCGCTCGC GGTGCCGCTG
ATGTATCGCG AAGCGGACGT GCCGCCACTC GCGCGGCTCG TCGAAACGCT CGGCGGCCTG
CGCGCGTTCA GCGCCGGCTA CTGGATGGAG CTCGTGCCGC GCTTCGTGCC GCCGATGCTC
GCATTGGCCG CGCTCGGCGT CGTGATCGGC TATCTGATCG TCAATCGCTG GCTGCGCGTG
GCGACGTTCG TGCTGCTCGC GCTGATCGCG CTGCCGGTGT GGCAGGCGGG CAGCGCGGCG
CTCGCGCGGC TCGACGCGGC TGCCGCGGCC GTGCCCGGGC CGGCCGGAAC GGGCCGCGCC
GTGCAGCCGC AGGATCACAA CGCGGCGCTC GCCGCGTTCC GCTCGCAGGA ATCGCAGCGG
CAGGTGACGT TCGGCCGGCC GAGCGCCGAT CCGGCGACGC AGTTCGACGT GATCGTGCTG
CATGTGTGCT CGCTGTCGTG GGACGACCTC GACGTCGCGA GGCTGCGCAA TCATCCGCTG
CTCGGCCATT TCGACTATCT GTTCACGAAT TTCAGCACGG CGGCGAGCTA CAGCGGCCCG
GCCGCGATCC GCGTGCTGCG CGCGAGCTGC GGGCAGGAGG CGCACGCGGA CCTGTACAAG
CCCGCGCCCG CGCAGTGCCA TCTGTTCGGG CAACTCGCGG CCGCCGGCTT CGCGCCGCAG
ACGCTGCTCA ACCACGACGG CCACTTCGAC AACTTTCTCC AGTTGATCCG CGAGAACATC
GGCGTGCCGA ACGCGCCGAT GATCCCGAAC GCGGACGCGC CCGTCGCGAT GCACGCGTTC
GACGGCTCGG CGATCAAGGA CGACTACGCG ACGCTCGCGA ACTGGTACGC GAAACGCGGC
GCGAGCCCCG GCCCCGTCGC GCTGTACTAC AACACGATCA GCCTGCACGA CGGCAATCAG
CTGACGGGCG GCCGGATGTC GAGCCTCGAT TCGTACCCGC TGCGCGCGCG CAAGCTGCTG
GACGACTTCG ACCGCTTCGC GGATCTCATT GCCGCATCGG GGCGGCGCGC GGTGATCGTG
TTCGTGCCCG AGCATGGCGC GGCGCTGCGC GGCGACGCGA AACAGGTGGC GGGGCTGCGC
GAGATTCCGA CGCCGCGGAT CGTGCACGGG CCGGTCGGCG TGAGGCTCGT CGGCTTCAAG
GGCGACCACG GCGCGACCAC CGTGATCGAC GCGCCGGCGA GCTTCCTCGC GCTCGCGCAA
CTGCTGGCGA ATCTCGTGTC GAGCAGCCCG TTCAAGCCGG GCGTGACGCT GTCGCAATAC
GCGGCCGATC TGCCGCAGAC GCGAATGATC GGCGAGAACG AGGGCACGGT GACGATGACG
ACGCCGACGG GCTACGCGGT GAAGACGCCG GACGGCGTAT GGATCGACGA AAAATGA
 
Protein sequence
MTFWNLYFVL KLYLFAAGHL KPLWIANLGF ALALALSAPA RRRSVQLLRH ALALALAVPL 
MYREADVPPL ARLVETLGGL RAFSAGYWME LVPRFVPPML ALAALGVVIG YLIVNRWLRV
ATFVLLALIA LPVWQAGSAA LARLDAAAAA VPGPAGTGRA VQPQDHNAAL AAFRSQESQR
QVTFGRPSAD PATQFDVIVL HVCSLSWDDL DVARLRNHPL LGHFDYLFTN FSTAASYSGP
AAIRVLRASC GQEAHADLYK PAPAQCHLFG QLAAAGFAPQ TLLNHDGHFD NFLQLIRENI
GVPNAPMIPN ADAPVAMHAF DGSAIKDDYA TLANWYAKRG ASPGPVALYY NTISLHDGNQ
LTGGRMSSLD SYPLRARKLL DDFDRFADLI AASGRRAVIV FVPEHGAALR GDAKQVAGLR
EIPTPRIVHG PVGVRLVGFK GDHGATTVID APASFLALAQ LLANLVSSSP FKPGVTLSQY
AADLPQTRMI GENEGTVTMT TPTGYAVKTP DGVWIDEK