Gene BURPS668_A2225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2225 
Symbol 
ID4887646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2151768 
End bp2153324 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content70% 
IMG OID640132162 
Producthypothetical protein 
Protein accessionYP_001063219 
Protein GI126443978 
COG category 
COG ID 
TIGRFAM ID[TIGR03368] cellulose synthase operon protein YhjU 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTTCT GGAATCTGTA TTTCATTCTG AAGCTCTATC TGTTCGCGGC GGGCCACTTG 
AAGCCGTTGT GGATCGCGAA TCTCGGTTTC GCGCTGGCGC TCGCGCTGAG CGCGCCGGCG
AGGCGGCGCG GCGTGCGGCT GCTGCGCCAC GCGCTCGCGC TGGCGCTCGC GGTGCCGCTG
ATGTATCGCG AAGCGGACGT GCCGCCACTC GCGCGGCTCG TCGAAACGCT CGGCGGCCTG
CGCGCGTTCA GCGCCGGCTA CTGGATGGAG CTCGTGCCGC GCTTCGTGCC GCCGATGCTC
GCATTGGCCG CGCTCGGCGT CGTGATCGGC TATCTGATCG TCAATCGCTG GCTGCGCGTG
GCGACGTTCG TGCTGCTCGC GCTGATCGCG CTGCCGGTGT GGCAGGCGGG CAGCGCGGCG
CTCGCGCGGC TCGACGCGGC TGCCGCGGCC GTGCCCGGGC CGGCCGGAAC GGGCCGCGCC
GTGCAGCCGC AGGATCACAA CGCGGCGCTC GCCGCGTTCC GCTCGCAGGA ATCGCAGCGG
CAGGTGACGT TCGGCCGGCC GAGCGCCGAT CCGGCGACGC AGTTCGACGT GATCGTGCTG
CATGTGTGCT CGCTGTCGTG GGACGACCTC GACGTCGCGA GGCTGCGCAA TCAGCCGCTG
CTCGGCCATT TCGACTATCT GTTCACGAAT TTCAGCACGG CGGCGAGCTA CAGCGGCCCG
GCCGCGATCC GCGTGCTGCG CGCGAGCTGC GGGCAGGAGG CGCACGCGGA CCTGTACAAG
CCCGCGCCCG CGCAGTGCCA TCTGTTCGGG CAACTCGCGG CCGCCGGCTT CGCGCCGCAG
ACGCTGCTCA ACCACGACGG CCACTTCGAC AACTTTCTCC AGTTGATCCG CGAGAACATC
GGCGTGCCGA ACGCGCCGAT GATCCCGAAC GCGGACGCGC CCGTCGCGAT GCACGCGTTC
GACGGCTCGG CGATCAAGGA CGACTACGCG ACGCTCGCGA ACTGGTACGC GAAACGCGGC
GCGAGCCCCG GCCCCGTCGC GCTGTACTAC AACACGATCA GCCTGCACGA CGGCAATCAG
CTGACGGGCG GCCGGATGTC GAGCCTCGAT TCGTACCCGC TGCGCGCGCG CAAGCTGCTG
GACGACTTCG ACCGCTTCGC GGATCTCATT GCCGCATCGG GGCGGCGCGC GGTGATCGTG
TTCGTGCCCG AGCACGGCGC GGCGCTGCGC GGCGACGCGA AACAGGTGGC GGGGCTGCGC
GAGATTCCGA CGCCGCGGAT CGTGCACGGG CCGGTCGGCG TGCGGCTCGT CGGCTTCAAG
GGCGACCACG GCGCGACCAC CGTGATCGAC GCGCCGGCGA GCTTCTTCGC GCTCGCGCAA
CTGCTGGCGA ATCTCGTGTC GAACAGCCCG TTCAAGCCGG GCGTGACGCT GTCGCAATAC
GCGGCCGATC TGCCGCAGAC GCGAATGATC GGCGAGAACG AGGGCACGGT GACGATGACG
ACGCCGACGG GCTACGCGGT GAAGACGCCG GACGGCGTAT GGATCGACGA AAAATGA
 
Protein sequence
MTFWNLYFIL KLYLFAAGHL KPLWIANLGF ALALALSAPA RRRGVRLLRH ALALALAVPL 
MYREADVPPL ARLVETLGGL RAFSAGYWME LVPRFVPPML ALAALGVVIG YLIVNRWLRV
ATFVLLALIA LPVWQAGSAA LARLDAAAAA VPGPAGTGRA VQPQDHNAAL AAFRSQESQR
QVTFGRPSAD PATQFDVIVL HVCSLSWDDL DVARLRNQPL LGHFDYLFTN FSTAASYSGP
AAIRVLRASC GQEAHADLYK PAPAQCHLFG QLAAAGFAPQ TLLNHDGHFD NFLQLIRENI
GVPNAPMIPN ADAPVAMHAF DGSAIKDDYA TLANWYAKRG ASPGPVALYY NTISLHDGNQ
LTGGRMSSLD SYPLRARKLL DDFDRFADLI AASGRRAVIV FVPEHGAALR GDAKQVAGLR
EIPTPRIVHG PVGVRLVGFK GDHGATTVID APASFFALAQ LLANLVSNSP FKPGVTLSQY
AADLPQTRMI GENEGTVTMT TPTGYAVKTP DGVWIDEK