Gene BURPS1106A_A2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2481 
Symbol 
ID4903636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2443023 
End bp2445242 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content68% 
IMG OID640145585 
Productchain length determinant protein 
Protein accessionYP_001076512 
Protein GI126457439 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGTGAAGAC CGACGAGGAA 
GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC
ATCGCGTTGG TCGTGGTCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGCTCTAT
TCGGCCGATG CGCAGGTGCG GGTGGAGGCG AGCGACAACA CGTCGCAGGC GCTTACGCAG
ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCCACCGA TGCGGAAATC
GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGCCG AGCAGTTCAA GCTGAACGCG
TCGGTCACGC CGAACACGTT GCCGATTCTC GGCGCGATCG CCGCGCGGCT CGCGACGCCG
GGCCATCCGG GCAAACCGTG GCTCGGCTTG TCGTCGTACG CGTGGGGCGG CGAGGAGGCG
AGCATCGATT CGATCGACGT GACGCCCGCG CTCGAAGGCA AGCAGCTCAC GCTCACGGCC
GGCGCGGACG GCGGCTATGC GCTCGCCGAT CCGGACGGCG AGGTGCTCGT GCGCGGCAAG
GTCGGCGAGC GCGAGCAGGG CGGCGGCGTG ACGATCAACG TCTCGAAGCT CGTCGCGCGC
CCCGGCACGC GCTTCACGGT GGTCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG
TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACCGGCG TGATCCAGAT CTCGCTCGAA
GGCAAGGATC CCGAACAGAC CGCGCAGATC GCGAACGCGC TCGCGCAGTC GTATCTGCAT
CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA
GAGCCGCGCC TGAAATCGGA CCTCGAGCGC GCGGAGGCGG AGCTCACCCA GTATCAGCGC
ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAGGTCT ACCTCGAAGG CAGCGTCCAG
TACGAGCAGC AGGTCGCCGC GCAGAGGCTG CAGCTCGCGG CGCTCGCGCA GCGCTACACG
GACGAGCATC CGCTCGTCGT CGCGGCGAAG CAGCAGCTCG GCCAGCTCGA GGCGGAGCGC
GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC TGTCGCGTTG
CAGCGCAACG CGAAGGTTGC GGAAGACATC TACGTGCTGC TGCTCAACCG TGTGCAGGAG
CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC
CCGGGCGTGC CGGTCAAGCC GAAGAAGGTG CTGATCCTGT CGGCGGCGAC GCTGCTCGGC
CTGATCCTCG GCACGAGCGT CGTGTTCCTG CGCCGCAACC TGTTCCATGG CATCGAGGAT
CCGGATCGCG TCGAACGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG
GAGCAGGCGC GATTCGATGC CGCCGACAAG GGCAATCGCG TGCGGCCGAT TCTCGCGTGC
GCGCGGCCGA AGGATCTGAG CGTCGAAAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC
GCGCTGATGG ATGCGAAGAA CCGCGTGATC GTGCTGACCG GACCGACCCC CGGCATCGGC
AAGAGCTTTC TCGCGGTCAA CCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGCGTGCTG
CTGATCGACG CGGACATGCG GCGCGGCTCG CTCGATCGCC ACTTCGGCAC CGGGGGAAGG
CGCGGCCTGT CGGAATTGCT GAGCGATCAG GTCGCGCTCG AAGAGGCGAT TCGCGAAACG
TCGGTGCCGG GGCTGTCGTT CATCCCGAGC GGCGCGCGCC CGCCGAATCC GTCGGAGCTG
CTGATGTCGC CGCGCCTGTC GCAATACCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG
ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA CGATCTTCGG CGAACTCGCC
GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AAGGCGAGAT CGGCGACGCG
ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GGATCTTCAA CGGCGTGCCG
GCGCGCACGC GCGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
 
Protein sequence
MVNTQAKHPY ADLAVKTDEE DVVLGQMIQV ILDDIWLLLG IALVVVALAG LYCYVAKPLY 
SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVAEQFKLNA
SVTPNTLPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA SIDSIDVTPA LEGKQLTLTA
GADGGYALAD PDGEVLVRGK VGEREQGGGV TINVSKLVAR PGTRFTVVRQ NDLDAITAFQ
SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE
EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQVAAQRL QLAALAQRYT
DEHPLVVAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE
LSVQKAGTGG NIRLVDAALR PGVPVKPKKV LILSAATLLG LILGTSVVFL RRNLFHGIED
PDRVERAFNL PLYGLVPMSA EQARFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF
ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGS LDRHFGTGGR
RGLSELLSDQ VALEEAIRET SVPGLSFIPS GARPPNPSEL LMSPRLSQYL DGLAKRYDMV
IVDSPPILAV TDATIFGELA GSTFLVLRSG MHTEGEIGDA IKRLRTAGVQ LQGGIFNGVP
ARTRGYGRGY AAVHEYLSA