Gene BURPS668_A2619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2619 
Symbol 
ID4886725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2516358 
End bp2518577 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content68% 
IMG OID640132556 
Productchain length determinant protein 
Protein accessionYP_001063612 
Protein GI126442327 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGCGAAGAC CGACGAGGAA 
GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC
ATCGCGTTGG TCGTGGTCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGCTCTAT
TCGGCCGATG CGCAGGTGCG GGTGGAGGCG AGCGACAACA CGTCGCAGGC GCTTACGCAG
ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCCACCGA TGCGGAAATC
GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGTCG AGCAGTTCAA GCTGAACGTG
TCGGTCACGC CGAACACGTT GCCGATTCTC GGCGCGATCG CCGCGCGGCT CGCGACGCCG
GGCCATCCGG GCAAACCGTG GCTCGGCTTG TCGTCGTACG CGTGGGGCGG CGAGGAGGCG
AGCATCGATT CGATCGACGT GACGCCCGCG CTCGAAGGCA AGCAGCTCAC GCTCACGGCC
GGCGCGGACG GCGGCTACGC GCTCGCCGAT CCGGACGGCG CGGTGCTCGT GCGCGGCAAG
GTCGGCGAGC GCGAGCAGGG CGGCGGCGTG ACGATCAACG TCTCGAAGCT CGTCGCGCGC
CCCGGCACGC GCTTCACGGT AGTCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG
TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACCGGCG TGATCCAGAT CTCGCTCGAA
GGCAAGGACC CCGAACAGAC CGCGCAGATC GCGAACGCGC TCGCGCAGTC GTATCTGCAT
CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA
GAGCCGCGCC TGAAATCGGA CCTCGAGCGC GCGGAGGCGG AGCTCACCCA GTATCAGCGC
ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAAGTCT ACCTCGAAGG CAGCGTCCAG
TACGAGCAGC AGGTCGCCGC GCAGCGGCTG CAGCTCGCGG CGCTCGCGCA GCGTTACACG
GACGAGCATC CGCTCGTCGT CGCGGCGAAG CAGCAGCTCG GCCAGCTCGA GGCGGAGCGC
GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC TGTCGCGTTG
CAGCGCAACG CGAAGGTCGC GGAAGACATC TACGTGCTGC TGCTCAACCG CGTGCAGGAG
CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC
CCGGGCGTGC CGGTCAAGCC GAAGAAGGTG CTGGTCCTGT CGGCGGCGAC GCTGCTCGGC
CTGATCCTCG GCACGAGCGT CGTGTTCCTG CGCCGCAACC TGTTCCATGG CATCGAGGAT
CCGGATCGCG TCGAGCGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG
GAGCAGGCGC GATTCGATGC CGCGGACAAG GGCAATCGCG TGCGGCCGAT TCTCGCGTGC
GCGCGGCCGA AGGATCTGAG CGTCGAAAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC
GCGCTGATGG ATGCGAAGAA CCGCGTGATC GTGCTGACCG GGCCGACCCC CGGCATCGGC
AAGAGCTTTC TCGCGGTCAA CCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGTGTGCTG
CTGATCGACG CGGACATGCG GCGCGGCTCG CTCGATCGCC ACTTCGGCAC CGGGGGAAGG
CGCGGCCTGT CGGAATTGCT GAGCGATCAG GTCGCGCTCG AAGAGGCGAT TCGCGAAACG
TCGGTGCCGG GGCTGTCGTT CATCCCGAGC GGCGCGCGCC CGCCGAATCC GTCGGAGCTG
CTGATGTCGC CGCGCCTGTC GCAATACCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG
ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA CGATCTTCGG CGAACTCGCC
GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AAGGCGAGAT CGGCGACGCG
ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GGATCTTCAA CGGCGTGCCG
GCGCGCACGC GCGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
 
Protein sequence
MVNTQAKHPY ADLAAKTDEE DVVLGQMIQV ILDDIWLLLG IALVVVALAG LYCYVAKPLY 
SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVVEQFKLNV
SVTPNTLPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA SIDSIDVTPA LEGKQLTLTA
GADGGYALAD PDGAVLVRGK VGEREQGGGV TINVSKLVAR PGTRFTVVRQ NDLDAITAFQ
SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE
EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQVAAQRL QLAALAQRYT
DEHPLVVAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE
LSVQKAGTGG NIRLVDAALR PGVPVKPKKV LVLSAATLLG LILGTSVVFL RRNLFHGIED
PDRVERAFNL PLYGLVPMSA EQARFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF
ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGS LDRHFGTGGR
RGLSELLSDQ VALEEAIRET SVPGLSFIPS GARPPNPSEL LMSPRLSQYL DGLAKRYDMV
IVDSPPILAV TDATIFGELA GSTFLVLRSG MHTEGEIGDA IKRLRTAGVQ LQGGIFNGVP
ARTRGYGRGY AAVHEYLSA