Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2619 |
Symbol | |
ID | 4886725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2516358 |
End bp | 2518577 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132556 |
Product | chain length determinant protein |
Protein accession | YP_001063612 |
Protein GI | 126442327 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAATA CGCAAGCGAA ACATCCTTAT GCCGACCTCG CGGCGAAGAC CGACGAGGAA GACGTCGTCC TGGGCCAGAT GATCCAGGTG ATTCTCGACG ATATCTGGCT GCTCCTCGGC ATCGCGTTGG TCGTGGTCGC GCTCGCCGGG CTCTACTGCT ACGTCGCGAA GCCGCTCTAT TCGGCCGATG CGCAGGTGCG GGTGGAGGCG AGCGACAACA CGTCGCAGGC GCTTACGCAG ACGCAGACGG GCGCGATGAT CAACAGCGGG CCGCCGACGC CGCCCACCGA TGCGGAAATC GAGATCATCA AGAGCCGCGG CGTCGTCGCG CCGGTCGTCG AGCAGTTCAA GCTGAACGTG TCGGTCACGC CGAACACGTT GCCGATTCTC GGCGCGATCG CCGCGCGGCT CGCGACGCCG GGCCATCCGG GCAAACCGTG GCTCGGCTTG TCGTCGTACG CGTGGGGCGG CGAGGAGGCG AGCATCGATT CGATCGACGT GACGCCCGCG CTCGAAGGCA AGCAGCTCAC GCTCACGGCC GGCGCGGACG GCGGCTACGC GCTCGCCGAT CCGGACGGCG CGGTGCTCGT GCGCGGCAAG GTCGGCGAGC GCGAGCAGGG CGGCGGCGTG ACGATCAACG TCTCGAAGCT CGTCGCGCGC CCCGGCACGC GCTTCACGGT AGTCCGGCAG AACGATCTCG ATGCGATCAC CGCGTTCCAG TCGGCGATCC AGGTGGCCGA GCAGGGCAAG CAGACCGGCG TGATCCAGAT CTCGCTCGAA GGCAAGGACC CCGAACAGAC CGCGCAGATC GCGAACGCGC TCGCGCAGTC GTATCTGCAT CAGCACGTGA CGAGCAAGCA GGCCGAAGCG ACGAAGATGC TCGAGTTCCT GAAGAACGAA GAGCCGCGCC TGAAATCGGA CCTCGAGCGC GCGGAGGCGG AGCTCACCCA GTATCAGCGC ACGTCGGGCT CGATCAACGC GAGCGACGAA GCGAAAGTCT ACCTCGAAGG CAGCGTCCAG TACGAGCAGC AGGTCGCCGC GCAGCGGCTG CAGCTCGCGG CGCTCGCGCA GCGTTACACG GACGAGCATC CGCTCGTCGT CGCGGCGAAG CAGCAGCTCG GCCAGCTCGA GGCGGAGCGC GCGAAGTACG ACGGCAAGTT CCGCGGGCTG CCGGCGACCG AAGTCAAGGC TGTCGCGTTG CAGCGCAACG CGAAGGTCGC GGAAGACATC TACGTGCTGC TGCTCAACCG CGTGCAGGAG CTGTCGGTGC AGAAGGCCGG CACGGGCGGC AACATCCGCC TCGTCGATGC GGCGCTGCGC CCGGGCGTGC CGGTCAAGCC GAAGAAGGTG CTGGTCCTGT CGGCGGCGAC GCTGCTCGGC CTGATCCTCG GCACGAGCGT CGTGTTCCTG CGCCGCAACC TGTTCCATGG CATCGAGGAT CCGGATCGCG TCGAGCGCGC GTTCAACCTG CCGCTGTACG GCCTCGTGCC GATGAGCGCG GAGCAGGCGC GATTCGATGC CGCGGACAAG GGCAATCGCG TGCGGCCGAT TCTCGCGTGC GCGCGGCCGA AGGATCTGAG CGTCGAAAGC CTGCGCAGCC TGCGCACCGC GATGCAGTTC GCGCTGATGG ATGCGAAGAA CCGCGTGATC GTGCTGACCG GGCCGACCCC CGGCATCGGC AAGAGCTTTC TCGCGGTCAA CCTCGCCGCG CTCGTCGCGC ATTCGGGCAA GCGTGTGCTG CTGATCGACG CGGACATGCG GCGCGGCTCG CTCGATCGCC ACTTCGGCAC CGGGGGAAGG CGCGGCCTGT CGGAATTGCT GAGCGATCAG GTCGCGCTCG AAGAGGCGAT TCGCGAAACG TCGGTGCCGG GGCTGTCGTT CATCCCGAGC GGCGCGCGCC CGCCGAATCC GTCGGAGCTG CTGATGTCGC CGCGCCTGTC GCAATACCTC GACGGCCTCG CGAAGCGCTA CGACATGGTG ATCGTCGATT CGCCGCCGAT CCTCGCCGTC ACCGACGCGA CGATCTTCGG CGAACTCGCC GGCTCGACGT TCCTCGTGCT GCGCTCCGGC ATGCACACCG AAGGCGAGAT CGGCGACGCG ATCAAGCGGC TGCGCACCGC GGGCGTGCAA CTGCAAGGCG GGATCTTCAA CGGCGTGCCG GCGCGCACGC GCGGCTACGG CCGCGGCTAT GCGGCCGTGC ACGAATATCT GAGCGCATGA
|
Protein sequence | MVNTQAKHPY ADLAAKTDEE DVVLGQMIQV ILDDIWLLLG IALVVVALAG LYCYVAKPLY SADAQVRVEA SDNTSQALTQ TQTGAMINSG PPTPPTDAEI EIIKSRGVVA PVVEQFKLNV SVTPNTLPIL GAIAARLATP GHPGKPWLGL SSYAWGGEEA SIDSIDVTPA LEGKQLTLTA GADGGYALAD PDGAVLVRGK VGEREQGGGV TINVSKLVAR PGTRFTVVRQ NDLDAITAFQ SAIQVAEQGK QTGVIQISLE GKDPEQTAQI ANALAQSYLH QHVTSKQAEA TKMLEFLKNE EPRLKSDLER AEAELTQYQR TSGSINASDE AKVYLEGSVQ YEQQVAAQRL QLAALAQRYT DEHPLVVAAK QQLGQLEAER AKYDGKFRGL PATEVKAVAL QRNAKVAEDI YVLLLNRVQE LSVQKAGTGG NIRLVDAALR PGVPVKPKKV LVLSAATLLG LILGTSVVFL RRNLFHGIED PDRVERAFNL PLYGLVPMSA EQARFDAADK GNRVRPILAC ARPKDLSVES LRSLRTAMQF ALMDAKNRVI VLTGPTPGIG KSFLAVNLAA LVAHSGKRVL LIDADMRRGS LDRHFGTGGR RGLSELLSDQ VALEEAIRET SVPGLSFIPS GARPPNPSEL LMSPRLSQYL DGLAKRYDMV IVDSPPILAV TDATIFGELA GSTFLVLRSG MHTEGEIGDA IKRLRTAGVQ LQGGIFNGVP ARTRGYGRGY AAVHEYLSA
|
| |