Gene BURPS1106A_0881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0881 
Symbol 
ID4900371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp864234 
End bp865535 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID640134111 
Producthypothetical protein 
Protein accessionYP_001065162 
Protein GI126453437 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAG GACGTCGACA CTTCGTGCGC TCGGTTGCGA GCGCCTCGGC CGCGCTCGCG 
GCCGCCGCAT GGTCCCCGGC GCGCGCCGCA ATCGACGCGC CCGCCTCGCC CGCGACCGCG
CTGTCGCTCA CGCCCGGGCG CTGGTCGCCG AACAACGTCG CGCGGCTGCG CGCGGTGCTC
GCCGGGCACG GCGCGTCGAG CCCGCGCTAC CGCCCCGAGC ACCGCCCGTA CGCGGTGTTC
GACTGGGACA ACACGAGCAT CATGAACGAC TGCGAAGAAG CGCTGCTGAT GCACCAGATC
GACGGGCTGC ATTACCGGCT CACGCCCGAG CAGTTCTCGG CGATCCTGCG CCAGGGCGTG
CCCGACGGCC CGTTCGACGC GAAGCTCGGC TATACGAGCG TCGACGGCAA GCCCGTGCGG
ATGGAGGACA TCGCGGCCGA CGTCGACGCC GACTACCGGT GGCTGCATGC GAACTATCGC
GGCCTCGCGG GCGACAAGCC GCTCGACGAG ATCCACCGCA GCGAGCAGTT CCGGGATTTC
CGCGCGAAGC TGTACTTCAT GTACGACGCG ATCTGCGACA CGTATCCGGT CGAGATCGGC
TACAAGTGGA TCATGTACTG GTACGCGGGC ATGACGCGCG ACGAGTTGCA GGCGATGGCG
TTCGACAGCA ACGTCGCGAA CCTCGGCGAC GCGCTGCGCA AGGTGACCTA CGAAAGCTCG
CGCGCGCTGC CGGGCAAGGC GGGCGTCATC GCCGCGACGC ACTTCCACGG CATCCGCATC
CACGAGGAGA TCCGCGCGGT GATGGACACG CTGCGCTCGA ACGGCATCGA CGTGTACGTC
AGCACCGCAT CGCTCGACGA CGTCGTGCGC GTGTTCGCGG GCCATCCGGC GTTCGGCTAC
GGCGTGCCCG CCGAAAACGT GATCGGCATG CGGCTCACGA TGGCGGACGG CAAGTACATG
AACGAATACC TGCCGAACTG GCACTTCAAC TACGGGCCGG GCAAGACGGT CGGCATCCGC
CGCGAGCTCG AATCGAAGAA GGGCTACGGG CCGCTGCTCG TGTTCGGCGA CAGCGACGGC
GACGCGTGGA TGCTGCGCGA CTTCGCCGAT ACCGCGGTCG GCGTGATCGT CAACCGGATG
AAGAAAGGCG AGATCGGTAT CGACAGCCGC AAGGCGGCCG AGCAGATCGG CGCGAAGGAC
GCGCGGCTCG TGCTGCAAGG GCGCGACGAG AACACCGGGC TGATGGTCGC CGACGAGCGC
TCGATCAAGT ACGGCAAGCG CGATCCCAAA CTGCTCGCGT GA
 
Protein sequence
MKTGRRHFVR SVASASAALA AAAWSPARAA IDAPASPATA LSLTPGRWSP NNVARLRAVL 
AGHGASSPRY RPEHRPYAVF DWDNTSIMND CEEALLMHQI DGLHYRLTPE QFSAILRQGV
PDGPFDAKLG YTSVDGKPVR MEDIAADVDA DYRWLHANYR GLAGDKPLDE IHRSEQFRDF
RAKLYFMYDA ICDTYPVEIG YKWIMYWYAG MTRDELQAMA FDSNVANLGD ALRKVTYESS
RALPGKAGVI AATHFHGIRI HEEIRAVMDT LRSNGIDVYV STASLDDVVR VFAGHPAFGY
GVPAENVIGM RLTMADGKYM NEYLPNWHFN YGPGKTVGIR RELESKKGYG PLLVFGDSDG
DAWMLRDFAD TAVGVIVNRM KKGEIGIDSR KAAEQIGAKD ARLVLQGRDE NTGLMVADER
SIKYGKRDPK LLA