Gene BURPS1106A_A0947 details


Gene Information

Locus tag: BURPS1106A_A0947
Symbol:
ID: 4904412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 922772
End bp: 923863
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 68%
IMG OID: 640144053
Product: outer membrane porin
Protein accession: YP_001074983
Protein GI: 126458389
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
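
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS is complete, ending in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(922772, 923863)
print(length, protein_length_aa(length))
```

Under these assumptions the record is internally consistent: the coordinates give 1092 bp, and 1092 / 3 - 1 gives 363 aa.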


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.651765
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC ATCGCCGCCT TGCCGGCACG ACTGCGATCT CAGCGAGCCT CGCCGCGTGC 
GCGGGCCTCG CGTCCGGCCA CGCGCTCGCG CAATCGAGCG TCACGCTGTA CGGGATCATG
GACGCGGGCA TCGAATACGT GAACCACGCG GCGCCCGACG GCGGCGGCGC GTTCCGGATG
AAATCGGGCA ACAAGAACAC TTCGCGCTGG GGTCTGCGCG GCGTCGAGGA TCTCGGCGGC
GGGCTGAAGG CGGTGTTCCG GCTCGAAAGC GGGATCGATC TCGCGAACGG CGCGTTCGAC
GACGGCCCCG ACTCGATCTT CGCGCGGCGC GCGACGGTCG GCCTCAAGGG CCAGTGGGGC
GAGCTCACGC TCGGGCGCAA CTTCACGCCC ACGTACGACT ACATGCTGCC GTTCGACCCG
ATGGGCTACG CGCAGAACTA TTCGTGGGCG ACGTCCTCGA CGGCCACGGG CGGCCGCAAG
GACGGCCTCT TCACGCGCTC GTCGAACGCG GTGCGCTACG ACGGCGCGTA CGGCGGCCTG
CGCTTCGGCG CGATGTACGG CTTCGGCAAC GTGCCGGGCA GCATGAAGAC GAGCTCGAAA
TACGATTTCG CGCTCGGCTA CGAGAGCGGC CCGTTTGCCG CGGTCGTCAC GTTCGACCGC
CAGAACGGCG CGGCCGACAG CGTGACCCCG GCGGACCCCG TCAATTACGT GCAGGGCATT
CACGCGGGCG TCAGCTACGA CTTCGGCCGC CTGAAGACGA TGGCGGGCTA CCGCAACTAC
CGCCGCACGT ATCACACGGC GGCGGCGACG CAATTGAGCG ACATGTACTG GCTCGGCGGC
TCGTACGACT TCACGCCGGC GTTCTCGCTG ACGGGCGCGC TCTACCACCA GAACATCAAG
GGCGGCACCG ACGCCGATCC GACGCTCGTG TCGCTGCGCG CGCAATACGC GCTGTCCAAG
CGCACGGTGC TGTACGCGGC GGGCGGCTTC GCGATCGCCA AGCACGGGCA GAACGTCAGC
GTGTCGCGCG ACTCGGTCGG ATACGCGGAT ACGCAGCTCG GCGTGACCGT CGGGATGCAG
CAGCGGTTCT GA
 
Protein sequence
MKKHRRLAGT TAISASLAAC AGLASGHALA QSSVTLYGIM DAGIEYVNHA APDGGGAFRM 
KSGNKNTSRW GLRGVEDLGG GLKAVFRLES GIDLANGAFD DGPDSIFARR ATVGLKGQWG
ELTLGRNFTP TYDYMLPFDP MGYAQNYSWA TSSTATGGRK DGLFTRSSNA VRYDGAYGGL
RFGAMYGFGN VPGSMKTSSK YDFALGYESG PFAAVVTFDR QNGAADSVTP ADPVNYVQGI
HAGVSYDFGR LKTMAGYRNY RRTYHTAAAT QLSDMYWLGG SYDFTPAFSL TGALYHQNIK
GGTDADPTLV SLRAQYALSK RTVLYAAGGF AIAKHGQNVS VSRDSVGYAD TQLGVTVGMQ
QRF
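
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one. Translation table 11 assigns the same amino acids to codons as the standard table; its differences concern alternative start codons, which do not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAAGC ATCGCCGCCT TGCCGGCACG"))  # MKKHRRLAGT
```

Passing the full gene sequence to `translate` should give a string that can be compared with the listed protein sequence, and `gc_fraction` can be compared with the GC content field.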