Gene BURPS1106A_2048 details


Gene Information

Locus tag: BURPS1106A_2048
Symbol:
ID: 4900024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2031294
End bp: 2032394
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 65%
IMG OID: 640135278
Product: outer membrane porin
Protein accession: YP_001066313
Protein GI: 126454873
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
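
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2031294
end_bp = 2032394

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with exactly one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1101
print(remainder)       # 0
print(protein_length)  # 366
```

Under these assumptions the computed values agree with the reported Gene Length (1101 bp) and Protein Length (366 aa).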


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.569362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC TCTTGATCGC GCTGCCGCTC GCCGCGGCCG CTACCACCCA CGCCCAGAGC 
AGCGTCACGC TATACGGCGT CCTCGAGGAC GGCGTCGACT ATGTGTCGAA CGTGCAGGGT
AAGCATCTCG TGCAGCTCGC GTCGGGCGTG ACGGCCGGCA GCCGCTGGGG CGTGCGCGGT
ACCGAGGATC TCGGCGGGGG CCTGAGCGCG ATCTTCCGGC TCGAAAGCGG CTTCGACATC
AATTCCGGCC GCCTCGGCAG CGGTCTTGCG TTCTCGCGCA ACGCGTACGT CGGCGTCGGC
GACGCGAAGC TCGGCACGCT CACGCTCGGC CGCCAGTGGG ATTCGATCGT CGATTACGTC
GAGCCGTTCA CGCTGAACGG CAACATCGGC GGCTACTACT TCGCGCACCC GAACGACATG
GACAATACCG ACAACGGCTT CCCGATCTCG AACGCGGTCA AGTACCGCAG CCCGACGATC
GCGGGCTTCA CGTTCGGCGG CCTCTACGCG TTCGGCGGCC AGCCGGGCCG CTTTTCGGAC
AACGCGACGT TCAGCGTCGG CGCGAACTAC GCGGCGGGCC CGGTCGGCTT CGGCATCGGC
TATTTGCGGA TCAACAATCC GGGCGTATCG ACGCAGGGTT ACCAGAACTA TCCGGGCTTC
ACGAACGCGG TGTACGGCAA CTATCTCGAC GCGGCACGTG CTCAGAAGGT GTTCGGCGTC
GGCGCGTCGT ACCAGGTCGT GCAATGGCTG AAGCTGCTGG CCGATTTCAC GAACACGAAC
TTCCAGCAAG GCAGCGCGGG ACATGATGCG ACCTTCCAGA ACTATGAGCT GTCGGCGCTC
GTCAAGCCGA CGCCCGCGGT AACGATCGGC GCGGGCTATA CGTACACAAC GGGCCGCGAC
CACGCGACGA ATGCGGAGCC GAAGTATCAT CAGTTCAACC TGAGCGTTGA ATACGCGCTG
TCCAAGCGCA CGAGCGTCTA TGCGATGGGT GCGTTCCAGA AGGCGGCGGG GGATGCACCG
GTCGCGCAGA TCGCGGGTTT CAATCCGTCG GGCAACCAGA AGCAGGCGGT CGGGCGAGCC
GGTATCCGCC ACGTGTTCTG A
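
The gene sequence is displayed in blocks of ten bases, so it has to be joined into one string before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names clean_sequence and gc_fraction are hypothetical, and the record's GC content value may be rounded, so an exact match is not guaranteed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, in blocks of ten bases.
first_line = "ATGAAAAAGC TCTTGATCGC GCTGCCGCTC GCCGCGGCCG CTACCACCCA CGCCCAGAGC"
seq = clean_sequence(first_line)

print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the full gene sequence to clean_sequence and then to gc_fraction gives the value to compare against the GC content field.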
 
Protein sequence
MKKLLIALPL AAAATTHAQS SVTLYGVLED GVDYVSNVQG KHLVQLASGV TAGSRWGVRG 
TEDLGGGLSA IFRLESGFDI NSGRLGSGLA FSRNAYVGVG DAKLGTLTLG RQWDSIVDYV
EPFTLNGNIG GYYFAHPNDM DNTDNGFPIS NAVKYRSPTI AGFTFGGLYA FGGQPGRFSD
NATFSVGANY AAGPVGFGIG YLRINNPGVS TQGYQNYPGF TNAVYGNYLD AARAQKVFGV
GASYQVVQWL KLLADFTNTN FQQGSAGHDA TFQNYELSAL VKPTPAVTIG AGYTYTTGRD
HATNAEPKYH QFNLSVEYAL SKRTSVYAMG AFQKAAGDAP VAQIAGFNPS GNQKQAVGRA
GIRHVF
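
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. The names CODON_TABLE and translate are hypothetical. Alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, with codons enumerated in TCAG order
# (third base varying fastest). "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
print(translate("ATGAAAAAGC TCTTGATCGC GCTGCCGCTC"))  # MKKLLIALPL
print(translate("GGTATCCGCC ACGTGTTCTG A"))           # GIRHVF
```

Both fragments match the corresponding ends of the protein sequence listed above.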