Gene BURPS1106A_A1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1231 
Symbol 
ID4903474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1166676 
End bp1167974 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID640144337 
Productouter membrane porin 
Protein accessionYP_001075266 
Protein GI126455983 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTCGG ACGAATCGCG GCCGTCGCTT CGATTCGAAA ATTTTATTTT GTTTCCGCTG 
ATTTCCGCCC CTATACTCCA TTCCACTAAA GGCACACAGT TCCATATATG GAACACAAAA
AGCCAAAACT CAGCCGTGGG AACGCGCGTG GCCGATCGTT TCGCGCGCGG CCACCGCCCG
CGTCATTTGG AGCAAAGGTC AGGAGATCAG ATGGTCAAGC ATGCGGCAGC AACATTACTC
GGCGCGCTCG CCGTCGCCGG CGCGTGGGCG CAGAGCAGCG TCACGCTGTA CGGCAGCCTC
GATGCCGGCG TCGCGTACGT GAACAACGTC GGCGGCGGCG CGAAGTGGTC GATGATTCAG
GGCAACACGC AGCCCGATCG ATGGGGGCTG AAGGGCGTCG AGGATCTGGG CGGCGGTCTG
AAGGCGATCT TCCAGCTCGA AAACGGCTTC TATACGAACA ACGGCCAGAT GGCGGCGGCG
GGCACGATGT GGAACCGTCA GGCGTTCGTC GGCCTGAATT CCGACCGGCT CGGCGCGCTG
ACGCTCGGCC ACCAGACGCC GTTCAACTTC GACTGGCTCG ACCCGCTGTC GAGCGCATTC
CTCGCGCAGA GCTGGTATGC GTTCCATCCC GGCAACCTCG ACCAGCTCGC GGACACCAGC
ACCGTGCCGT TCAACAACTC GGTCAAGTAC CGGTCACCCG TCTTTGCGGG CTTCACGGTG
GGCGCGATGC TCGGCTTCGG CAACACGACG AACTTCTCGA CCGGCCGCAC GATGAGCTTC
GGCGTGAACT ACGCGAACGG CCCGTTCAAG GCGGCCGCCG TCTATTCGAA CGAGCACGAC
CAGGCGTTCC CGATGGCGAC CGTCGGCGGC ATCGCCGGGC CGGGCGGCAC GTTCCAGGGC
ATGCCCGTCG CGAGCTATGT CGCGAAGAAG GCGCAGAACA TGGGCGCGGG CCTGTCGTAC
CGCTTCGGCC CGCTGCTCGT GCACGGCCTT TACACGCGCG TGAAACTGCA GGCGAACGGC
CACTCGGATA CGTTCCAGAG CTACGACGCC GGCGCGAACT ACCAGAGCTC GCCGTTCAAC
GTGATCGCGG GCGGCGCGGC GACTTCGACG CTCGCCGGCC GCCGCTGGAG CCAGTTCGAG
CTCGGCGACA CGTATTCGCT GTCCAAGCGC ACGCAGCTCT ACGTGAACGT GCTGTACGAG
CACGCGAGCG GCAACGCGAA GGCCGCGTTC TTCACGGCGG GCGCGTCGAG CACGGCGAAT
CAGGTGATTG TCCTGACGGG GATTCACCAC TCGTTCTGA
 
Protein sequence
MSSDESRPSL RFENFILFPL ISAPILHSTK GTQFHIWNTK SQNSAVGTRV ADRFARGHRP 
RHLEQRSGDQ MVKHAAATLL GALAVAGAWA QSSVTLYGSL DAGVAYVNNV GGGAKWSMIQ
GNTQPDRWGL KGVEDLGGGL KAIFQLENGF YTNNGQMAAA GTMWNRQAFV GLNSDRLGAL
TLGHQTPFNF DWLDPLSSAF LAQSWYAFHP GNLDQLADTS TVPFNNSVKY RSPVFAGFTV
GAMLGFGNTT NFSTGRTMSF GVNYANGPFK AAAVYSNEHD QAFPMATVGG IAGPGGTFQG
MPVASYVAKK AQNMGAGLSY RFGPLLVHGL YTRVKLQANG HSDTFQSYDA GANYQSSPFN
VIAGGAATST LAGRRWSQFE LGDTYSLSKR TQLYVNVLYE HASGNAKAAF FTAGASSTAN
QVIVLTGIHH SF