Gene BURPS1106A_A1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1072 
Symbol 
ID4904137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1029673 
End bp1030812 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content71% 
IMG OID640144178 
Productouter membrane porin 
Protein accessionYP_001075107 
Protein GI126457262 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.645195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACATT CGGCCCGCCT GCTGGCCTGC GGCGCGCTCT GCGTCACCGC CGCGCATGCG 
CACGCGCAAT CGAGCGTCAC GCTATACGGC ATCGTCGACA CCGGCATCGA ATTCGTTTCG
CACGCGAGCG CGAAGGGCGG TTCCGCGTGG CGCATGCCGG CCGTCACGGG CGAGCTGCCG
TCGCGCTGGG GCTTGCGCGG CGTCGAGGAT CTCGGCGGCG GCTATCGCGC GCTCTTCGCG
CTCGAAAGCG GCTTCAACCT GCGCGGCGGC GAGCTCGGCC AGGGCGGGCG ACTGTTCGGG
CGTCAGGCAT ACGTGGGCCT GCGCGCGCCG TTCGGCACGC TCGCGTTCGG CCGGCAATAC
ATGATGACTT ATGTCGCGCT GCAGGGCGCG GACATCATCG GCCCCGACAT CTATGGGCTC
GGCTCGCTCG ACGCTTACAT CCCGAACGGC CGCGCGGACA ACGCGGTGAC CTATGTCGGC
AGCTATCGCG GCGTGACGCT CGGCGCCGGC TATTCGTTCG GCCGCGACTC GGCAGGCACC
GGCAATTCGC CGGGGCAGGG CACGTGCGTC GGCTCGGTGC CGGGGCGCGC GGTCGAATGC
CGGAGCGGGT CGGCGATGCT GAAGTACGAC GCCGAGCGCT TCGGCGTGGC TGCGTCGTAC
GAAGAGCAGC GCGGCGGCGC GAACGCGGCG GCGAACTTCT TCGACGGTGC CGCGCCGATG
CCGATCGCAA GCAGTGCGGA CAAGGACACG CGCGCGCACG TGAGCGCGTA CGCGAACGCG
GGGCCGGTCA AGCTCGGCGC GGGCTGGATC GGCCGGCGCG TGTCGACCGA CGCGCCCGCC
GCGCCCGACG TGCGCACCGA TCTGTTCTTC GTCGGCGCCG CCTATCGCGC GACGCCGTTC
GTGACGATCG ACGGCGAAGC CTACCGGATC GTCGATGCGC GGCACGACGC GCGCGCGACG
ATGGCGACGC TGCGCGCGAG CTTCTCGCTG TCGAAGCGCA CCGCCGTCTA TGCGCAGACC
GCGTACCTAT GGAACAGCGC GCACGCGCGC TATTCGGTGA GCGGCGGCGG AGGCGGCACG
ATGCCCGCGG CCGGCGTCGG CCAGCTCGGC GCGATGGTCG GCGTTCGGCA CATGTTCTGA
 
Protein sequence
MRHSARLLAC GALCVTAAHA HAQSSVTLYG IVDTGIEFVS HASAKGGSAW RMPAVTGELP 
SRWGLRGVED LGGGYRALFA LESGFNLRGG ELGQGGRLFG RQAYVGLRAP FGTLAFGRQY
MMTYVALQGA DIIGPDIYGL GSLDAYIPNG RADNAVTYVG SYRGVTLGAG YSFGRDSAGT
GNSPGQGTCV GSVPGRAVEC RSGSAMLKYD AERFGVAASY EEQRGGANAA ANFFDGAAPM
PIASSADKDT RAHVSAYANA GPVKLGAGWI GRRVSTDAPA APDVRTDLFF VGAAYRATPF
VTIDGEAYRI VDARHDARAT MATLRASFSL SKRTAVYAQT AYLWNSAHAR YSVSGGGGGT
MPAAGVGQLG AMVGVRHMF