Gene BURPS1106A_3561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3561 
Symbol 
ID4902400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3466818 
End bp3467978 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID640136787 
Productouter membrane porin 
Protein accessionYP_001067797 
Protein GI126452945 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT CGCTTCTCGC GCTCGTCGCG CTGAGCGCGT TTGCTGGCGC GGCTCATGCG 
CAAAGCAGCG TGACGCTGTA CGGCATCATC GACGAAGGCT TCAACATCAA TACCAATGCA
GGCGGCAAGC ACCTGTACAA CCTGTCGAGC GGTGTCATGC AGGGTAGCCG TTGGGGCCTG
CGCGGCACGG AAGACCTGGG CGGTGGCCTG AAGGCGCTGT TCGTCCTCGA AAACGGCTTC
GACGTGAACT CGGGCAAGCT GAACCAGGGC GGCCTCGAAT TCGGCCGTCA AGCGTACGTC
GGCCTGTCGA GCGGCTTCGG CACCGTCACG CTCGGCCGTC AGTACGACTC CGTCGTCGAC
TTCGTCGGCC CGCTGGAAGC CGGCGACCAG TGGGGCGGCT ACATCGCCGC TCACCCGGGC
GATCTCGACA ACTTCAACAA CGCATATCGC GTGAACAACG CAGTCAAGTT CACGAGCGCG
AACTACGGCG GCTTCACGTT CGGCGGCCTG TACAGCTTCG GCGGCGTCGC CGGCGACTTC
AGCCGCAACC AGACCTGGTC GCTCGGCGCG GGCTACACGA ACGGCCCGCT CGTGTTGGGC
GTCGGCTACC TGAACGCGCG CACGCCGTCG ACGGCTGGCG GCCTGTTCGG CAACAACACG
ACGTCGAGCA CGCCGGCTGC CGTGACGACC CCGGTCTACG CGGGCTATGC GTCGGCCCAT
ACGTACCAGG TGATCGGTGC GGGCGGCGCC TATTCGTTCG GCGCGGCGAC GGTCGGCATC
ACGTACTCGA ACATCAAGTT CATGAACTTC GCGAGCACGG TGTTCCCGAA CCAGACCGCG
ACGTTCAACA ACGCGGAAAT CAACTTCAAG TATCAGTTGA CCCCGACGCT GCTCGCCGGC
GCGGCGTATG ACTACACGCA AGGCAGCAAG ATCGCCGGCT CGTCCGCGGC CAAGTATCAC
CAAGGCTCGG TCGGCGTCGA CTACTTCCTG TCGAAGCGCA CCGACGTCTA CGCGATCGGC
GTGTATCAGC ACGCTTCGGG CAACGTGATC GAAGCCGACG GCAACACGGT CGGCCCGGCG
ACCGCCGCGA TCAACGGCCT GACGCCGTCG TCGAACCGCA ACCAGTTCGC AGCGCGCGTC
GGCATCCGCC ATAAGTTCTA A
 
Protein sequence
MKKSLLALVA LSAFAGAAHA QSSVTLYGII DEGFNINTNA GGKHLYNLSS GVMQGSRWGL 
RGTEDLGGGL KALFVLENGF DVNSGKLNQG GLEFGRQAYV GLSSGFGTVT LGRQYDSVVD
FVGPLEAGDQ WGGYIAAHPG DLDNFNNAYR VNNAVKFTSA NYGGFTFGGL YSFGGVAGDF
SRNQTWSLGA GYTNGPLVLG VGYLNARTPS TAGGLFGNNT TSSTPAAVTT PVYAGYASAH
TYQVIGAGGA YSFGAATVGI TYSNIKFMNF ASTVFPNQTA TFNNAEINFK YQLTPTLLAG
AAYDYTQGSK IAGSSAAKYH QGSVGVDYFL SKRTDVYAIG VYQHASGNVI EADGNTVGPA
TAAINGLTPS SNRNQFAARV GIRHKF