Gene BURPS1710b_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3043 
SymbolopcP1 
ID3690261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3356318 
End bp3357457 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content63% 
IMG OID637729498 
Productouter membrane protein 
Protein accessionYP_334420 
Protein GI76809014 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000337977 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CCCTCATCGT TGCCGCTCTT TCCGGCGTTT TCGCAACGGC CGCTCACGCG 
CAAAGCAGCG TGACGCTGTA CGGCCTGATC GACGCCGGCA TCACCTACAC GAACAACCAA
GGCGGCCACA GCGCATGGTC GCAATCCACC GGCTCGGTCA ACGGCAGCCG CTGGGGCCTG
CGCGGCGCCG AGGATCTCGG CGGCGGCCTG AAGGCGATTT TCGTGTTGGA AAACGGCTTC
GGCATCAATA ACGGCACGCT GAAGCAGAAC GGCCGCGAGT TCGGCCGTCA GGCGTTCGTC
GGCCTGTCGC ACGAGCAATA CGGCGCGCTG ACGCTCGGCC GTCAATACGA CAGCGTCGTC
GACTACCTCG GGCCGCTGTC GCTGACGGGC ACGCAATTCG GCGGCACGCA GTTCGCCCAC
CCGTTCGACA ACGACAACCT GAACAATTCG TTCCGGATCA ACAACGCGGT CAAGTACACG
AGCGTGAACT GGGCGGGCCT GAAATTCGGC GCGTTGTACG GCTTCTCGAA CAACAATCAG
TTCGCGAACA ACCGCGCCTA TAGCGCGGGC GTATCGTACA GCTACGCCGG CTTCAACATC
GGCGCCGGCT ACCTGCAGTT GAACAACAAC TTCGGCCCGA CGGTCTCCAA CGCATCCGGC
GCGGTCGCGC TCGACAACAC GTTCGTCGGC AAGCGCCAGC GCGTGTTCGG CGGCGGCCTG
AACTACACGT TCGGCCCGGC AACGGCCGGC TTCGTGTTCA CGCAATCGCG CGTCAACCGC
GCGACGGCAA TCGGCGCGGG CGCATCGGGC GTGTCGAGCG GCATTGCGCT CGACGGCACG
TTCATGCGCT TCAACAACTA CGAAGTGAAC GCGCGCTACG CGATCACGCC GGCATGGACG
GTGGCCGGTT CGTACACGTA CACCGCCGGC TTCATCGAGA ACCACCACCC GGGCTGGAAC
CAATTCAACC TGCAAACGGC CTACGCGCTG TCCAAGCGCA CGGACATGTA CCTGCAAGGC
GTGTATCAGA AGGTCAACAA CGACGGCACG GGCCTCGGCG CGTACATCAA CGGTATCGGC
GGCATGTCGT CGACGGAAAA ACAGATCGCC GTCACGGCCG GCCTGCGTCA CCGCTTCTAA
 
Protein sequence
MKKTLIVAAL SGVFATAAHA QSSVTLYGLI DAGITYTNNQ GGHSAWSQST GSVNGSRWGL 
RGAEDLGGGL KAIFVLENGF GINNGTLKQN GREFGRQAFV GLSHEQYGAL TLGRQYDSVV
DYLGPLSLTG TQFGGTQFAH PFDNDNLNNS FRINNAVKYT SVNWAGLKFG ALYGFSNNNQ
FANNRAYSAG VSYSYAGFNI GAGYLQLNNN FGPTVSNASG AVALDNTFVG KRQRVFGGGL
NYTFGPATAG FVFTQSRVNR ATAIGAGASG VSSGIALDGT FMRFNNYEVN ARYAITPAWT
VAGSYTYTAG FIENHHPGWN QFNLQTAYAL SKRTDMYLQG VYQKVNNDGT GLGAYINGIG
GMSSTEKQIA VTAGLRHRF