Gene BURPS668_A1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1303 
Symbol 
ID4886387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1229442 
End bp1230530 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content66% 
IMG OID640131242 
Productouter membrane porin 
Protein accessionYP_001062300 
Protein GI126443774 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAAGC ATGCGGCAGC AACATTACTC GGCGCGCTCG CCGCCGCCGG CGCGTGGGCG 
CAGAGCAGCG TCACGCTGTA CGGCAGCCTC GACGCCGGCG TCGCGTACGT GAACAACGTC
GGCGGCGGCG CAAAGTGGTC GATGATTCAG GGCAACACGC AGCCCGATCG ATGGGGGCTG
AAGGGCGTCG AGGATCTGGG CGGCGGTCTG AAGGCGATCT TCCAGCTCGA AAACGGCTTC
TATACGAACA ACGGCCAGAT GGCGGCGGCG GGCACGATGT GGAACCGTCA GGCGTTCGTC
GGCCTGAATT CCGACCGGCT CGGCGCGCTG ACGCTCGGCC ATCAGACGCC GTTCAACTTC
GACTGGCTCG ACCCGCTGTC GAGCGCATTC CTCGCGCAGA GCTGGTATGC GTTCCATCCC
GGCAACCTCG ACCAGCTCGC GGACACCAGC ACCGTGCCGT TCAACAACTC GGTCAAGTAC
CGGTCACCCG TCTTTGCGGG CTTCACGGTG GGCGCGATGC TCGGCTTCGG CAACACGACG
AACTTCTCGA CCGGCCGCAC GATGAGCTTC GGCGTGAACT ACGCGAACGG CCCGTTCAAG
GCGGCCGCCG TCTATTCGAA CGAGCACGAC CAGGCGTTCC CGATGGCGAC CGTCGGCGGC
ATCGCCGGGC CGGGCGGCAC GTTCCAGGGC ATGCCCGTCG CGAGCTATGT CGCGAAGAAG
GCGCAGAACA TGGGCGCGGG CCTGTCGTAC CGCTTCGGCC CGCTGCTCGT GCACGGCCTT
TACACGCGCG TGAAACTGCA GGCGAACGGC CACTCGGATA CGTTCCAGAG CTACGACGCC
GGCGCGAACT ACCAGAGCTC GCCGTTCAAC GTGATCGCGG GCGGCGCGGC GACTTCGACG
CTCGCCGGCC GCCGCTGGAG CCAGTTCGAG CTCGGCGACA CGTATTCGCT GTCCAAGCGC
ACGCAGCTCT ACGTGAACGT GCTGTACGAG CACGCGAGCG GCAACGCGAA GGCCGCGTTC
TTCACGGCGG GCGCGTCGAG CACGGCGAAT CAGGTGATTG TCCTGACGGG GATTCACCAC
TCGTTCTGA
 
Protein sequence
MVKHAAATLL GALAAAGAWA QSSVTLYGSL DAGVAYVNNV GGGAKWSMIQ GNTQPDRWGL 
KGVEDLGGGL KAIFQLENGF YTNNGQMAAA GTMWNRQAFV GLNSDRLGAL TLGHQTPFNF
DWLDPLSSAF LAQSWYAFHP GNLDQLADTS TVPFNNSVKY RSPVFAGFTV GAMLGFGNTT
NFSTGRTMSF GVNYANGPFK AAAVYSNEHD QAFPMATVGG IAGPGGTFQG MPVASYVAKK
AQNMGAGLSY RFGPLLVHGL YTRVKLQANG HSDTFQSYDA GANYQSSPFN VIAGGAATST
LAGRRWSQFE LGDTYSLSKR TQLYVNVLYE HASGNAKAAF FTAGASSTAN QVIVLTGIHH
SF