Gene BURPS668_0049 details

Gene Information

Locus tag: BURPS668_0049
Symbol:
ID: 4882405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 47677
End bp: 48777
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 66%
IMG OID: 640125977
Product: putative porin
Protein accession: YP_001057104
Protein GI: 126441408
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
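
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(47677, 48777)
print(length, protein_length_aa(length))  # 1101 366
```

Both values agree with the Gene Length and Protein Length fields of this record.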


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.30645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGT TTGCGGTAGC GGCGGCGGGC CTTGCCGTCG CGACGGGCGC GCACGCGTCC 
GACGGCAGCG TCACGCTGTT CGGCCTGATC GATGCCGGCG TGTCGTACGT GTCGAACGAA
GGCGGCAAGC GCAACGTGTA TTTCGACGAC GGCATCGCGG TGCCGAATCT ATGGGGGCTT
CGGGGCACCG AGGATCTCGG CGGCGGCGCG AAGGCGATTT TCGAGCTGAC GTCGCAATAC
GCGCTCGGCA ACGGCGCCGC GCTGCCGACG CCGGGCTCGA TTTTCTCGCG CACCGCGCTC
GTCGGCCTCT GGAGCGAGCG GCTCGGCAGC ATGACGCTCG GCCAGCAATA CGACTTCATG
ACCGATTCGC TGACGTTCGG CTCGTTCGAC GGTGCGTTCC GCTACGGCGG CCTGTACAAC
TTCCGCCAGG GGCCGTTCTC GAAGCTCGGG ATTCCCGACA ATCCCACCGG CTCGTTCGAC
TTCGACCGGT TGGCGGGTTC GAGCCGCGTG CCGAACTCGG TCAAGTACAC GAGCGCGAAC
CTGAACGGGC TCGTGTTCGG CCTGATGTAC GGTTTCGGCA ATCAGGCGGG CGGCGGGCTC
GCGGCGAACA GCACCGTCAG CGCCGGCCTA AAGTACGAGA CGGGCAGTTT CGCGCTCGGC
GCCGCCTATG TCGAAGTCAA GTATCCGCAG ATGAACAACG GGCACGACGG GCTGCGCAAC
TGGGGGCTCG GCGCGCGTTA TGCGCTGTCC GCGTTCGATC TGAATCTGCT GTACACGAAC
ACGCGCAACA CGCTGACGGG CGCCGCGATC GACGTGATCC AGGCCGGCGT GCGCTACGTC
GGCGCGCCGT GGACGATCGG CGCGAACTAC GAGTACATGA AGGGCAACGC GCAGCTCGAT
CGCAACTACG CGCATCAAGT CACGGCGGCC GCGCAGTATG CGCTGTCCAA GCGCACGTCC
GCGTACGTCG AGACCGTGTA CCAGTACGCG GGCGGCAGCG CGGGCGCGCA TGCGTGGATC
AACGGCGTGA TGGGGCCCGA TGCGCAGTCG AGCTCGCGTT CGCAGTTTCT CGCGCGAATC
GGCATGCTTA CCCGTTTCTG A
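
A GC percentage such as the one reported in the GC content field can be recomputed from the blocked listing above. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and only the first line of the gene sequence is embedded to keep the example short.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, kept in its blocked layout.
first_line = "ATGAAGAAGT TTGCGGTAGC GGCGGCGGGC CTTGCCGTCG CGACGGGCGC GCACGCGTCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full 1101 bp listing into `first_line` would give the whole-gene figure to compare against the GC content field.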
 
Protein sequence
MKKFAVAAAG LAVATGAHAS DGSVTLFGLI DAGVSYVSNE GGKRNVYFDD GIAVPNLWGL 
RGTEDLGGGA KAIFELTSQY ALGNGAALPT PGSIFSRTAL VGLWSERLGS MTLGQQYDFM
TDSLTFGSFD GAFRYGGLYN FRQGPFSKLG IPDNPTGSFD FDRLAGSSRV PNSVKYTSAN
LNGLVFGLMY GFGNQAGGGL AANSTVSAGL KYETGSFALG AAYVEVKYPQ MNNGHDGLRN
WGLGARYALS AFDLNLLYTN TRNTLTGAAI DVIQAGVRYV GAPWTIGANY EYMKGNAQLD
RNYAHQVTAA AQYALSKRTS AYVETVYQYA GGSAGAHAWI NGVMGPDAQS SSRSQFLARI
GMLTRF
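
The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It assumes the codon-to-residue assignments of translation table 11, which match the standard code, and it ignores the table's alternative start codons, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are embedded.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGAAGAAGT TTGCGGTAGC GGCGGCGGGC CTTGCCGTCG CGACGGGCGC GCACGCGTCC"
tail = "GGCATGCTTA CCCGTTTCTG A"  # in frame: it starts at base 1081
print(translate(head))  # MKKFAVAAAGLAVATGAHAS
print(translate(tail))  # GMLTRF
```

The two outputs correspond to the first 20 and the last 6 residues of the protein sequence listed above.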