Gene BURPS668_A1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1822 
Symbol 
ID4886691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1778810 
End bp1780240 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content66% 
IMG OID640131760 
Productputative outer membrane protein TolC 
Protein accessionYP_001062817 
Protein GI126442528 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAA CAGAGAATTT GGGTGCCTCG TCTAAATTTC GGCATGCGGG TTTTCACTTA 
ACGACAAAAT CTTTTCGGCT TCGCGCCAAT GTGGCCTACG TTACCGTTTG CGCGGTGCTT
GCCGCATTCG GCACCGGTTC CGAATCCGCA CGTGCGTTCT GTCTCGACGA GGCTTATCAG
CACGCAATAT CGAACGATCC GAAATTTCTC CAGGCGCGGG CGGAATACGA CGCCGCGCGG
CAGAAGTTTC CGCAGGCGCG CGCGCAGATG CTGCCGCAGG TGAGCGCGCA GCTCGAATGG
GGCCGCTACG GCACGCACGC GAACCTGTTC GGCATCGACG TGAGCGGCAA CAGCACCGCC
GCGTACGGCG CCGCGCAGCT CTCGCAGGCG CTGTTCAACA TGCCGTATTT GTACGACATG
AGCCGCGCGA AGGAATTCGA GGAATCCGCG CGGCAGCAGC TCGAAGTCGC GAAGCAGGAG
CTGATCATGC GCGTCGCGAA CGCGTGCTTC GATCTGCTGT CCGCGCGCGA GAAGCTGCAG
CTCGCCGACG ACGAGGTCGG CGCGCTCACG CGCCTGGAGA GCGATACCCG CCGCATGGCG
CAGCTCGGCA TGAAGACCAT CGGCGACACG GCCGAGATCG AGGCGCGCAG GAGCCTCGCG
CAGTCGGACG AGGCGCTCGC GCGCACCGAC GTCGAGGCGC GGCGCGCCCG CTACGAGACG
CTGCTCGGCT CCGCGATCGA CTTCACGCGC TGGCCGCGGC TCGCGATGCA CGGCACGTCG
CCGCGCATTC CGACGGGCGA CTACCAGCCG CAGGACAACC CGTCGTACCA GCAGGCGTAT
CGCGACCTGC GCGTCGCGCG GCTCGCGTCC AAGCGCATCA ACGCGGAGCA CCTGCCGAGC
GTCGACCTGT TCGCGACGTA CTCGCGCGGC CTCAATCCGA ACCTGCGCGG CCTCACCGAC
AAGAACGACT TCCACCAGAG CGCGGTCGGC GTGCAGGTGA CGATTCCGAT CTTCTCGGGC
GGCAGCGTGC ACTACCGGAA GATCGAGGCC GACCACGTCG CGACGCAGTA CCAGAACCGG
CTGCGCGAGG TCGAGCAGCA ACTGAGCACC GATCATCGCG AGACGCTCGC GGCGCTGCAG
TCGATCGGCA CGCGGATCCG CGCGCTGCAG CAATCGCTGC AGGCGGCGCG GCTCGCGTAC
GATTCGTCGA TGAAGGCGCA CCAGGTCGGC TACAGTACGA CGTACGAGAC GCTGAACCTG
CGCACCGACA TCTCGAACAT CCGCCAGAAG CTGTTCGAGA GCTACCTCGA CGCGCTGAAG
CTCCAGCTGA AGCTCAAGGG CATTCTCGGC ACGCTGGACG AGCAGTCGCT CGTCGCGGTC
GACAGCTTCC TCGCGAGCAA CGCGGCGCCC GCCGATCAGA AGAGCGAATG A
 
Protein sequence
MFETENLGAS SKFRHAGFHL TTKSFRLRAN VAYVTVCAVL AAFGTGSESA RAFCLDEAYQ 
HAISNDPKFL QARAEYDAAR QKFPQARAQM LPQVSAQLEW GRYGTHANLF GIDVSGNSTA
AYGAAQLSQA LFNMPYLYDM SRAKEFEESA RQQLEVAKQE LIMRVANACF DLLSAREKLQ
LADDEVGALT RLESDTRRMA QLGMKTIGDT AEIEARRSLA QSDEALARTD VEARRARYET
LLGSAIDFTR WPRLAMHGTS PRIPTGDYQP QDNPSYQQAY RDLRVARLAS KRINAEHLPS
VDLFATYSRG LNPNLRGLTD KNDFHQSAVG VQVTIPIFSG GSVHYRKIEA DHVATQYQNR
LREVEQQLST DHRETLAALQ SIGTRIRALQ QSLQAARLAY DSSMKAHQVG YSTTYETLNL
RTDISNIRQK LFESYLDALK LQLKLKGILG TLDEQSLVAV DSFLASNAAP ADQKSE