Gene BURPS1106A_3240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3240 
SymboltolA 
ID4900942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3151451 
End bp3152476 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content70% 
IMG OID640136466 
ProductTolA protein 
Protein accessionYP_001067478 
Protein GI126452064 
COG category[S] Function unknown 
COG ID[COG4487] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTC GCCAGTCGCG CACCGCCGCC TACCCGCCCC GGCCGCCGCG CGAGCGCGGC 
ACAGGCCGGG CGTTCCTGCT CGCCGCGCTG ATGCACGTGC TGCTCGCGCT TTTCCTGTAC
CACGGCGTGC ACTGGCAGAA CAGCACGCCG GCCGGCGCGG AGGCCGAGCT GTGGACGTCG
GTGCCTGACA CGTCGACGCC GCAACCGGCG CCGACGCCGC CCGTGAAAGT CGCGCCTCCC
CCGCCGCCCG TGAAGAACGA GGAAGCGGAT ATCGCCCTGC AGCAGAAGCG GCGCGAGCAG
CAGGCCGCGG CCGCCCGCGA GGCGCAGCTC GAGGAGCAGC GCCGGCAGCA GCAATTGAAG
GCGCAGCAAC TCGCCGCGCA GCAGGCCGCT CAGCTCGCCG CGCAAAAGGC CGCCGAGCGC
GAGAAGCAAA AGCAGGCGGA AAAGCTCAAG CAGCAGCAAC TCGCGGAACA GCAGCAACGC
AAACTCGAAC AGCAGAAGCT CGAGCAACAA AAGCTCGAAC AACAGAAGAA GCAGGAACAG
CTCGCCGCGC AAAAGAAGGC GGACGCCGAA AAGGCCGAGA AAGCCGAAAA GGCGGCGAAG
GCCGCGGCGG CCGCCAAGGC GAACGCCGCC GCGAAGGCGA AGCTCGACAA GGAGCGTCAG
GCGCGCCTCG CGCAGTTGCA AGGCATCGCG GGCGGCGGCT CGGGCGGCGG CGAAGGCCTC
GCGAAGAGCG GCACGGGCAC GGGCTCGGGC GGCAACGCCG CGTCCCCGGG CTATGCGGAC
AAGGTCCGCC GGCGCGTGAA GCCGAACATC GTGTGGGCGG GCGAGCGCGA CAGCCTCGTG
ACCGTCGTCG CGATCCGCTG CACGCCGTCG GGCGACGTGC TCAGCACGTC GATCCGCCGG
TCCAGCGGAA ATTCGGGGTG GGATCAGGCG GTCATCAGCG CGATCCAGGC GTCGGTGCCC
CTGCCGCCCG ATACCAACGG CCGCACTCCG TCCGAGATTA CGATTACCTT CAAGGCGGCG
GAGTGA
 
Protein sequence
MKPRQSRTAA YPPRPPRERG TGRAFLLAAL MHVLLALFLY HGVHWQNSTP AGAEAELWTS 
VPDTSTPQPA PTPPVKVAPP PPPVKNEEAD IALQQKRREQ QAAAAREAQL EEQRRQQQLK
AQQLAAQQAA QLAAQKAAER EKQKQAEKLK QQQLAEQQQR KLEQQKLEQQ KLEQQKKQEQ
LAAQKKADAE KAEKAEKAAK AAAAAKANAA AKAKLDKERQ ARLAQLQGIA GGGSGGGEGL
AKSGTGTGSG GNAASPGYAD KVRRRVKPNI VWAGERDSLV TVVAIRCTPS GDVLSTSIRR
SSGNSGWDQA VISAIQASVP LPPDTNGRTP SEITITFKAA E