Gene BURPS668_3202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3202 
SymboltolA 
ID4885165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3135142 
End bp3136167 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content70% 
IMG OID640129130 
ProductTolA protein 
Protein accessionYP_001060214 
Protein GI126438620 
COG category[S] Function unknown 
COG ID[COG4487] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain
[TIGR02794] TolA protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTC GCCAGTCGCG CACCGCCGCC TACCCGCCCC GGCCGCCGCG CGAGCGCGGC 
ACAGGCCGGG CGTTCCTGCT CGCCGCGCTG ATGCACGTGC TGCTCGCGCT TTTCCTGTAC
CACGGCGTGC ACTGGCAGAA CAGCACGCCG GCCGGCGCGG AGGCCGAGCT GTGGACGTCG
GTGCCTGACA CGTCGACGCC GCAACCGGCG CCGACGCCGC CCGTGAAAGT CGCGCCTCCC
CCGCCGCCCG TGAAGAACGA GGAAGCGGAT ATCGCCCTGC AGCAGAAGCG GCGCGAGCAG
CAGGCCGCGG CCGCCCGCGA GGCGCAGCTC GAGGAGCAGC GCCGGCAGCA GCAATTGAAG
GCGCAGCAAC TCGCCGCGCA GCAGGCCGCC CAGCTCGCCG CGCAAAAGGC CGCCGAGCGC
GAGAAGCAAA AGCAGGCGGA AAAGCTCAAG CAGCAGCAAC TCGCGGAACA GCAGCAACGC
AAACTCGAAC AGCAGAAGCT CGAGCAACAA AAGCTCGAAC AACAGAAGAA GCAGGAACAG
CTCGCCGCGC AAAAGAAGGC GGACGCCGAA AAGGCCGAGA AAGCCGAAAA GGCGGCGAAG
GCCGCGGCGG CCGCCAAGGC GAACGCCGCC GCGAAGGCGA AGCTCGACAA GGAGCGTCAG
GCGCGCCTCG CGCAGTTGCA AGGCATCGCG GGCGGCGGCT CGGGCGGCGG CGAAGGCCTC
GCGAAGAGCG GCACGGGCAC GGGCTCGGGC GGCAACGCCG CGTCCCCGGG CTATGCGGAC
AAGGTCCGCC GGCGCGTGAA GCCGAACATC GTGTGGGCGG GTGAGCGCGA CAGCCTCGTG
ACCGTCGTCG CGATCCGCTG CACGCCGTCG GGCGACGTGC TCAGCACGTC GATCCGCCGG
TCCAGCGGAA ATTCGGGGTG GGATCAGGCG GTCATCAGCG CGATCCAGGC GTCGATGCCC
CTGCCGCCCG ATACCAACGG CCGCACTCCG TCCGAGATTA CGATTACCTT CAAGGCGGCG
GAGTGA
 
Protein sequence
MKPRQSRTAA YPPRPPRERG TGRAFLLAAL MHVLLALFLY HGVHWQNSTP AGAEAELWTS 
VPDTSTPQPA PTPPVKVAPP PPPVKNEEAD IALQQKRREQ QAAAAREAQL EEQRRQQQLK
AQQLAAQQAA QLAAQKAAER EKQKQAEKLK QQQLAEQQQR KLEQQKLEQQ KLEQQKKQEQ
LAAQKKADAE KAEKAEKAAK AAAAAKANAA AKAKLDKERQ ARLAQLQGIA GGGSGGGEGL
AKSGTGTGSG GNAASPGYAD KVRRRVKPNI VWAGERDSLV TVVAIRCTPS GDVLSTSIRR
SSGNSGWDQA VISAIQASMP LPPDTNGRTP SEITITFKAA E