Gene BURPS1106A_2390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2390 
Symbol 
ID4901013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2356708 
End bp2357964 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content71% 
IMG OID640135618 
Producttwin-arginine translocation pathway signal sequence domain-containing protein 
Protein accessionYP_001066650 
Protein GI126453350 
COG category[S] Function unknown 
COG ID[COG4102] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAC GTGATTTTCT GGCCCTGGCG AGCCTTGCCG GCGCGGCGGG CGTATCGTTG 
CCGGTGCCGT ATGCGTTCGC TGCCGCGCCG GGCGAGACGA GCGCAACGGG GGCGATGGGA
GCGGTGGGGG CGGCGGCGCG CGCCGCACGC TACTCGAACC TGCTGATTCT CGTCGAGCTC
AAGGGCGGCA ACGACGGGCT CAACACGGTG ATTCCGTACG CGAATCCGCT GTACCGCACG
CTGCGCCCGG CGATCGGCGT CAAGCGCGAG CAGGTCGTGC AGCTCGACGA GCGCGCCGCG
CTGCATCCGG CGCTCGAGCC GCTCATGCCG ATCTGGCGCG ACGGACGGCT CGCGATCGTC
GAAGGCGTCG GCTATCCGCA GCCGAATCTG TCGCACTTTC GCTCGATCGA GATCTGGGAT
ACCGCGTCGC GCGCGAACGA GTATCTGCGC GAAGGGTGGC TCACGCGCGC GTTCGCGCAG
GCGAGCGTGC CGCCCGGCTT CGCCGCGGAC GGCATCGTGC TCGGCAGCGC GGAAATGGGG
CCGCTCGCGA ACGGCGCGCG TGCGATCGCC CTCGTGAATC CCGCGCAGTT CGCGCGCGCG
GCGCGACTCG CGCAGCCCGT GTCGCTGCGT GAGCGCAACC CCGCGCTCGC GCACGTGATC
GATATCGAAA ACGACATCGT CAAGGCCGCC GATCGGCTGC GTCCGCATGC GGGCACGCCC
GCGCTCGCGA CCGCGTTTCC GGGCGGGCCG TTCGGCGCAT CGGTGAAGAC CGCGATGCAG
GTGCTCGCCG CGTGCGATAC GCCGCAGCGT ACGCCGGCGC CGGGGCAGGG CGTCGCGGTG
CTGCGCCTCA CGCTGAACGG CTTCGACACG CATCAGAACC AGCCCGGCCA GCAGGCGGGC
TTGCTCGGCC AACTGGCGCA AGGGCTGGTG GCGATGCGCT CGGCGTTGAT CGAGCTCGGG
CGCTGGAACG ATACGCTCGT GATGACGTAT GCGGAGTTCG GCCGGCGCGC GCGAGAGAAT
CGGAGCAACG GAACCGATCA CGGCACGGCC GCGCCGCATT TCGTGATGGG CGGGCGCGTG
CGGGGCGGGC TGTACGGCGC GCCGCCCGCG CTCGACGCGC TCGACGGCAA CGGCAACCTG
CCTGTCGCCG TCGATTTCCG TCAGCTTTAT GCGACCGTGC TCGGCCCATG GTGGGGGCTC
GACGCGGCGA GTGTGCTCAG GCAGCGTTTC GAGCCGCTGC CGTTGCTGCG CGCCTGA
 
Protein sequence
MKRRDFLALA SLAGAAGVSL PVPYAFAAAP GETSATGAMG AVGAAARAAR YSNLLILVEL 
KGGNDGLNTV IPYANPLYRT LRPAIGVKRE QVVQLDERAA LHPALEPLMP IWRDGRLAIV
EGVGYPQPNL SHFRSIEIWD TASRANEYLR EGWLTRAFAQ ASVPPGFAAD GIVLGSAEMG
PLANGARAIA LVNPAQFARA ARLAQPVSLR ERNPALAHVI DIENDIVKAA DRLRPHAGTP
ALATAFPGGP FGASVKTAMQ VLAACDTPQR TPAPGQGVAV LRLTLNGFDT HQNQPGQQAG
LLGQLAQGLV AMRSALIELG RWNDTLVMTY AEFGRRAREN RSNGTDHGTA APHFVMGGRV
RGGLYGAPPA LDALDGNGNL PVAVDFRQLY ATVLGPWWGL DAASVLRQRF EPLPLLRA