Gene BTH_I2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2097 
Symbol 
ID3848970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp2374892 
End bp2376154 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content70% 
IMG OID637841766 
Producttwin-arginine translocation pathway signal sequence domain-containing protein 
Protein accessionYP_442621 
Protein GI83720073 
COG category[S] Function unknown 
COG ID[COG4102] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAC GCGATTTTCT GGCCTTGGCG AGTCTTGCCG GCGCGGCGGG CGTTTCGCTG 
TCGATGCCGC ACGCGTTCGC GGCGGCATCC GCCGCTTCGG GCGCGAAGGG GGCAATCGGC
GCGAATGGCG CGATTGACAT GGCGGGTGCC GCGCGCTACT CGAACCTGCT CATCCTGGTC
GAGCTGAAGG GCGGCAACGA CGGGCTCAAC ACGGTGATTC CGTACGCGGA CCCGCTGTAC
CGCACGCTGC GCCCGACGAT CGGCGTCAAG CGCGAGCAGG TCGTGCAGCT CGACGAGCGC
GCCGCGCTGC ATCCGGCGCT CGAGCCGCTC GTGCCGATCT GGCGTGATGG GCGGCTCGCG
ATCGTCGATG GCGTCGGCTA TCCGCAGCCG AATCTGTCGC ATTTTCGCTC GATCGAGATC
TGGGATACCG CGTCGCGCGC GGACGAGTAT CTGCGTGAAG GGTGGCTCAC GCGCGCATTC
GCGCAGGCCG GCGTGCCGCC CGGCTTCGCG GCGGACGGCA TCGTGCTCGG CAGCGCGGAA
ATGGGGCCGC TCGCGAACGG CGCGCGCGCG ATCGCACTCG TCAATCCCGC ACAATTCGCG
CGTGCGGCGC GGCTCGTGCA GCCCGTATCG CTGCGCGAGC AGAATCCCGC GCTCGCGCAC
GTGATCGACA TCGAGAACGA CATCGTCAAG GCCGCCGATC GGCTGCGTCC GCACGCGGGC
ACGCCCGCGC TCGCGACCGC GTTTCCGGGC GGGCCGTTCG GCGCGTCGGT GAAGACCGCG
ATGCAGGTGC TCGCCGCGTG CGACACGCCA CAGCGTACGC CGGCGCCGGG GCAGGGCGTC
GCGGCGCTGC GTCTCACGCT GAACGGCTTC GACACGCATC AGAACCAGCC CGGCCAGCAG
GCGGGATTGC TCAAGCAACT GGCGCTGGGG TTCGTCGCGA TGCGTTCGGC GTTGATCGAA
CTCGGGCGCT GGAACGACAC GCTCGTGATG ACGTATGCGG AATTCGGCCG GCGCGCGCGA
GAGAACCAGA GCAACGGGAC GGATCACGGC ACGGCCGCTC CGCATTTCGT GATGGGCGGG
CGCGTGCGCG GCGGGCTGTA CGGCGCGCCG CCTGCGCTCA CCGCGCTCGA CGGCAACGGC
AACCTGCCCG TCGCCGTCGA TTTCCGGCAG CTCTATGCGA CCGTGCTCGG CCCGTGGTGG
GGGCTTGACG CGACGAGCGT GCTCAAGCGG CGCTTCGAGC CGTTGCCGCT GCTGCGTGCC
TGA
 
Protein sequence
MKRRDFLALA SLAGAAGVSL SMPHAFAAAS AASGAKGAIG ANGAIDMAGA ARYSNLLILV 
ELKGGNDGLN TVIPYADPLY RTLRPTIGVK REQVVQLDER AALHPALEPL VPIWRDGRLA
IVDGVGYPQP NLSHFRSIEI WDTASRADEY LREGWLTRAF AQAGVPPGFA ADGIVLGSAE
MGPLANGARA IALVNPAQFA RAARLVQPVS LREQNPALAH VIDIENDIVK AADRLRPHAG
TPALATAFPG GPFGASVKTA MQVLAACDTP QRTPAPGQGV AALRLTLNGF DTHQNQPGQQ
AGLLKQLALG FVAMRSALIE LGRWNDTLVM TYAEFGRRAR ENQSNGTDHG TAAPHFVMGG
RVRGGLYGAP PALTALDGNG NLPVAVDFRQ LYATVLGPWW GLDATSVLKR RFEPLPLLRA