Gene BTH_I0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0220 
Symbol 
ID3849768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp252601 
End bp254229 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID637839893 
Productdipeptide ABC transporter, periplasmic didpeptide-binding protein 
Protein accessionYP_440778 
Protein GI83721240 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.63568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATA ACCGTCTGTT GCGCGCACTG CGTGTTACCG CCATCGCGGG CGTTGCAGCG 
GCATCGTTTG GCGTCGCGGG TTCTGCATTC GCACAGATCC CGAACAAAAC GCTCGTCTAC
TGCTCAGAAG GCAGCCCGGC GGGCTTCGAT TCCGCGCAGT TCACGACGGG CGTCGATTTC
ACCGCGTCAA CGTTCCCGAT CTACAACCGC CTCGTCGAGT TCGAGCGCGG CGGCACGAAG
GTCGAGCCCG GCCTCGCCGA GAAGTGGGAC ATCTCGCCCG ACGGCAAGGT CTACACGTTC
CATCTGCGCC ACGGCGTCAA GTTCCATACG ACCGATTTCT TCAAGCCGAC GCGCGAATTC
AACGCGGACG ACGTCGCGTT CACGTTCGAG CGGATGATCG ATCCGAACCA GCCGTTTCGC
AAGGCGTATC CGGTGTCGTT CCCGTACTTC ACCGACATGG GCCTGGACAA GCTGATCGTG
AAGATCGAGA AGGTCGATCC GTATACGGTC CGCTTCACGC TGAAGGAGCC GAACGCGCCG
TTCATCCAGA ACCTCGCGAT GGAATACGCG TCGATCCTCT CCGCCGAATA CGCGGACCAG
CTGATGAAGG CGGGCAAGGC CGCCGACATC AATCAGAAGC CGATCGGCAC GGGCCCGTTC
ATCTTCCGCA GCTACACGAA GGACGCGACG ATCCGCTTCG ACGGCAATCC TGATTATTGG
AAGAAGGGCG CGGTGAAGAT CTCGAAGCTG ATCTTCTCGA TCACGCCCGA TCCGGGCGTG
CGCGTGCAGA AGATCAAGCG CAACGAGTGC CAGGTGATGA GCTATCCGCG GCCCGCCGAC
ATCGCGACGC TGAAGGCCGA TCCGAACGTC GACATGCCGT CGCTGCCGGG CTTCAACCTC
GGCTACCTCG CGTACAACGT GCAGCACAAG CCGGTCGACA AGCTCGAGGT GCGCCAGGCG
CTCGACATGG CGATCAACAA GAAGGCGATC CTCGAATCCG TCTATCAGGG CGCGGGGCAG
GCGGCGAGCG CGCCGATGCC GCCGACCCAA TGGTCGTACG ACAAGAACCT GAAGGCCGCC
GCCTACGATC CGGCGAAGGC GAAGGCGCTG CTCGCGAAGG CGGGCTACCC GAACGGCTTC
CCGATCACAC TGTGGGCGAT GCCCGTGCAG CGCCCGTACA ACCCGAACGC GAAGCTGATG
GCCGAGATGA TCCAGGCCGA CTGGGCGAAG ATCGGCGTGC AGGCGAAGAT CGTCACGTAC
GAGTGGGGCG AGTACATCAA GCGCGCGCAT GCGGGCGAGC ACGACACGAT GCTGATCGGC
TGGAACGGCG ACAACGGCGA CCCCGACAAC TGGCTCGGCA CGCTGCTCGG CTGCGAGGCG
GTCAAGGGCA ACAACTTCTC CGAGTGGTGC TACAAGCCGT TCGACGAGCT GATCCAGAAG
GGCCGCGTGA CGACCTCGCA GGATGCCCGC GCGAAGATTT ACATGCAGGC GCAGCAGATC
TTCGCGCAAC AACTGCCGTT TTCGCCGATC GCGAACTCGA CCGTCTATCA GCCGGTGCGC
AAGAACGTCG TCGACATGCG GATCGAGCCG CTCGGCTATG CGCGCTTCGA CGGCGTCAGC
GTGAAATAA
 
Protein sequence
MEHNRLLRAL RVTAIAGVAA ASFGVAGSAF AQIPNKTLVY CSEGSPAGFD SAQFTTGVDF 
TASTFPIYNR LVEFERGGTK VEPGLAEKWD ISPDGKVYTF HLRHGVKFHT TDFFKPTREF
NADDVAFTFE RMIDPNQPFR KAYPVSFPYF TDMGLDKLIV KIEKVDPYTV RFTLKEPNAP
FIQNLAMEYA SILSAEYADQ LMKAGKAADI NQKPIGTGPF IFRSYTKDAT IRFDGNPDYW
KKGAVKISKL IFSITPDPGV RVQKIKRNEC QVMSYPRPAD IATLKADPNV DMPSLPGFNL
GYLAYNVQHK PVDKLEVRQA LDMAINKKAI LESVYQGAGQ AASAPMPPTQ WSYDKNLKAA
AYDPAKAKAL LAKAGYPNGF PITLWAMPVQ RPYNPNAKLM AEMIQADWAK IGVQAKIVTY
EWGEYIKRAH AGEHDTMLIG WNGDNGDPDN WLGTLLGCEA VKGNNFSEWC YKPFDELIQK
GRVTTSQDAR AKIYMQAQQI FAQQLPFSPI ANSTVYQPVR KNVVDMRIEP LGYARFDGVS
VK