Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2793 |
Symbol | |
ID | 3847570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 3207918 |
End bp | 3209798 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637842461 |
Product | proline/betaine transporter |
Protein accession | YP_443305 |
Protein GI | 83719892 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0377478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTCTC GTTCAACGGC CTACGCCTGT TTAATTATTT CGGTCACCCG ACCGTTCGGC CCTGTGCCGC GACTGGCCGT CCGGAATTTC GCTCACAAGG AAGCGTGCCT TCGGGCAGCT TTTTTCGTTT CGACGCCCTC CAAGGGCATT CAACCGGCCG CGCGCCACGC GACGTGCCGG TTCGCCGACC GTCCCCGTGC GGCGGTCTTT TCTCACGAGT GCAAGTGCGC GACGACCACC GGGTCCGCGC CGATCGGTTT CCGCCATGGA TTAAGCGCTT CATGCGCCCC GCTCACGCTC GCGACCCTTG TCCGATCGCG AGCCGCACTC TCCGGCAAAC CGTGGCGAAC CGCGCCCTGT CGCGCGATTC GGCCGCCGTC ACAGGAGTTT TCGACCTTGA CTGCAACACC CGCCCCCTCT ACCCCGTCCA TCGCGCCCGC CGAAGCGGCG CCCGCCGCCG CGCACGAAAT CACCGTCGTC GACCAGGGGC TCCTGAAACG CGCCGTCGGC GCGATGGCGC TCGGCAACGC GATGGAATGG TTCGACTTCG GCGTCTACAG CTACATCGCC GTCACGCTCG GCCAGGTGTT CTTCCCGTCG AGCAGCCCGT CCGCGCAGTT GCTCGCGACG TTCGGCACGT TCGCCGCCGC ATTCCTCGTG CGCCCGCTCG GCGGAATGGT GTTCGGCCCG CTCGGCGACC GCATCGGCCG CCAGCGCGTG CTCGCGATGA CGATGATCAT GATGGCCGTC GGCACGTTCG CGATCGGCCT CATCCCGAGC TACGACTCGA TCGGCCTGCT CGCGCCCGCG CTGCTCCTCG TCGCGCGCCT CGTGCAGGGC TTCTCGACGG GCGGCGAATA CGGCGGCGCG GCGACCTTCA TCGCCGAGTT CTCGACCGAC AAGCGCCGCG GCTTCATGGG CAGCTTCCTA GAGTTCGGCA CGCTGATCGG CTATGTGATG GGCGCGGGCG TCGTCGCGCT GCTGACCGCC TCGCTGTCGC ACGACGCGCT GCTGTCGTGG GGCTGGCGCG TGCCGTTCCT GATCGCCGGC CCGCTCGGCC TGATCGGCCT GTACATCCGG ATGAAGCTCG AAGAAACGCC CGCGTTCAAG CGCCAGGCCG AGGCGCGCGA AGCGCAGGAC AAGGCGGTGC CGAAAGCGCA TTTCCGTCGG CAGCTCGCAC GGCACTGGCG CGCGCTGCTG CTGTGCGTCG GACTCGTGCT GATCTTCAAC GTCACCGACT ACATGGCGCT GTCGTACCTG CCGAGCTATC TATCGTCGAC GCTGCATTTC GACGAGGCGC ACGGCCTCGT GCTGATCCTG ATCGTGATGG TGCTGATGAT GCCGATGACG CTCGCCACGG GCCGCCTGTC GGACGCCGTC GGCCGCAAGC CGGTGATGCT CGCCGGCTGC ATCGGCCTGT TCGCGCTCGC GATTCCCGCG CTGCTGCTGA TCCGCACGGG CGAGACGTCA CTCGTGTTCG GCGGCCTGCT GATCCTCGGC GCGCTGCTGT CGTGCTTCAC GGGCGTGATG CCGTCGGCGC TGCCGGCGCT CTTCCCGACC GAGATCCGCT ACGGCGCGCT CGCGATCGGC TTCAACGTGT CGGTGTCGCT GTTCGGCGGC ACGACGCCGC TCGCCGCCGC GTGGCTCGTC GACGCGACGG GCAACCTGAT GATGCCCGCG TACTATCTGA TGGGCGCGGC CGTGATCGGC GCGATCTCGG TGATGGCGCT GCCCGAGAGC GCGCGCCAGC CGCTCAAGGG CTCGCCGCCC GCCGTCGCGT CGCACCGCGA GGCGCATGCG CTCGCGCGCG AGATCAAGCG CCGCGAGGCG GCCGAGCGCG ACGACGGCGG CTATGCGAGC GCTGCGGCGC TGCGCGCGTG A
|
Protein sequence | MLSRSTAYAC LIISVTRPFG PVPRLAVRNF AHKEACLRAA FFVSTPSKGI QPAARHATCR FADRPRAAVF SHECKCATTT GSAPIGFRHG LSASCAPLTL ATLVRSRAAL SGKPWRTAPC RAIRPPSQEF STLTATPAPS TPSIAPAEAA PAAAHEITVV DQGLLKRAVG AMALGNAMEW FDFGVYSYIA VTLGQVFFPS SSPSAQLLAT FGTFAAAFLV RPLGGMVFGP LGDRIGRQRV LAMTMIMMAV GTFAIGLIPS YDSIGLLAPA LLLVARLVQG FSTGGEYGGA ATFIAEFSTD KRRGFMGSFL EFGTLIGYVM GAGVVALLTA SLSHDALLSW GWRVPFLIAG PLGLIGLYIR MKLEETPAFK RQAEAREAQD KAVPKAHFRR QLARHWRALL LCVGLVLIFN VTDYMALSYL PSYLSSTLHF DEAHGLVLIL IVMVLMMPMT LATGRLSDAV GRKPVMLAGC IGLFALAIPA LLLIRTGETS LVFGGLLILG ALLSCFTGVM PSALPALFPT EIRYGALAIG FNVSVSLFGG TTPLAAAWLV DATGNLMMPA YYLMGAAVIG AISVMALPES ARQPLKGSPP AVASHREAHA LAREIKRREA AERDDGGYAS AAALRA
|
| |