Gene Information |
Locus tag | BTH_I1598 |
Symbol | |
ID | 3847231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1796067 |
End bp | 1798403 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841268 |
Product | TonB-dependent receptor |
Protein accession | YP_442137 |
Protein GI | 83718695 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0220938 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCATG AACTTGCCGC GCACGTCGCG CGCACGCGGC TCGCGGCCGC CTGCGTCGCG GCGTTCGCCT GGCCCGCCGC GCACGCCGTC ACGACGGGCG CCGCCGTCCC TGCCGATTCA ACGTCCGCCG CCGCTGCCGA GACGACCGCA TCCGGGAAAA CCCTGGATAT CGTCAGGGTG ACCGCGCAGC GCCCCGCATT CGCGTCCGAC ACGCCCGGCG TCGTCGAGGC GCTCACGCGC GAGCAGATCG ATTCGCACGT CAACGTGACG ACCGAAGACG CGCTCAAGTA CGCGCCGAAC CTGATGGTGC GCCGGCGCTA TATCGGCGAT CGCAACTCCG TGTTCGCCGG CCGCGATTTC AACGAGTTGC AGAGCGCGCG CGGACTCGTT TACGCGGACG GCATCCTCCT GTCGAATCTG CTCGGCTCCA GTTACTCGTA TCCGCCGCGT TGGTCGCTGA TCCAGCCCGA CGACATCGCG CGCGTCGACG TGCTGTACGG CCCGTTCTCC GCGCTCTACC CGGGCAATGC GATCGGCTCG ACCGTGCAGA TCACGACGCG CAAGCCGGAT CGGCTCGAGG CGTCGGTGTC GACGCAGTTC TTCACGCAGC GCTATCGCGA CGGCTACGGC TTTGCCGACA GCTTCGGCGG CAATCACCAG ACCGCGCGCG TCGCCGACCG CGTCGGGCGC TTCTGGTATG CGCTGTCGCT CGACCGGCTC GAGAACGACA GCCAGCCGAT GCAATACGCG AGCCCGAATG GCACGTTCGA TCCGCGGCTC GGCGCGAGCG TGCCGGTGAC GGGCGCCGTT TCCGACATCG GCCCGAACGG CCGGCCTCGG ACGATCGTCG GCGCGCAGAC GATCGAGCGC ACCGAGCAGC TCAACGAGAC GCTGCGCTTC GGCTATGCGT TCACCGACCA CGTCGATGCG ACGGTTACGC TCGGCCACTG GGAGAATCAC TACCGGCAGC ACGGCGACAC GTTCCTGCGC GACGCGGCGG GCAACCCGGT ATACGGCGGC AACGTGTCGT TCGGCGGGCG CAGCTACACG GTGTCGCCGA CCGCGTTCGC GCCGCAGACC GGCGACCAGG AGAACTGGCT GTACGGGCTC GGGCTCGACG CGCGTCTCGC ATCAGGCTGG AAGCTGTCGG CGACCGCGTC CGCGTACGAG GTGTCGCGCG ACGTGCTGCG CAGCGCGTCC GGCGCGCCGA CCGGCGCGTG GGACGGCGGC CCGGGCACGG TATTCCATGG CGACGGCACC GGCTGGCGCA CCGTCGATTT GCGAGCGGAG TCGCCCGACG TGCGCGGGCA CCGCTTCTCG TTCGGCTATC ACTTCGACAC CTATTTCCTG CGCAACGCGA CCTACAACAC GGCGGACTGG CAAAACGCCG TGCCGACGAC GCTTGCGAAC CGTTATCGCG GCAACACGCG CACGCAGGCG CTGTACGCGC AAGACGCGTG GCGTTTCGCG CCCGGCTGGC TCGCGACGCT CGGCCTGCGC TACGAACGAT GGGATGCATA CGGCGGCCAG CTCGGCAACG CGAACGCGAC GCTCGGCTAC GCCGGCCGTG GCGCGACCGC GCTGTCGCCG AAGCTCGCGC TCGAATGGCA GCCAACGGAC GCATGGCGCC TGCGGCTGTC GTTCGCGACG GGCACGCGCT TTCCGACCGT GGCCGAACTG TTCCAGGGCA CGATCTCGAA CAACGCGATC GTCAACAACA ACCCGAACCT GCAACCGGAA AAGGCGATCG ACTGGGACTT CACGGCCGAG CGCGACGTCG GCTTCGGCGT CGTGCGCACG 
AGCGTGTTCC AGAGCGATCT GCGCAATTCG ATCTACAGCC AGACGACGGT CGCGGGCGCT TCGACGTACA CGAACATCTC GAACGTCGAC CGCGTGCGGG TGCGCGGCGT CGAACTCGCG TTTTCAGGGC AGGACGTCGC GATCAAGGGG CTCGACGTTG ACGCGAACGT GTCCGCGACG AATGCGCAGA CGCTCGCCGA TGCGGCGAAT CCGAACTACG TCGGCGCGCG TTGGCCGCGG ATTCCACGGA TGCGCGCGAA CTTGCTCGCG TCGTACCGCT TCGGCGAGCA CTGGATGACG AGCGTCGGCG TGCGCTATTC GGGGCGGCAG TACAACGCGC TCGACAACAG CGACGTGAAC CCGAACGTGT ACGGCGGCAC CAGTTCGTTT GCGGTCGTCG ACCTCAAGGC GCGCTACCGG TTCGATCGGC ACTGGCTCGC GTCGTTCGGC ATCGACAACG TGACCGATCG CCGCTACTAC GTGTTTCACC CTTATCCAGG CCGCACTTTT TATGGAGAGT TGAAATGGTC GCTGTGA
|
Protein sequence | MSHELAAHVA RTRLAAACVA AFAWPAAHAV TTGAAVPADS TSAAAAETTA SGKTLDIVRV TAQRPAFASD TPGVVEALTR EQIDSHVNVT TEDALKYAPN LMVRRRYIGD RNSVFAGRDF NELQSARGLV YADGILLSNL LGSSYSYPPR WSLIQPDDIA RVDVLYGPFS ALYPGNAIGS TVQITTRKPD RLEASVSTQF FTQRYRDGYG FADSFGGNHQ TARVADRVGR FWYALSLDRL ENDSQPMQYA SPNGTFDPRL GASVPVTGAV SDIGPNGRPR TIVGAQTIER TEQLNETLRF GYAFTDHVDA TVTLGHWENH YRQHGDTFLR DAAGNPVYGG NVSFGGRSYT VSPTAFAPQT GDQENWLYGL GLDARLASGW KLSATASAYE VSRDVLRSAS GAPTGAWDGG PGTVFHGDGT GWRTVDLRAE SPDVRGHRFS FGYHFDTYFL RNATYNTADW QNAVPTTLAN RYRGNTRTQA LYAQDAWRFA PGWLATLGLR YERWDAYGGQ LGNANATLGY AGRGATALSP KLALEWQPTD AWRLRLSFAT GTRFPTVAEL FQGTISNNAI VNNNPNLQPE KAIDWDFTAE RDVGFGVVRT SVFQSDLRNS IYSQTTVAGA STYTNISNVD RVRVRGVELA FSGQDVAIKG LDVDANVSAT NAQTLADAAN PNYVGARWPR IPRMRANLLA SYRFGEHWMT SVGVRYSGRQ YNALDNSDVN PNVYGGTSSF AVVDLKARYR FDRHWLASFG IDNVTDRRYY VFHPYPGRTF YGELKWSL
|
| |
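The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(1796067, 1798403)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
    # First 10-base block of the gene sequence, used only as a small demo input.
    print(gc_fraction("ATGTCCCATG"))
```

Under those assumptions the coordinates give 2337 bp and 778 aa, matching the Gene Length and Protein Length fields. Passing the full Gene sequence field to `gc_fraction` would give a value to compare with the reported GC content.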