Gene BMA10229_A1000 details

Gene Information

Locus tag: BMA10229_A1000
Symbol:
ID: 4791582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 1019465
End bp: 1021801
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: TonB-dependent receptor
Protein accession: YP_001026988
Protein GI: 124383487
COG category:
COG ID:
TIGRFAM ID:
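
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1019465
end_bp = 1021801
gene_length_bp = 2337
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print True: 2337 bp corresponds to 779 codons, one of which is the stop codon.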


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCCATC AACTTGCCGC GCCCGGCGCG CGCAGGCGGC TCGCGGCCGC CTGCGCCGCG 
GCGCTCGCCT GGCCCGCCGC CCACGCGGCT TCGACGGCCG CCGCCGTGCC TGCCGATTCA
ACGCCGGCCG CCGCCGCGGA GATGACCGCA TCCGGGAAAA CCTTGGATAC CGTCAAGGTC
ACCGCGCAGC GCGCCGCGTT CGCGCCCGAC ACGCCCGGCG TCGTCGAGGC GCTCACGCGC
GAGCAGATCG ATTCGCACGT CAACGTGACG ACCGAAGACG CGCTCAAGTA CGCGCCGAAC
CTGATGGTGC GCCGGCGCTA CATCGGCGAC CGCAATTCAG TGTTCGCCGG CCGAGACTTC
AATGAGCTGC AGAGCGCGCG CGGACTCGTC TATGCGGACG GCATCCTGCT GTCCAATCTG
CTCGGTTCCA GCTATTCGTA TCCGCCGCGC TGGTCGCTGA TCCAGCCCGA CGACATCGCA
CGCGTGGACG TGCTGTACGG CCCGTTTTCC GCGCTCTACC CGGGCAACGC GATCGGCTCG
ACCGTGCAGA TCACCACGCA CAAGCCGCAG CGGCTCGAGG CGTCGGTGTC GACGCAGTTC
TTCACGCAGC GCTATCGCGA CGGCTACGGC TTCGCCGACA GCTTCGGCGG CAATCACCAG
AGCGCGCGCA TCGCCGATCG CGTCGGACGT TTCTGGTATG CGCTGTCGCT CGACCGGCTC
GAGAACGACA GCCAGCCGAT GCAATACGCG AGCCCGAACG GCGCGTTCGA TCCGAGGCTC
GGCGCACCCG TGCCGGTGAC GGGGGCCGTC TCCGACATCG GGCCGAACGG CCGGCCGCGG
ACGATCGTCG GCGCGCAGAC GATCGAGCGC ACCGAGCAGC TCAACGAGAC GTTGCGCTTC
GGCTACGCGT TCACCGATCA CGTCGACGCG ACGGTCACGC TCGGCCACTG GGAGAATCAC
TACCGGCAGC ACGGCGACAC GTTCCTGCGC GACGCGGCGG GCAACCCGGT GTACGGCGGC
AACGTGTCGT TCGGCGGGCG CAACTACACG GTGTCGCCGG GCGCGTTCGC GCCGCAGACG
GGCGACCAGG AGAACTGGCT GTACGGGCTC GGGCTCGACG CGCGGCTCGC GTCCGGCTGG
AAGCTGTCGG CCATCGCGTC CGCCTACGAG GTGTCGCGCG ACGTGCTGCG CAGCGCGTCC
GGCGCACCGC CCGGCGCCTG GGACGGCGGG CCGGGCACGG TGTTCCATGG CGACGGCACC
GGCTGGCGCA CCGTCGACTT GCGCGCGGAG TCGCCCGACG TGCGCGGGCA CCGTTTCTCG
TTCGGCTACC ACTTCGATAC GTATTTCCTG CGCAACGCGA CCTACAACAC GGCCGACTGG
CAAAACGCGG TGCCGACGAC GCTCGTGAAC CGCTATCGCG GCAACACGCG CACGCAGGCG
CTGTATGCGC AGGACGCGTG GCGCGTCGCG CCCGACTGGC TCGCGACGCT CGGCCTGCGC
TACGAGCGCT GGGATGCATA CGGCGGCGAG CTCGGCGGCG CGACCGCGAC GCTCGGCTAC
GCGGAGCGCG GCGCGACCGC GTTCTCGCCG AAGCTCTCGC TCGAATGGCA GCCGGCCAGC
GCGTGGCGCC TGCGGTTGTC GTTCGCGACG GGCACGCGCT TTCCGACGGT CGCCGAGCTG
TTCCAGGGCA CGATCTCGAA CAACGCGATC GTCAACAACA ACCCGAATCT GCAGCCGGAA
AAGGCGATCG ACTGGGATTT CACCGCCGAG CGCGACGTCG GCTTCGGCGT CGTGCGCGCG
AGCGTGTTCC AGAGCGATCT GCGCAATTCG ATCTACAGCC AGACCACGCT TGCCGGCGCG
TCGACGTACA CGAACGTCTC GAACGTCGAC CGCGTGCGGG TGCGCGGCGT CGAGCTCGCG
TTCTCGGGGC AGGACGTCGC GCTCAAGGGG CTCGACGTCG ACGCGAACGT GTCCGCGACG
AATGCGCAGA CGCTTGCCGA TGCGGCCAAT CCGAACTACG TCGGCGCGCG CTGGCCGCGG
ATTCCGCGGA TGCGCGCGAA CCTGCTCGCG TCGTACCGCT TCGACGAGCA TTGGATGACG
AGCGTCGGCG TTCGCTATTC GGGGCGGCAG TACAACGCGC TCGACAACAG CGACGTGAAT
CCGGGCGTAT ACGGCGGCAC CAGCTCGTTC ATGGTCGTCG ACCTGAAGGC GCGCTATCGG
TTCGATCGGC ACTGGCTCGC GTCGTTCGGC ATCGACAACG TGACCGATCG CCGCTACTAC
GTGTTCCATC CTTATCCGGG CCGCACTTTT TATGGAGAGT TGAAATGGTC GCTGTGA
 
Protein sequence
MFHQLAAPGA RRRLAAACAA ALAWPAAHAA STAAAVPADS TPAAAAEMTA SGKTLDTVKV 
TAQRAAFAPD TPGVVEALTR EQIDSHVNVT TEDALKYAPN LMVRRRYIGD RNSVFAGRDF
NELQSARGLV YADGILLSNL LGSSYSYPPR WSLIQPDDIA RVDVLYGPFS ALYPGNAIGS
TVQITTHKPQ RLEASVSTQF FTQRYRDGYG FADSFGGNHQ SARIADRVGR FWYALSLDRL
ENDSQPMQYA SPNGAFDPRL GAPVPVTGAV SDIGPNGRPR TIVGAQTIER TEQLNETLRF
GYAFTDHVDA TVTLGHWENH YRQHGDTFLR DAAGNPVYGG NVSFGGRNYT VSPGAFAPQT
GDQENWLYGL GLDARLASGW KLSAIASAYE VSRDVLRSAS GAPPGAWDGG PGTVFHGDGT
GWRTVDLRAE SPDVRGHRFS FGYHFDTYFL RNATYNTADW QNAVPTTLVN RYRGNTRTQA
LYAQDAWRVA PDWLATLGLR YERWDAYGGE LGGATATLGY AERGATAFSP KLSLEWQPAS
AWRLRLSFAT GTRFPTVAEL FQGTISNNAI VNNNPNLQPE KAIDWDFTAE RDVGFGVVRA
SVFQSDLRNS IYSQTTLAGA STYTNVSNVD RVRVRGVELA FSGQDVALKG LDVDANVSAT
NAQTLADAAN PNYVGARWPR IPRMRANLLA SYRFDEHWMT SVGVRYSGRQ YNALDNSDVN
PGVYGGTSSF MVVDLKARYR FDRHWLASFG IDNVTDRRYY VFHPYPGRTF YGELKWSL
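
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a minimal sketch, not the tool that produced this record. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons are permitted, and that is not modelled here). The names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first and last lines of the gene sequence are translated, to keep the example short.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGTTCCATC AACTTGCCGC GCCCGGCGCG CGCAGGCGGC TCGCGGCCGC CTGCGCCGCG"
last_line = "GTGTTCCATC CTTATCCGGG CCGCACTTTT TATGGAGAGT TGAAATGGTC GCTGTGA"

print(translate(first_line))  # MFHQLAAPGARRRLAAACAA
print(translate(last_line))   # VFHPYPGRTFYGELKWSL
```

The two outputs match the first 20 and last 18 residues of the protein sequence listed above. Passing the full gene sequence to `translate` would be the way to check the whole 778-residue protein.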