Gene BMA10229_A3311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A3311 
Symbol 
ID4791967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp3362564 
End bp3363778 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_001029247 
Protein GI124386290 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00468101 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TTCTCGCAGC CGTCGGATTG TCGCTGATCC TCCTGTCGGC CGCCGCGAAC 
GCGGCGGTGC CGTCGCTGCA ACAAATCCAG CAATCGATCG CGCAAGGCAA CTGGCAGCGC
GCCGATGCGC AGCTCTCGCA AGTGATCGAC GCGTACCCGG ACAACGCGCG CGCCCGCTAT
CTGTACGGCC AGGTGCTCGA CCGCGAAGGC CGCCCCGCCG AGGCGCTCGC GCAGATCGAA
CGGGCGAAGT CGCTCGATCC GCAACTGCGC TTCACCGATC CGTCGCGCTT CGCGCAAACT
GAAGCGCGCG TGCGGGCCGA CGCGCGCCGC GCGACGGCCG CGCAGGACTC GCGCTCGGCG
ACCTCGGGCG GCATGCTCGC CGCGCCGCAG GCGCCGGCCC AGGCCCGCGC GCCATTCTCC
GCCGCCCCTG TCGCGCCTGC CGCGCCCGTG CATCGCGGCC CGTCGGTGGG TATGTGGATC
GGCTTCGCGG TGCTGCTCGG CGTGATCGTG ATCGTGCTGC GCAAAACGTT GCGCCGCGCG
CGCTCGGCGG ACGATCAGCG CGCCGACGAC GAACGCCGCG CGCAGTTGAA GCGCGCAACC
GACATCCTCA ACGAAGTGCG TCCGCTCAAG CTCGACGCGC GGCTGTCGAC GGCGCCGGGC
GCCGCCGCGC TCAACGGCGA GATCGAGGGG CTCGAAGCCC AGGCGCGCGA GCTCGTCGAG
ACCCTGTCGA ACGGCAAGAA TCCCGCGCCG CCGTACCGGC TCGACGAGTT GGAGAAACAG
TTCGCCAGCC TGAAGGCGCG CGTCGAGGGG CGCCCGGATC CGAACGCGGC CGCGCCGGGC
GGGCCTGGCC AAACGGGCTC GGTATTTGCT CAGGAGGCCG ATCGGTTGAC GGGGGCACAG
GGCCAGCCGC CCTACTCGCC GTATCCGCCG CAGCCGCAAC AGCCGCCGCC CGTCGTGATC
CAGCAAGGCG GCGGCGGCTT CGGCGGCGGC ATGGGCGGGC TGCTCACGGG CGTCCTGCTC
GGCCAGGCGA TGTCGCACGG CCGCGACCGC GTGATCGAGC GCGACGTGAT CGTCGACGAC
GAAGCGCGGC GCCGCGCGGG CGCCGATCCC GGCATCGACT TCGGCCAGGG CGACAGCTGG
GACAGCGGCG GCTCGGACGG CGGCGGGAGC ATCGATCTCG GCAGCAGCGG CGACGATTGG
AGCAACAACG GTTGA
 
Protein sequence
MKKLLAAVGL SLILLSAAAN AAVPSLQQIQ QSIAQGNWQR ADAQLSQVID AYPDNARARY 
LYGQVLDREG RPAEALAQIE RAKSLDPQLR FTDPSRFAQT EARVRADARR ATAAQDSRSA
TSGGMLAAPQ APAQARAPFS AAPVAPAAPV HRGPSVGMWI GFAVLLGVIV IVLRKTLRRA
RSADDQRADD ERRAQLKRAT DILNEVRPLK LDARLSTAPG AAALNGEIEG LEAQARELVE
TLSNGKNPAP PYRLDELEKQ FASLKARVEG RPDPNAAAPG GPGQTGSVFA QEADRLTGAQ
GQPPYSPYPP QPQQPPPVVI QQGGGGFGGG MGGLLTGVLL GQAMSHGRDR VIERDVIVDD
EARRRAGADP GIDFGQGDSW DSGGSDGGGS IDLGSSGDDW SNNG