Gene BMAA1471 details

Gene Information

Locus tag: BMAA1471
Symbol:
ID: 3086149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 1600702
End bp: 1601763
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 67%
IMG OID: 637565359
Product: putative lipoprotein
Protein accession: YP_106072
Protein GI: 53717248
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG5042] Purine nucleoside permease
TIGRFAM ID:
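
The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. The following is a minimal sketch of that arithmetic under those two assumptions. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1600702, 1601763)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both values match the Gene Length and Protein Length fields of this record.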


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACTC GCTCTATATT TTCCGCAGCC GTATTTTCGC TTGCCGCTTG CGCGATGGCG 
CCGTCCGTCG CGCAGAACAA CGGTGAAGCG TTCGCCGAAG CCGGTGCCCA GGGCCGCCCG
GCGAAGGTGA TGATCATCTC GATGTTCGGC CCGGAAGGCC AGGCGTGGCT CGATCGCCTC
GGCCCGTGGA AAGACGTCGC GGTGCCCGGC TTGTCGCCCG ACTATCCGAA CGTGCATTGC
AACAAGCAGG ACGTGTGCGT CGTCACGACG GGCATGGGCT ACGCGAACGC CGCGTCGACG
ATCATGGCGC TCACGTTCTC GCGCCGCTTC GATCTGCGGC GGACGTATTT CCTGATCTCG
GGCATCGCGG GCGTCGATCC GGCGCGCGGC ACGCTCGGCA CCGCCGCGTG GGCGAAGTAC
CTCGTCGATT TCGGCCTGCA ATGGGAGCTC GACGCGCGCG AAATTCCCGC GGGCTGGAAT
GGCGGCTATC TCGGCATCAA CACGAAGAGC CCGAGCGACA AGCCGCCGCT CGACTACCGC
ACCGAAGTGT TCGAGCTGAA CGGCAAGCTC GCGGACACCG CGTATGCGCT GTCGCGCAAC
GTGCAGCTCG CCGACAGCGC GCAGGCGCAG GCCGCGCGCG CGAAGTTCAA CTATGCGCCC
GCGAACCAGC CGCCCGTCGT GACCCGCTGC GATACGTCGT CGGGCAACAC GTGGTTCTCC
GGCACGCTGC TCGGCGAACG CGCGCGCCAG TGGACGAAGC TCCTCACCGA CAACAAGGGC
ACCTACTGCA TGACCGCGCA GGAGGACAAC GCGACGTTCG AGGCGCTCAA GCGCGCGGCG
AGCGTGAACC GCGTCGATTT GAGCCGCGTC GCGGTGCTGC GCACCGGCTC GGATTTCGAT
CGCCCGTATC AAGGCCAGAC GAGCGTCGAT AATCTGCTGA ACTACGCCGA CCAGGGCGGT
TTTCCGCTCG CGACCGAGAA CCTGTATCGC GCGGGCAATC CGCTCGTGCA GGACATCGCC
ACGCACTGGG GCGAGTGGAA GGACGGCGTG CCGCGCCGCT GA
 
Protein sequence
MLTRSIFSAA VFSLAACAMA PSVAQNNGEA FAEAGAQGRP AKVMIISMFG PEGQAWLDRL 
GPWKDVAVPG LSPDYPNVHC NKQDVCVVTT GMGYANAAST IMALTFSRRF DLRRTYFLIS
GIAGVDPARG TLGTAAWAKY LVDFGLQWEL DAREIPAGWN GGYLGINTKS PSDKPPLDYR
TEVFELNGKL ADTAYALSRN VQLADSAQAQ AARAKFNYAP ANQPPVVTRC DTSSGNTWFS
GTLLGERARQ WTKLLTDNKG TYCMTAQEDN ATFEALKRAA SVNRVDLSRV AVLRTGSDFD
RPYQGQTSVD NLLNYADQGG FPLATENLYR AGNPLVQDIA THWGEWKDGV PRR
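
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting, compute GC content, and translate a coding sequence. It assumes an unambiguous DNA alphabet (A, C, G, T) and uses the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, not in the codon assignments). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical. The example input is only the first line of the gene sequence, not the full gene.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCTGACTC GCTCTATATT TTCCGCAGCC GTATTTTCGC TTGCCGCTTG CGCGATGGCG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MLTRSIFSAAVFSLAACAMA
```

The translated fragment matches the first 20 residues of the protein sequence. Applying `clean_sequence` to the full gene block and passing the result to `gc_content` gives a value that can be compared against the GC content field of this record.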