Gene BMAA2004 details

Gene Information

Locus tag: BMAA2004
Symbol:
ID: 3087289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 2194001
End bp: 2195197
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 70%
IMG OID: 637565869
Product: hypothetical protein
Protein accession: YP_106523
Protein GI: 53716072
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCCG TGGTCGCCGG CGCGCTCGTG ATGAGCGCGG CAATGGGCGT GCGGCAAACC 
TTCGGCCTTT TCATCGGGCC ATTCTCGTTC GACCACGGTT TGCCGGTGAC GACGATCGCG
TTCGCGATCG CGCTGCACAA CCTCGTCTGG GGCGCCGCGC AGCCGTTCGC CGGCGCGGCC
GCCGACCGCT ACGGCGCCGG GCCGCTCGTC GCGATCGGCG CGGTCGTGTT CGCGCTCGGC
CTCGCGATCG CCGCGGTCGT GCCGACGGGC CCGATGCTCG TGCTCGGCAT AGGCGTGCTC
GTCGGCATCG GGATCAGCTG CACGAGTTTC GGCGTGGTGC TGACCGCGGT CGGCCGCGGC
GCGCCTGCCG AGAAGCGCAG CATGGCGATG GGCATCGCGA GCGCGGGCGG CTCGCTCGGC
CAGGTGGCGC TCGTGCCGAT CGCGCAGTGG TTCACGTCGC ATTCGGGCAC GATGGTGTCG
CTGTTCGTGC TGGCCGGCTG CATGATCGCG ATCGCGCCGC TCGGCGTGCT GCTCGACAAG
AACACGCGCG GCAGCCACGT GGTCGCGCAC GAGACGGCGA CGATATCGCT GAAGGAGACG
CTGTCGTACG CGGTGCGGCA TCGCGGCTAT TGCCTGCTGA CGCTCGGCTT CTTCACCTGC
GGGTTCCAGC TCGCGTTCAT CGGCACGCAC TTGCCGAACT ATCTGCTGCT CTGCCACATG
CCGGCCGGGC TCGGCGCGAC CGCGCTCGCG CTGATCGGCC TGTTCAACAT GGCGGGCAGC
TGGGCGTGCG GCTGGCTCGG CGGGCGCTAC CGGCAGCAGC ACGTGCTCGG CTGGCTGTAC
CTGATTCGCG GCGCGGCGAT CGCGCTGTTC TTCCTCGGGC CGAAGTCGAA TGCGTCGGTC
GTCGTCTTCG CGGCGATCAT GGGGCTCACG TGGCTCGGCA CCGTGCCGCT CACGAGCGGG
CTCGTCGCGA AGGTGTTCGG CACGCGGCAT CTGGGCACGC TGTTCGGCGT GTGCTTCCTG
AGCCATCAGG TCGGCTCGTT CCTCGGCTCG TGGCTCGGCG GCTACGTGTT CGACGCGACG
GGATCGTACT CGCTGATCTG GGGCGCGACG GCGCTCGCCG GGCTGTTCGC GGCACTGCTG
CATTTCCCGA TCAACGACGC GCCCGCGCAT GGCGGCGCGG CCGTCGCGCG GGCTTGA
 
Protein sequence
MIAVVAGALV MSAAMGVRQT FGLFIGPFSF DHGLPVTTIA FAIALHNLVW GAAQPFAGAA 
ADRYGAGPLV AIGAVVFALG LAIAAVVPTG PMLVLGIGVL VGIGISCTSF GVVLTAVGRG
APAEKRSMAM GIASAGGSLG QVALVPIAQW FTSHSGTMVS LFVLAGCMIA IAPLGVLLDK
NTRGSHVVAH ETATISLKET LSYAVRHRGY CLLTLGFFTC GFQLAFIGTH LPNYLLLCHM
PAGLGATALA LIGLFNMAGS WACGWLGGRY RQQHVLGWLY LIRGAAIALF FLGPKSNASV
VVFAAIMGLT WLGTVPLTSG LVAKVFGTRH LGTLFGVCFL SHQVGSFLGS WLGGYVFDAT
GSYSLIWGAT ALAGLFAALL HFPINDAPAH GGAAVARA