Gene BMA10247_A2001 details

Gene Information

Locus tag: BMA10247_A2001
Symbol:
ID: 4890647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10247
Kingdom: Bacteria
Replicon accession: NC_009079
Strand:
Start bp: 1930050
End bp: 1931708
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 72%
IMG OID: 640148265
Product: hypothetical protein
Protein accession: YP_001079177
Protein GI: 126447196
COG category: [S] Function unknown
COG ID: [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
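
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1930050, 1931708)
print(length)                     # 1659
print(protein_length_aa(length))  # 552
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields reported here.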


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.66757
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG 
CTGTCGAGCG CGGCGATGAC GAGCTCCGCC GTATCGACCA CCGGCACGAC GAACACGGTG
GCGTCGGGCG GCGCGGCCGC GGGCGCGCCG AGCTTCGCGC CGCCGGCCGC CGACGCGTTT
CCCCAAGCCG ACGGCGCGCC CCGCAACCCC GCGGTGCTGC AATTCCCGGT GCCGGGCGGC
GCGCCCGCGC CCGCCGACGC GCGGGTGGCC GCGCCCGTCG TCTATAGCGC GCAGGGCGAG
CAGGCCGCGA TCATGAAAGC AGGCCTGCAG CAGGCGAGCT GGAACAACCC GTTCGTGTCG
CACGCGCTGC CCGCGGTGCT GCAACTGCAG CGCCACCTCG CGGCCGGCCC GCTCAATCAG
GCCGCGATCC GCACGCAGCT CGGCCTCGAG GTGCGGCTCT ACCGCGAGCG GCTCGCCGCC
TCCGGCTGCG AATGGGAGCA GATCCGCGAC GCATCGTACC TGCTCTGCAC GTATCTCGAC
GAAACCGTCA ACGACGCGGC GCGCGAGCAC GCGCAAGTCG TCTACGACGG CGAGCGCAGC
CTGCTCGTCG AATTCCACGA CGACGCGTGG GGCGGCGAGG ACGCGTTCGC CGACCTGTCG
CGCTGGATGA AGACCGAGCC GCCGCCGATT CCGCTTCTGT CGTTCTACGA ACTGATCCTG
TCGCTCGGCT GGCAGGGCCG CTACCGCGTG CTCGACCGCG TCGACGTGCT GCTGCAGGAT
CTGCGCTCGC AACTGCACGC GCTGATCTGG CATCACGTGC CGCCCGAGCC GCTCGGCACC
GAGCTCGTCG CGCCCGCGAA GCGGCGCCGC TCGTGGTGGA CGGCCGGGCG CGCGGCGGCC
GTCGCGCTCG GCGTGCTGGT GCTCGCGTAC GGCGCGATCA GCTTCTGGCT CGATTCGCAG
GGCCGCCCGA TCCGCAACGC GCTCGCCGCG TGGATGCCGC CCACGCGCAC GATCAACATC
GCCGAGACGC TGCCGCCGCC GCTGCCGCAG ATTCTCACCG AAGGGTGGCT CACCGCGTAC
AAGCATCCGC AAGGATGGCT GCTCGTGTTC AAGAGCGACG GCGCGTTCGA CGTCGGCAAG
GCGAACGTGC GGGCGGACTT CATGCACAAC ATCGAGCGGC TCGGCCTCGC GTTCGCGCCG
TGGCCGGGCG ACCTCGAGGT GATCGGCCAC ACCGATTCGC GGCCGATCCG CACGAGCGAG
TTCCCGGACA ACCAGGCGCT GTCCGAAGCG CGGGCGCGCA ACGTCGCCGA CGAACTGCGC
AAGACCGCGC TGCCGGGCGG CGCGCGCGCG CCGGAGAACG CGGTGCAGCG CAACATCGAG
TACTCGGGGC GCGGCGACGC GCAGCCGATC GACACCGCGA AGACGGCCGC CGCGTACGAG
CGCAACCGCC GCGTCGACGT GCTGTGGAAG GTGATTCCCG ACGGCGCGCA GCAATCGGGC
CGCAGCCTGA ACCTGCAGCA GCCGGAGAAG CCCGGGCAGG TGCCGATGCG TCCGGCGATG
CCGGAGGGCG TGGAGATCGC GCCTGACGGG CAACTGCCGT ATGCGACGTC AACCACGATG
CCAGCAACGC GACCGACCAC GGAGGGCCGT CAGCCATGA
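
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(blocked)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, as sample input.
first_line = "ATGCTCGATC GCGCGGCGGC GAATCCGGAT CTCGTGACGC GCGTGCCGAC GCTGTCGTCG"
print(len(clean_sequence(first_line)))     # 60
print(round(gc_fraction(first_line), 2))
```

Passing the full block of sequence lines to `gc_fraction` gives a value to compare against the 72% reported in the record.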
 
Protein sequence
MLDRAAANPD LVTRVPTLSS LSSAAMTSSA VSTTGTTNTV ASGGAAAGAP SFAPPAADAF 
PQADGAPRNP AVLQFPVPGG APAPADARVA APVVYSAQGE QAAIMKAGLQ QASWNNPFVS
HALPAVLQLQ RHLAAGPLNQ AAIRTQLGLE VRLYRERLAA SGCEWEQIRD ASYLLCTYLD
ETVNDAAREH AQVVYDGERS LLVEFHDDAW GGEDAFADLS RWMKTEPPPI PLLSFYELIL
SLGWQGRYRV LDRVDVLLQD LRSQLHALIW HHVPPEPLGT ELVAPAKRRR SWWTAGRAAA
VALGVLVLAY GAISFWLDSQ GRPIRNALAA WMPPTRTINI AETLPPPLPQ ILTEGWLTAY
KHPQGWLLVF KSDGAFDVGK ANVRADFMHN IERLGLAFAP WPGDLEVIGH TDSRPIRTSE
FPDNQALSEA RARNVADELR KTALPGGARA PENAVQRNIE YSGRGDAQPI DTAKTAAAYE
RNRRVDVLWK VIPDGAQQSG RSLNLQQPEK PGQVPMRPAM PEGVEIAPDG QLPYATSTTM
PATRPTTEGR QP
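
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds a codon table from the amino-acid assignments of NCBI translation table 11, which match the standard code and differ from it only in the set of permitted start codons. The `translate` function is a hypothetical helper: it stops at the first stop codon and does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCTCGATC GCGCGGCGGC GAATCCGGAT"))  # MLDRAAANPD
print(translate("CAGCCATGA"))                         # QP
```

The two outputs match the first ten and the last two residues of the protein sequence listed above.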