Gene BMAA1900 details


Gene Information

Locus tag: BMAA1900
Symbol:
ID: 3086672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 2078641
End bp: 2081118
Gene Length: 2478 bp
Protein Length: 825 aa
Translation table: 11
GC content: 70%
IMG OID: 637565772
Product: pentapeptide repeat-containing protein
Protein accession: YP_106439
Protein GI: 53715998
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins; [COG5351] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
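
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, although both agree with the reported values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon is part of the CDS but not the protein.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2078641, 2081118)
print(length_bp)                  # 2478
print(protein_length(length_bp))  # 825
```

Both results match the Gene Length (2478 bp) and Protein Length (825 aa) fields of the record.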


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCG TCAAACCGCT TGCCATCAGC CCGCTGACCC GCGTGTACCG GATGCACGGC 
CGGGAGTATC TCGGCGTCGC CGCGCTGTTG ATCGCGACGC TCGGCGACGA GCCGAAACTG
CTGGCCGAAT CGGCGCTCTG GCGTCTGGCC GGCGACGAAC TGCGCGGCTA TCCGCTCGAC
ATGGCGCTGC CGAAGGCGTG TCCGGAGTTT CTCGTGTCCG GATACGCGTA CGGAAAGTAC
GCGAGCGATC CGCACGCGTG CGCGTGCGAA GTGGGCGTGC GCATTGCCGG CCTCGAGAAG
CGGCTGCGTG TCTGCGGCGA CCGGCAGTGG GCGGGCGCGC GCATCACCGC GCCGCGGCCG
TTCGAGCGGC TACCGATCGA CTGGGATCTC GCTTACGGCG GCGCGGGTTG CGCGGACAAT
CCGCGAGGCC GCGGCGCGCA CGCGCGGGAG GGCGCGCCGC GCGATCTGCC GAATGTCGAA
TACGCGCACA GCCCGATGCG CTTTGCGCAC GAGCAACCCG CGCCCGCCGG CTTTTGCCCG
GTCGACGCGG CATGGCCGGC GCGCGCCGGC CTGTACGGCG CGCTCGATCG GCAATGGCAG
GAAGAGGATT GTCCGGGCTT TCCGCGCACG CTCGATCCGC GCTACTTCAA CATCGCGCCG
GCCGATCAGC AACTGCCCGA GCTGCGGGCA TTCCCGGACG GCGCGCGCTA CGAACTGACG
CACATGCATC CGGACCACGC GACGCTCGCG GGAAACCTGC CCGCGCTGCG CGCGAGATCG
TTCGTGGTAC GTCGGGGCAG CGATGCGCCC GAGGAAATGC CGATGCGCTT GACGACCGCG
TGGTTCGTTC CGCATCGCGA ACGCGTGATC CTGATCTATC ACGGCGTCAC GCCCGTTCGC
GCGTTCGACG CGAGCGACGT GCAGACGGTG CTGTTCGGTG CGGAGGCGAG CGGGCACGCG
AGGCCCGCCG ACTGGTATCG GCAGGTGATC GAGTGGCGCA CGCGGGACGA CAGGGCGGCG
CTGTACGCGC TGCGCGACCG GGATCTGCTG CCCGAGCATG CGCTTGCGCC CGAAGCGGCG
GCGACGCCCG AGCCGACGCA GCAGAGCGCG AAGCAGCGGC AGCTTCGCGA GCGGTTGAGC
GTCTTTCCGG ATGCTCCGCG CGCACAGACG CCGGCGCCGG ATCGGCTGGC CGAATTCGTC
GAGCAGCAGC AAGCGCTCGC CGACGAAAAG CGCGCCGCGC TGGAAGCCAT GCGGCGGGAA
CTGGCGACCA GCGAAGTATT TTCGGTCGGC CGTCGGCGCG GCCCGCCCGG CCGGATCGCG
CCCGCGGACG AAGATCCCGC GCGGCACGCG GGCGCGTTGG CCGAATCGCC GGACATCCGG
GCGCTCGAAC GCGACGCGGA CGAGCGCCTT CGCGGGCTGT ACCAGCAGTG CGCGCAACAT
CAGGACGCGC CGGCCCGGCT GCACGGCGCG GCCGCGCGAG CGCGCCGCGA GTGCGTCGCG
TCGGCCGCCG CGGCCGGCCA GTCGCTGCAA GGCGCCGATC TGACCGGCGC GGACCTCTCG
GGAATGGACT TGCGCGGCGC GCGCCTGGCC GGCGCGATGC TGGAGAACGC CGATTTGAGC
GGCGCCGATC TGACGGGCGC GGATCTGTCG CGCACGGTGC TCGTGCGCGC CGATCTGACA
CGTGCGAAGC TCGTCGATGC GCGCCTGACG GCGGCCAATC TGTCGCTCGC GCATTGCGAG
CGGACGGATT TCTCCGGCTC GGATTTGAGT GACGGCATTT TCGAGCAGGT ACACCTACGA
GATTGCCGCT TCAACGGCAG CGTGCTGGCG AGCACGCGCT TCGACGCGTG CCGGTTCGAT
GCCGTCGATT TCGGTCGCGC GACGCTGCGC GAGCTGATCT TCATCGAACA ATCGTTCAGC
GGCGTGAGCT TCTCGGATGC GACGATCCGC AAGATGCTGC TGATGCGTTG CGCGTTCGCC
GACGTGCGGT TCTCGGCGGC GAGCATCGAC GGATTCGGGA TCGTCGAGAC GCAGGCGAGC
GGGCAGCTCC GCTTCGATCG CGCGAGCGTG AACAAAGCGT GTTTCGTCGG GCGCTGCGAC
ATCGGGCGCG CCGATTTCTC GTTCGCGACG CTGACGGAGG TCAATTTCCG CGAGACGCAG
CTCGTCGAGG CGAACTTCGG CGGCGCGCGC ATCGGCAATT GCGATTTCAC CGATGCGTGC
CTGCGAGCAG CCGATCTACG GGGCGCGAAG GCCGAGGGCA GCCCGTTCGT GCGCGCCGAT
CTCACGCGCG CCGATCTTCG GGACACCGAT CTGATCGCCG CGTATCTGCG CGGCGCGAAG
CTGGACGGCG CGGACCTTCG GCGCGCCAAC CTGTTTCGCG CGAACCTCTC GCAGATCCTC
ACCGATGCCG ATACGCGCTG GCAGGGCGCG TACCTGAACC GGGCGGTGCG GTTTCCGCTG
GCGGAGGCGC GCACATGA
 
Protein sequence
MKIVKPLAIS PLTRVYRMHG REYLGVAALL IATLGDEPKL LAESALWRLA GDELRGYPLD 
MALPKACPEF LVSGYAYGKY ASDPHACACE VGVRIAGLEK RLRVCGDRQW AGARITAPRP
FERLPIDWDL AYGGAGCADN PRGRGAHARE GAPRDLPNVE YAHSPMRFAH EQPAPAGFCP
VDAAWPARAG LYGALDRQWQ EEDCPGFPRT LDPRYFNIAP ADQQLPELRA FPDGARYELT
HMHPDHATLA GNLPALRARS FVVRRGSDAP EEMPMRLTTA WFVPHRERVI LIYHGVTPVR
AFDASDVQTV LFGAEASGHA RPADWYRQVI EWRTRDDRAA LYALRDRDLL PEHALAPEAA
ATPEPTQQSA KQRQLRERLS VFPDAPRAQT PAPDRLAEFV EQQQALADEK RAALEAMRRE
LATSEVFSVG RRRGPPGRIA PADEDPARHA GALAESPDIR ALERDADERL RGLYQQCAQH
QDAPARLHGA AARARRECVA SAAAAGQSLQ GADLTGADLS GMDLRGARLA GAMLENADLS
GADLTGADLS RTVLVRADLT RAKLVDARLT AANLSLAHCE RTDFSGSDLS DGIFEQVHLR
DCRFNGSVLA STRFDACRFD AVDFGRATLR ELIFIEQSFS GVSFSDATIR KMLLMRCAFA
DVRFSAASID GFGIVETQAS GQLRFDRASV NKACFVGRCD IGRADFSFAT LTEVNFRETQ
LVEANFGGAR IGNCDFTDAC LRAADLRGAK AEGSPFVRAD LTRADLRDTD LIAAYLRGAK
LDGADLRRAN LFRANLSQIL TDADTRWQGA YLNRAVRFPL AEART