Gene BMA10247_A1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A1681 
Symbol 
ID4889894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp1622558 
End bp1625200 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content76% 
IMG OID640147946 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001078864 
Protein GI126447541 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCG TCAAACCCGA AACGCTCGCG CTGATGTGCC GCACGCTGCG CATCGAACGC 
GCCGACCGGC TGTCGATCGG CGCGCTCGCC TGCTTCGCGC TGCGCGCGCA CGCGCCCGAC
GGCCCCGGCG ATCTCGCGCC GGAGGCCACG CTCTGGCAGA TCGCCCAGCA ATGGCTCGGC
GCCCATGCGC CGCTCGACGA GGGCTGGCCG AAGCCGGCGG GCGAATTTCT CGTCTACGGC
GACGCATGCG CGCCGGCGGG CCGCGAGCAC GCGGGCGGCG CGCCGTTCGC GGTGCGCGCA
CGCATCGGCG CGGCATGCAA GGCGCGGCTC GTCGATGCGC GCGACCCCGC CGAGCGGGTA
CTCGCCGATT TTCGCGCGCT GCCGCCGTCG CATCCGCAGC GCGTGCGCGA TCTCGGGCCG
TTCGACGCGC GCTGGCTGGC CGAGCGGTGG CCTCACCTGC CTTCGGGCAC GCGCGCCGAG
CATTTCCATA CCGCACCGCG CGATCAGCGG ATCGCCGGGT TCTGGCGCGG CGACGAGGAC
ATCGAGCTCG TCAACCTGCA CGCGCAGCAT CCGATCGTCG CCGGCGCGTT GCCGCGCGTG
CGGGCGCGCT GCTTCGTCGA GCGCTCGGCC GGCGGCGCGA CGCGCGTCGA CGCGTGCCCG
ATGCGCGCGG AAACCGTCTG GCTGTTCCCC GGCGCGGCAT GCGGCATCGT CCTGTACCGC
GGGCTCGCCA CGATCGACGA CGAAGACGGC GACGACGTCT TGCGCGTGAT CGCCGGCTGG
GAAGATGCCG CCGCGCCGCC GTTGCCCGCC GACGCTTATC TCGGCCGGCC GGCCTCCGGG
GGCGCCGGCT CGCGCCCGAC GCCCGCGCCC GACGCCGCGC CCGTGGCGTC TGTGGTGCCC
GCCGCGCTCA CCGCCGAAGA AGCGCACGCC GACGAACATG CGCCGGGCGG ATCGGCATCG
GCCTCGCAGG CGCACTCGCC GGCAGCGCCC GAGTTCCCCG AAGCACCGCA CGCGCCGGAT
CTGTCCGCGC TCGAACGGGA AGCGGCGGCG CTTGCCGGGC AAACCGACGC GTTGCTCGCC
GGGCTGGGCA TCACCGAAGC GGACATCGCG CGCTTGCTGC CCGCGCGCGA GGCCCCCGCC
GAGCTGAATC TGGATCAGCT CGCCACGCTC GCGGCCGAAC TCGACGCGCA AACCGCACAG
TGGCAGGCGC AGCAAGCGGT GGCGGCCGTC GAGCGCGGCG ACGCGGCCTC GGCGACGCCC
GCCGCGCCGG CCACGCCGGA TGCCGAGGCC GCGCACGAAG CGTCGCTGGC CGACCTGCTT
CGGCAGGCCG ACGCGCAAAT GCGCGCCCTC GTCGAGCAGC ACGGCCTGTC GCGCGCACGA
ATGGAGGCGG CCGCGCGAAC CCTGCCGGAG CTCGCGCCCC TCGCGGGCTC GCTCGATGCC
CTCGACGCAC TCGATGCGCC GCTCGACGTC GATGCCTTGA CGGCAGGGCT CGCCGCCGCC
GGCGGCGACG CGGCGGCCGA ACCGGATACG CCGGCCGAAC CGAGCCCGCC GGCGCCCGCG
AACGAATTCG CCGCCGCCGT GCCCGCCCCC GCATCGTCCA CCGCCGCGCC GCCGGTGGAC
GATGCGCCGC CAGGGCCGCT CACGCGCGAG CAAGTAATCG AGCGCCACGC GCGCGGGCTC
GGCTTCGCCG GCCTCGACCT GAGCGGCCTG GACCTGTCGT CGGCCGCGCT CGAGCGCGCG
GACTTTCGCC GCGCACGCCT CGAACGCACC CGCTTCGCGG GCTGCCGGCT CGCCGGCGCA
TCGTTCGAGC GCGCGCTGCT GTCGCACGCC GATTTCTCGA ACGCGGACCT GCGCGACGCG
GTCTTCGCCG GCGCCTCCGC GCCCGGCGCA TCGTGGCGCG GCGCCGTGCT CGAGCGCGCG
CGCCTCGAGC ACGGCGACTT CAGCGGCGGC GACTTCGTGC AAGCGTCGCT CGCCGACAGC
CATTGCGCGC ACGCGCAGTT CGACGCGAGC GCGATGACGG CGCTCGTCGC GGCGCGCATC
GACGGCACGC ACGCGAGCTT CGCCGGCTGC ACGCTCGACG CCGCCGATTT CACGTCGGCG
CGCCTGCCGC GCGCGAATTT CCAGCATGCG ACGCTCGCGG ACGCGGCGCT CGCCTGCGCG
CACTGCGACG GCGCCGAATG GTACGGCGCG CAGGCGCCGC GTGTCCGGCT TCGCGCGGCG
TCGCTGCGCG GCTCGCGCGC GGACGCGTCG ACATCGTTCC GGCAAGCCGA TCTGAGCAGC
GCCGCGCTCG ACGACGCGAA CTGGGACGGC GTCGACCTGC GCGGCACGAA CCTGCACGAG
GCGACGCTCG ACGGCGCGAG CCTCGCGCGC GCGAACGCGA GCGGCGCGCA ACTGACGCGC
GCGCGCGCAC GGCGCGCGGA TCTGACCCAG GCCGACCTCA CGCACGCGGA TGCGCGCTGC
TCGAACCTGC ACGGCGCATC GCTGCGCCGC GCACGGCTCG GCGGCACGCA ACTGCAATCG
AGCAACCTGT ACGGCGCCGA CTGCTACGGC ACCGCGCTCG CCCGGCCGCA GCTCGACGGC
GCGAATATCG AGCGCACGCT CCTCGCCGTG CCGGGCCGCC CCGAACTCGC CGCCTCCCGC
TGA
 
Protein sequence
MKIVKPETLA LMCRTLRIER ADRLSIGALA CFALRAHAPD GPGDLAPEAT LWQIAQQWLG 
AHAPLDEGWP KPAGEFLVYG DACAPAGREH AGGAPFAVRA RIGAACKARL VDARDPAERV
LADFRALPPS HPQRVRDLGP FDARWLAERW PHLPSGTRAE HFHTAPRDQR IAGFWRGDED
IELVNLHAQH PIVAGALPRV RARCFVERSA GGATRVDACP MRAETVWLFP GAACGIVLYR
GLATIDDEDG DDVLRVIAGW EDAAAPPLPA DAYLGRPASG GAGSRPTPAP DAAPVASVVP
AALTAEEAHA DEHAPGGSAS ASQAHSPAAP EFPEAPHAPD LSALEREAAA LAGQTDALLA
GLGITEADIA RLLPAREAPA ELNLDQLATL AAELDAQTAQ WQAQQAVAAV ERGDAASATP
AAPATPDAEA AHEASLADLL RQADAQMRAL VEQHGLSRAR MEAAARTLPE LAPLAGSLDA
LDALDAPLDV DALTAGLAAA GGDAAAEPDT PAEPSPPAPA NEFAAAVPAP ASSTAAPPVD
DAPPGPLTRE QVIERHARGL GFAGLDLSGL DLSSAALERA DFRRARLERT RFAGCRLAGA
SFERALLSHA DFSNADLRDA VFAGASAPGA SWRGAVLERA RLEHGDFSGG DFVQASLADS
HCAHAQFDAS AMTALVAARI DGTHASFAGC TLDAADFTSA RLPRANFQHA TLADAALACA
HCDGAEWYGA QAPRVRLRAA SLRGSRADAS TSFRQADLSS AALDDANWDG VDLRGTNLHE
ATLDGASLAR ANASGAQLTR ARARRADLTQ ADLTHADARC SNLHGASLRR ARLGGTQLQS
SNLYGADCYG TALARPQLDG ANIERTLLAV PGRPELAASR