Gene BURPS1106A_A2033 details


Gene Information

Locus tag: BURPS1106A_A2033
Symbol:
ID: 4906228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2005030
End bp: 2007663
Gene Length: 2634 bp
Protein Length: 877 aa
Translation table: 11
GC content: 76%
IMG OID: 640145138
Product: pentapeptide repeat-containing protein
Protein accession: YP_001076066
Protein GI: 126455886
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
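
The coordinates and lengths listed above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the terminal stop codon is not counted in the protein length; the record states neither, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2005030, 2007663)
print(length)                  # 2634, matching "Gene Length"
print(protein_length(length))  # 877, matching "Protein Length"
```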


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCG TCAAACCCGA AACGCTCGCG CTGATGTGCC GCACGCTGCG CATCGAACGC 
GCCGACCGGC TGTCGATCGG CGCGCTCGCC TGCTTCGCGC TGCGCGCGCA CGCCCCCGAC
GGCCCCGGCG ATCTCGCGCC GGAGGCCACG CTCTGGCAGA TCGCCCAGCA ATGGCTCGGC
GCCCATGCGC CGCTCGACGA GGGCTGGCCG AAGCCGGCGG GCGAATTTCT CGTCTACGGC
GACGCATGCG CGCCGGCGGG CCGCGAGCAC GCGGGCGGCG CGCCGTTCGC GGTGCGCGCG
CGCATCGGCG CGGCATGCAA GGCGCGGCTC GTCGATGCGC GCGACCCCGC CGAGCGGGTA
CTCGCCGATT TTCGCGCGCT GCCGCCGTCG CATCCGCAGC GCGTGCGCGA TCTCGGGCCG
TTCGACGCGC GCTGGCTGGC CGAGCGGTGG CCTCACCTGC CTTCGGGCAC GCGCGCCGAG
CATTTCCATA CCGCACCGCG CGATCAGCGG ATCGCCGGGT TCTGGCGCGG CGACGAGGAC
ATCGAGCTCG TCAACCTGCA CGCGCAGCAT CCGATCGTCG CCGGCGCGTT GCCGCGCGTG
CGGGCGCGCT GCTTCGTCGA GCGCTCGGCC GGCGGCGCGA CGCGCGTCGA CGCGTGCCCG
ATGCGCGCGG AAACCGTCTG GCTGTTCCCC GGCGCGGCAT GCGGCATCGT CCTGTACCGC
GGGCTCGCCA CGATCGACGA CGAAGACGGC GACGACGTCT TGCGCGTGAT CGCCGGCTGG
GAAGATGCCG CCGCGCCGCC GTTGCCCGCC GACGCTTATC TCGGCCGGCC GGCCTCCGGG
GGCGCCGGCT CGCGCCCGAC GCCCGCGCCC GACGCCGCGC CCGTGGCGTC TGTGGTGCCC
GCCGCGCTCA CCGCCGAAGA AGCGCACGCC GACGAACATG CGCCGGGCGG ATCGGCATCG
GCCTCGCAGG CGCACTCGCC GGCAGCGCCC GAGTTCCCCG AAGCACCGCA CGCGCCGGAT
CTGTCCGCGC TCGAACGGGA AGCGGCGGCG CTTGCCGGGC AAACCGACGC GTTGCTCGCC
GGGCTGGGCA TCACCGAAGC GGACATCGCG CGCTTGCTGC CCGCGCGCGA GGCCCCCGCC
GAGCTGAATC TGGATCAGCT CGCCACGCTC GCGGCCGAAC TCGACGCGCA AACCGCACAG
TGGCAGGCGC AGCAAGCGGC GGCGGCCGTC GAGCGCGGCG ACGCGGCCTC GGCGACGCCC
GCCGCGCCGG ATGCCGAGGC CGCGCACGAA GCGTCGCTGG CCGACCTGCT TCGGCAGGCC
GACGCGCAAA TGCGCGCCCT CGTCGAGCAG CACGGCCTGT CGCGCGCACG AATGGAGGCG
GCCGCGCGAA CCCTGCCGGA GCTCGCGCCC CTCGCGGGCT CGCTCGATGC CCTCGACGCA
CTCGATGCGC CGCTCGACGT CGATGCCTTG ACGGCAGGGC TCGCCGCCGC CGGCGGCGAC
GCGGCGGCCG AACCGGATAC GCCGGCCGAA CCGAGCCCGC CGGCGCCCGC GAACGAATTC
GCCGCCGCCG TGCCCGCCCC CGCATCGTCC ACCGCCGCGC CGCCGGTGGA CGATGCGCCG
CCAGGGCCGC TCACGCGCGA GCAAGTAATC GAGCGCCACG CGCGCGGGCT CGGCTTCGCC
GGCCTCGACC TGAGCGGCCT GGACCTGTCG TCGGCCGCGC TCGAGCGCGC GGACTTTCGC
CGCGCACGCC TCGAACGCAC CCGCTTCGCG GGCTGCCGGC TCGCCGGCGC ATCGTTCGAG
CGCGCGCTGC TGTCGCACGC CGATTTCTCG AACGCGGACC TGCGCGACGC GGTCTTCGCC
GGCGCCTCCG CGCCCGGCGC ATCGTGGCGC GGCGCCGTGC TCGAGCGCGC GCGCCTCGAG
CACGGCGACT TCAGCGGCGG CGACTTCGCG CAAGCGTCGC TCGCCGACAG CCATTGCGCG
CACGCGCAGT TCGACGCGAG CGCGATGACG GCGCTCGTCG CGGCGCGCAT CGACGGCACG
CACGCGAGCT TCGTCGGCTG CACGCTCGAC GCCGCCGATT TCACGTCGGC GCACCTGCCG
CGCGCGAATT TCCAGCATGC GACGCTCGCG GACGCGGCGC TCGCCTGCGC GCACTGCGAC
GGCGCCGAAT GGTACGGCGC GCAGGCGCCG CGTGCCCGGC TTCGCGCGGC GTCGCTGCGC
GGCTCGCGCG CGGACGCGTC GACGTCGTTC CGGCAAGCCG ATCTGAGCAG CGCCGCGCTC
GACGACGCGA ACTGGGACGG CGTCGACCTG CGCGGCACGA ACCTGCACGA GGCGACGCTC
GACGGCGCGA GCCTCGCGCG CGCGAACGCG AGCGGCGCGC AACTGACGCG CGCGCGCGCA
CGGCGCGCGG ATCTGACCCA GGCCGACCTC ACGCACGCGG ATGCGCGCTG CTCGAACCTG
CACGGCGCAT CGCTGCGCCG CGCACGGCTC GGCGGCACGC AACTGCAATC GAGCAACCTG
TACGGCGCCG ACTGCTACGG CACCGCGCTC GCCCGGCCGC AGCTCGACGG CGCGAATATC
GAGCGCACGC TCCTCGCCGT GCCGGGCCGC CCCGAACTCG CCGCCTCCCG CTGA
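
The GC content reported under Gene Information could in principle be recomputed from the gene sequence above. The following is a rough sketch, assuming GC content means the percentage of G and C among the unambiguous bases A, C, G and T, with the spaces between the 10-base blocks ignored; the function name is hypothetical and the database's exact rounding rule is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among A, C, G, T; whitespace and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full gene sequence (spaces and line breaks included)
# in place of this short excerpt, which is only the first two blocks.
excerpt = "ATGAAAATCG TCAAACCCGA"
print(gc_content(excerpt))
```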
 
Protein sequence
MKIVKPETLA LMCRTLRIER ADRLSIGALA CFALRAHAPD GPGDLAPEAT LWQIAQQWLG 
AHAPLDEGWP KPAGEFLVYG DACAPAGREH AGGAPFAVRA RIGAACKARL VDARDPAERV
LADFRALPPS HPQRVRDLGP FDARWLAERW PHLPSGTRAE HFHTAPRDQR IAGFWRGDED
IELVNLHAQH PIVAGALPRV RARCFVERSA GGATRVDACP MRAETVWLFP GAACGIVLYR
GLATIDDEDG DDVLRVIAGW EDAAAPPLPA DAYLGRPASG GAGSRPTPAP DAAPVASVVP
AALTAEEAHA DEHAPGGSAS ASQAHSPAAP EFPEAPHAPD LSALEREAAA LAGQTDALLA
GLGITEADIA RLLPAREAPA ELNLDQLATL AAELDAQTAQ WQAQQAAAAV ERGDAASATP
AAPDAEAAHE ASLADLLRQA DAQMRALVEQ HGLSRARMEA AARTLPELAP LAGSLDALDA
LDAPLDVDAL TAGLAAAGGD AAAEPDTPAE PSPPAPANEF AAAVPAPASS TAAPPVDDAP
PGPLTREQVI ERHARGLGFA GLDLSGLDLS SAALERADFR RARLERTRFA GCRLAGASFE
RALLSHADFS NADLRDAVFA GASAPGASWR GAVLERARLE HGDFSGGDFA QASLADSHCA
HAQFDASAMT ALVAARIDGT HASFVGCTLD AADFTSAHLP RANFQHATLA DAALACAHCD
GAEWYGAQAP RARLRAASLR GSRADASTSF RQADLSSAAL DDANWDGVDL RGTNLHEATL
DGASLARANA SGAQLTRARA RRADLTQADL THADARCSNL HGASLRRARL GGTQLQSSNL
YGADCYGTAL ARPQLDGANI ERTLLAVPGR PELAASR