Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2033 |
Symbol | |
ID | 4906228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2005030 |
End bp | 2007663 |
Gene Length | 2634 bp |
Protein Length | 877 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640145138 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001076066 |
Protein GI | 126455886 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG TCAAACCCGA AACGCTCGCG CTGATGTGCC GCACGCTGCG CATCGAACGC GCCGACCGGC TGTCGATCGG CGCGCTCGCC TGCTTCGCGC TGCGCGCGCA CGCCCCCGAC GGCCCCGGCG ATCTCGCGCC GGAGGCCACG CTCTGGCAGA TCGCCCAGCA ATGGCTCGGC GCCCATGCGC CGCTCGACGA GGGCTGGCCG AAGCCGGCGG GCGAATTTCT CGTCTACGGC GACGCATGCG CGCCGGCGGG CCGCGAGCAC GCGGGCGGCG CGCCGTTCGC GGTGCGCGCG CGCATCGGCG CGGCATGCAA GGCGCGGCTC GTCGATGCGC GCGACCCCGC CGAGCGGGTA CTCGCCGATT TTCGCGCGCT GCCGCCGTCG CATCCGCAGC GCGTGCGCGA TCTCGGGCCG TTCGACGCGC GCTGGCTGGC CGAGCGGTGG CCTCACCTGC CTTCGGGCAC GCGCGCCGAG CATTTCCATA CCGCACCGCG CGATCAGCGG ATCGCCGGGT TCTGGCGCGG CGACGAGGAC ATCGAGCTCG TCAACCTGCA CGCGCAGCAT CCGATCGTCG CCGGCGCGTT GCCGCGCGTG CGGGCGCGCT GCTTCGTCGA GCGCTCGGCC GGCGGCGCGA CGCGCGTCGA CGCGTGCCCG ATGCGCGCGG AAACCGTCTG GCTGTTCCCC GGCGCGGCAT GCGGCATCGT CCTGTACCGC GGGCTCGCCA CGATCGACGA CGAAGACGGC GACGACGTCT TGCGCGTGAT CGCCGGCTGG GAAGATGCCG CCGCGCCGCC GTTGCCCGCC GACGCTTATC TCGGCCGGCC GGCCTCCGGG GGCGCCGGCT CGCGCCCGAC GCCCGCGCCC GACGCCGCGC CCGTGGCGTC TGTGGTGCCC GCCGCGCTCA CCGCCGAAGA AGCGCACGCC GACGAACATG CGCCGGGCGG ATCGGCATCG GCCTCGCAGG CGCACTCGCC GGCAGCGCCC GAGTTCCCCG AAGCACCGCA CGCGCCGGAT CTGTCCGCGC TCGAACGGGA AGCGGCGGCG CTTGCCGGGC AAACCGACGC GTTGCTCGCC GGGCTGGGCA TCACCGAAGC GGACATCGCG CGCTTGCTGC CCGCGCGCGA GGCCCCCGCC GAGCTGAATC TGGATCAGCT CGCCACGCTC GCGGCCGAAC TCGACGCGCA AACCGCACAG TGGCAGGCGC AGCAAGCGGC GGCGGCCGTC GAGCGCGGCG ACGCGGCCTC GGCGACGCCC GCCGCGCCGG ATGCCGAGGC CGCGCACGAA GCGTCGCTGG CCGACCTGCT TCGGCAGGCC GACGCGCAAA TGCGCGCCCT CGTCGAGCAG CACGGCCTGT CGCGCGCACG AATGGAGGCG GCCGCGCGAA CCCTGCCGGA GCTCGCGCCC CTCGCGGGCT CGCTCGATGC CCTCGACGCA CTCGATGCGC CGCTCGACGT CGATGCCTTG ACGGCAGGGC TCGCCGCCGC CGGCGGCGAC GCGGCGGCCG AACCGGATAC GCCGGCCGAA CCGAGCCCGC CGGCGCCCGC GAACGAATTC GCCGCCGCCG TGCCCGCCCC CGCATCGTCC ACCGCCGCGC CGCCGGTGGA CGATGCGCCG CCAGGGCCGC TCACGCGCGA GCAAGTAATC GAGCGCCACG CGCGCGGGCT CGGCTTCGCC GGCCTCGACC TGAGCGGCCT GGACCTGTCG TCGGCCGCGC TCGAGCGCGC GGACTTTCGC CGCGCACGCC TCGAACGCAC CCGCTTCGCG GGCTGCCGGC TCGCCGGCGC ATCGTTCGAG CGCGCGCTGC TGTCGCACGC CGATTTCTCG AACGCGGACC TGCGCGACGC GGTCTTCGCC GGCGCCTCCG CGCCCGGCGC ATCGTGGCGC GGCGCCGTGC TCGAGCGCGC GCGCCTCGAG CACGGCGACT TCAGCGGCGG CGACTTCGCG CAAGCGTCGC TCGCCGACAG CCATTGCGCG CACGCGCAGT TCGACGCGAG CGCGATGACG GCGCTCGTCG CGGCGCGCAT CGACGGCACG CACGCGAGCT TCGTCGGCTG CACGCTCGAC GCCGCCGATT TCACGTCGGC GCACCTGCCG CGCGCGAATT TCCAGCATGC GACGCTCGCG GACGCGGCGC TCGCCTGCGC GCACTGCGAC GGCGCCGAAT GGTACGGCGC GCAGGCGCCG CGTGCCCGGC TTCGCGCGGC GTCGCTGCGC GGCTCGCGCG CGGACGCGTC GACGTCGTTC CGGCAAGCCG ATCTGAGCAG CGCCGCGCTC GACGACGCGA ACTGGGACGG CGTCGACCTG CGCGGCACGA ACCTGCACGA GGCGACGCTC GACGGCGCGA GCCTCGCGCG CGCGAACGCG AGCGGCGCGC AACTGACGCG CGCGCGCGCA CGGCGCGCGG ATCTGACCCA GGCCGACCTC ACGCACGCGG ATGCGCGCTG CTCGAACCTG CACGGCGCAT CGCTGCGCCG CGCACGGCTC GGCGGCACGC AACTGCAATC GAGCAACCTG TACGGCGCCG ACTGCTACGG CACCGCGCTC GCCCGGCCGC AGCTCGACGG CGCGAATATC GAGCGCACGC TCCTCGCCGT GCCGGGCCGC CCCGAACTCG CCGCCTCCCG CTGA
|
Protein sequence | MKIVKPETLA LMCRTLRIER ADRLSIGALA CFALRAHAPD GPGDLAPEAT LWQIAQQWLG AHAPLDEGWP KPAGEFLVYG DACAPAGREH AGGAPFAVRA RIGAACKARL VDARDPAERV LADFRALPPS HPQRVRDLGP FDARWLAERW PHLPSGTRAE HFHTAPRDQR IAGFWRGDED IELVNLHAQH PIVAGALPRV RARCFVERSA GGATRVDACP MRAETVWLFP GAACGIVLYR GLATIDDEDG DDVLRVIAGW EDAAAPPLPA DAYLGRPASG GAGSRPTPAP DAAPVASVVP AALTAEEAHA DEHAPGGSAS ASQAHSPAAP EFPEAPHAPD LSALEREAAA LAGQTDALLA GLGITEADIA RLLPAREAPA ELNLDQLATL AAELDAQTAQ WQAQQAAAAV ERGDAASATP AAPDAEAAHE ASLADLLRQA DAQMRALVEQ HGLSRARMEA AARTLPELAP LAGSLDALDA LDAPLDVDAL TAGLAAAGGD AAAEPDTPAE PSPPAPANEF AAAVPAPASS TAAPPVDDAP PGPLTREQVI ERHARGLGFA GLDLSGLDLS SAALERADFR RARLERTRFA GCRLAGASFE RALLSHADFS NADLRDAVFA GASAPGASWR GAVLERARLE HGDFSGGDFA QASLADSHCA HAQFDASAMT ALVAARIDGT HASFVGCTLD AADFTSAHLP RANFQHATLA DAALACAHCD GAEWYGAQAP RARLRAASLR GSRADASTSF RQADLSSAAL DDANWDGVDL RGTNLHEATL DGASLARANA SGAQLTRARA RRADLTQADL THADARCSNL HGASLRRARL GGTQLQSSNL YGADCYGTAL ARPQLDGANI ERTLLAVPGR PELAASR
|
| |