Gene Information |
Locus tag | BURPS1106A_3284 |
Symbol | |
ID | 4902594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3199543 |
End bp | 3200826 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640136510 |
Product | putative capsular polysaccharide biosynthesis protein |
Protein accession | YP_001067521 |
Protein GI | 126453898 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
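The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3199543, 3200826)
print(length)                  # 1284
print(protein_length(length))  # 427
```

Both values agree with the Gene Length and Protein Length fields of this record.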
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.32234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCAGTC GGGAGCGGCT GGCGCGCCAA ATTGTGTTCC ATCACATTCC CAAGACGGCG GGATCGTCGT TCAATCAGAT ACTTCGCACG CTATATCGCG ACGACGAAGT ATGCAACGCT GCGTTGGATG ACGAACTCGA TGAAGTGATG GCCGACGAGA CGCGTCGTTA CGAGCTGTTT GTCGGGCATT TCAGCTTCGA CGCGCTGCAT CGGCACTTCG GCGGCGCCAC GCGTTTGACT TTTCTTCGCG ATCCGGTTCA GCGCTGTATT TCCCAGTATC ACAACTGGCA TGACGCCTCG CGCTATTCGG ATGCGTGGAT CGGGCGCAGC GACACGAATC CGGACGTCAT CAAGGCGCTG AAGATGACGT CCGAGATGTC GCTTGGTGAA TTTGTGAGTT CGGACAATCT CGTGATTTCC GACAGCGCTC AAAACATGAT GACTCGCTAC CTCGCGCCGA GCGTCGAATG GAAGAAGGAG CGTGGATACT ATGACGCCGA GCTTGTCGAG AAAGCCAAGC GCAATCTCGT CGAGTATTTT CATTTTTTTG GCCTGACCGA GCAATTTGAT CGTTCACTAG TGCTTCTTGC GCATACCCTC GGTATCCGCC CATGGGAACG GAGCGATGCA CTGCTAACTA ACCGCAATCC GAAGAAGGCT TCGTTCGACA GTGTTTACAA TACCACGCCA GAAGAAGGCG GTGTTTTACG CGATTACAAC TTGATGGATA TCGAGTTGTA CGAGTTCGCG GTAAAGGAAT TCAATCGCCG CTTCGACGCG GGATACCAGA AGCTTGTCGA GTGCGCCTTT GAGTATCTCG CTGACAAGGA CACTCGCGAC ATGGGTAATG CTGGCGATTT TTACACGTTC GACATGACGA ACGCGGTCGG CGCCCGAGGT TTGCATTTTC TGGAATCCAC CCGGTTGCCG TGCGGTGCGA ATGTTCTTGG ACGTTGGACA GGGCTGGAGC CGCGAGCTGT ATGGGAGATT CCGCTTCGCG CGGGGCGCGA CAGCCATGTC GTGATCGAAG TGGACTATAT CGATAGCGTG TCGCAGGAGG CCCTGGCGCC GGAGCATTTC ACGTTAAACG GCATGCCGGC CAGGCAGCAT GCGTTCAGCG CGGAGGGCTC GATCCAGCGT CTGCGCCTGG TCTTTTCCGC CGGCGCCGCG CTTGCCGGCA GAATGTTGCA CACGCTGAAA TTGACTACTC CGCTTGTGCG TGCGGAAGAC GGAACGCGCG ACGTTGGAGT GCTTCTATTG CGCTTGCAGT CTTACAGCGT TTAG
|
Protein sequence | MRSRERLARQ IVFHHIPKTA GSSFNQILRT LYRDDEVCNA ALDDELDEVM ADETRRYELF VGHFSFDALH RHFGGATRLT FLRDPVQRCI SQYHNWHDAS RYSDAWIGRS DTNPDVIKAL KMTSEMSLGE FVSSDNLVIS DSAQNMMTRY LAPSVEWKKE RGYYDAELVE KAKRNLVEYF HFFGLTEQFD RSLVLLAHTL GIRPWERSDA LLTNRNPKKA SFDSVYNTTP EEGGVLRDYN LMDIELYEFA VKEFNRRFDA GYQKLVECAF EYLADKDTRD MGNAGDFYTF DMTNAVGARG LHFLESTRLP CGANVLGRWT GLEPRAVWEI PLRAGRDSHV VIEVDYIDSV SQEALAPEHF TLNGMPARQH AFSAEGSIQR LRLVFSAGAA LAGRMLHTLK LTTPLVRAED GTRDVGVLLL RLQSYSV
| |
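The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline: it assumes the gene sequence is already given 5'→3' on the coding strand, uses the standard amino-acid assignments (which translation table 11 shares), and treats only ATG, GTG, and TTG as start codons read as methionine, which is a simplification. The helper names `translate` and `gc_content` are hypothetical.

```python
import itertools

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}
# Assumption: a simplified subset of the bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGCGCAGTC GGGAGCGGCT GGCGCGCCAA"))  # MRSRERLARQ
```

The printed peptide matches the first block of the protein sequence, including the GTG start codon being read as M.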