Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0902 |
Symbol | |
ID | 4906337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 886603 |
End bp | 887853 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640144008 |
Product | hypothetical protein |
Protein accession | YP_001074938 |
Protein GI | 126456332 |
COG category | [S] Function unknown |
COG ID | [COG3214] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGTCCTT CCCCACCGCT CGTGTCCGCC CCGCTCTCGA TCGCTTCGGC CCGCGCGCTG CATCTCGCCG CGCAGGGCCT CTTGACGCCG CCTCGCCGCA AGGCGGTCAA GGCCGACGTG CTCGCCGCCG TCCGCCGGAT GGGCCAACTG CAGATCGACA CGATCCATGT CGTCGCGCGC AGCCCGTATC TCGTGCTGTT CAGCCGGCTC GGCGCATATG CGCCGCAATG GCTCGACGAA CATCTCGCCG ACGCGAAACT GTTCGAGTAC TGGTCGCATG AAGCGTGCTT CCTGCCGATC GAGGACTTCG GCCTGATGCG CCACAAGATG CTCAACCCCG TCGGCATGGG CTGGAAATAC GCGGCCGAAT GGCACGCGCA GCATCGCGAC GCAATCGACG CGCTGCTCGC ACATGTTCGC GCGCGCGGCC CCGTGCGCTC CGCCGATTTC GCGCGCGGCG CCGGCAAGGG CAACGGCTGG TGGGACTGGA AGCCCGAGAA GCGGCATCTC GAAGTGCTGT TCTCGACCGG GCAATTGATG GTCGCCGAGC GGCGCAACTT CCAGCGCGTC TACGACGTCG CCGAACGCGT GCTGCCGGAC TGGGACGACG CGCGCGACCT GCCGCCGCGC GCGGCGGTGG TGCCGCGCCT CGTCGGCAAC ACCTGTCGCG CGCTCGGCAT CGTCCGCGCG GACTGGATCG CCGATTATTA CCGGCTGCCG AAGCGCTCGT ATCGCGACGA ACTGCATGCG CTCGCGGACG CGGGCGAGCT GCTGCCCGTC GCGGTCGACG GCTGGCACGC CGACGCGTTC GTGCATCGCG AGCTCGCGCC GCTCGTCGAC GCCGCGCGCG ACGGCGCGCT GCGCCCGACG GTCACGACGC TGCTGTCGCC GTTCGATCCG GTCGTCTGGG ATCGGCGGCG TGCATCGGCG CTGTTCGATT TCGACTACAC GATCGAATGC TACACGCCCG CGCACAAGCG TCGCTACGGC TACTTCTGCC TGCCGATCCT GCACCGCGGG CGGCTCGTCG GCCGCGTCGA CGCGAAGGCG CATCGCACGC AGCGCGTGTT CGAGCTGAAG GCGGTGCACA TCGAGCCGGG CGTGCGGCTC GGCGCGGGGC TCGCGGCGGA TGTCGGTCGC GCGATCCGCA AGCTCGCCGA CTGGCACGAA ACGCCCGTCG TCGAGGCCGG CCACGCGCCG AAGGAGATCG CGCGAGCAAT CGGCGCGGAT CGCGTCGCCA AGCCGCGATA G
|
Protein sequence | MRPSPPLVSA PLSIASARAL HLAAQGLLTP PRRKAVKADV LAAVRRMGQL QIDTIHVVAR SPYLVLFSRL GAYAPQWLDE HLADAKLFEY WSHEACFLPI EDFGLMRHKM LNPVGMGWKY AAEWHAQHRD AIDALLAHVR ARGPVRSADF ARGAGKGNGW WDWKPEKRHL EVLFSTGQLM VAERRNFQRV YDVAERVLPD WDDARDLPPR AAVVPRLVGN TCRALGIVRA DWIADYYRLP KRSYRDELHA LADAGELLPV AVDGWHADAF VHRELAPLVD AARDGALRPT VTTLLSPFDP VVWDRRRASA LFDFDYTIEC YTPAHKRRYG YFCLPILHRG RLVGRVDAKA HRTQRVFELK AVHIEPGVRL GAGLAADVGR AIRKLADWHE TPVVEAGHAP KEIARAIGAD RVAKPR
|
| |