Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A1957 |
Symbol | |
ID | 3692708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 2390813 |
End bp | 2393743 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637732211 |
Product | hypothetical protein |
Protein accession | YP_337108 |
Protein GI | 76819107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.52891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGCAT GCGCCCGTCC TTTCGACGCC GCCGCGCACG TCGAAGAAAC GATCGCCGCG CCGACGCATC TCGGCGACTG GCTCGCCGCC CATCAGTCGA CGCCCGCCGT CGGAACGCCG CCGTCCGGCG CGCCTTCGCC GTACCTCGGC GGCCTGAGCT GGCGCTCGAA CCGCGAAGTC GCCGCACAGC AGGCGAGCAA GCGCCGCCTG CTCGCCGGCA TCGACGCGCT GCCCGCGCTC ACGCCGGCCG CGCAGGCGGC GCGGGCACGC CTCTCGGCGA TGATCGCCGC GCGTGCCGCA ACCGGCCGCG TGATCGTCGC GCGAAGCGAT GCGCGCTGGC TGCAGGCCAA TCCCGCCCAC GATCCCTATC TCGAAGCCGG CGACGTCGTG ACGATTCCCG ATCGCCCGTC GAGCGTCGCC GTCGTGCGCG CGGACGGCTC GATCTGCACG GTCGCTCACG TGCAGGACGT CGAAGCATTG CCGTACGTGC TCGCGTGCGC CCCCGACGCG GCGCCCGATC TCGCGTGGAT CGCGCAACCC GACGGCACGG TCAGCGAAAG CAAGGTGGCG ATGTGGAATC GCGACGTGCA GGACGCGCCG GCGCCCGGCA GTTGGATCTG GGCGCCCGAT CGGGGCAGCC GATGGCCGCC GGCCCTGTCG CGCGCCCTGG CGGAATTCAT GGCGACGCAG GGCGTATCCG GGCTCGCGGA CGACGGCTCG CCGCTGCCCG CGCCTCCCAT TGCGCCCGTC CACCAGACCG CGTTTCCGAG CGGCGCGCCC GGCCGGTCCG CAGCGTTCCC GGTAACGGGC GGCGACTGGG GCACGGCGGG CATTCTGCAA ACGCCGACGG CGCGAATGAA CGACGCCGGC GAAGCATCGC TCAGCATGAG CCACGTGAGC CCGTACACGC GCCTGAACTT CACGCTGCAG CCGCTCGATT GGCTCGAAAT CGGGTTCCGC TACACCGACG TCAGCAATCA GCCGTACGGC CCCGTCTCGC TGAGCGGCAC CCAGTCGTAC AAGGACAAGA GCATCGACGC GAAGCTCAGG CTGTGGCGCG AATCCGCCTA TCTGCCCGAC GTGGCCGTCG GCTTTCGCGA CATCGCCGGC TCGGGCCTGT TCTCCGGCGA GTACCTGGTG GCCAGCAAGC GAACCGGGCC GTTCGACTGG AGCGTCGGCC TCGGCTGGGG TTACGTGGGC GCGCGCGGCA ATCTGCGCAA CCCGCTGGCG GTGGTCAGCC GGCGGTTCGA CGATCGCACG AACAGCGCGA CACCGAACGG CGGCGAGCTC GGCTACAGCT CATGGTTTCG CGGCCGCGTC TCGCCGTTCG GCGGCGTGCA GTACCAGACG CCGCATGAGC GCCTCATCCT GAAAGCCGAA TACGACGGCA ACGACTATCG GCACGAACCG TTCGGTCAAG TGCTGAAGGC ACGATCGCCA TTCAACTTCG GCGCCGTCTA TCGCGCGACG CGCAACATCG ACTTGAGCCT CGGCTTCGAG CGAGGCGCGC GCGTGATGTT CGGCGTCTCG CTGCACGGCA ATCTGAAGCG TGCGTCGATG CCCAAGCTCG GCAATCCGCC GGCTCCGCCG GTGACGCAAC CGGCCGCGAA CGCCGGGCCG CCCCCGCCGG CCGCCGATCC GGCATCGGGC GACGCGCAAG CGGCGACCGC GCAGGCATCG CGCATCGGAC GCGCGTCGCC GTCGCCGTTC GATCGCGACT GGTCCGGCAC CGTCGCGCAA TTGCAGGCGC AAACGCATTG GCACGTGCGC AGCATCCGTG CGCTCGGCAT GGATCTCGTC GTCGAGTTCG ACGACGTCGA CGCGTTCTAC CTGCAGGACC CGCTCGAGCG CATCGCGACG ATCCTGAACC GTGACGCGCC GCTCAACGTG CGCACGTTCC ATGTCGTCGC GCTCGTGCAC GGCGTGCCGG TTGCCGACTA TCAGGTGCAG CGCACGCAGT GGTTCGCGAG CCGCACCCGC GCCCTCACGC CGAGCGAGGC TGCGCCCGAC ACGGCGCTCG GCCGGCCGCT CACGCGACAG TCGATCGACA TGCTGCCCTC TCTATTCGAG CAGCGGCCCA AGGCCTTCGT GGCGTCGGTC GGGCCGGGCT ACCGGCAAAC CCTTGGCGGT CCGAACGGTT TCCTGCTCTA CCAGATCTCC GCCGATGCAT ACGGCGAGCT GAGACTGCCC GGCGGCGCAT GGCTCGGCGG CGAACTGAAC GTGGGGCTCG TCGACAACTA CGGCAAGTTC ACCTACACGG CGGACAGCAA GCTGCCGCGC GTGCGCACGT ATCTGCGCGA GTACCTGACG ACGTCGCGCG TCACGCTGCC GCTGCTGCAA CTGACGAAGA TGGGACGCCT CGGCAACGAT CAGTTCTACA GCGTATACGG CGGGCTGCTC GAAAGCATGT TCGCGGGCGT CGGGGCCGAA TGGCTGTATC GCCCGGCGGA TAGCCGCCTC GCGATCGGCG TCGACGTGAA CGCGGTGCGG CAGCGCGGCT TCCGCCAGGA TTTCTCGATG CGCGACTACC GGACGCTCAC CGGACACGTG ACGGCGTATT GGAACACCGG GTGGCAGGGC ATCCAAATCA ATCTGAGCGT CGGCCAGTAT CTGGCGAAGG ACAAGGGCGC GACGCTCGAC ATTTCGCGGC GCTTTCGCAA CGGCGTCGTG ATCGGCGCCT ATGCGACGAA GACGAACATA TCGGCGGCCC AATTCGGCGA AGGCAGCTTC GACAAGGGCA TCTACCTGAC GATTCCGTTC GACGCGATGA TGACGCGCTC GAGCGGCAGC GTGGCGAATC TGCGCTGGAA CCCCGTGACG CGCGACGGCG GCGCGAAGCT GGATCGCAAA TATCCGCTGT ACGATCTCAC CGACATGGGC GAGCGCCGCA GCTTGTGGTA CGCGCCGCCG GATGGCGCAT TGTCGCCGTG A
|
Protein sequence | MLACARPFDA AAHVEETIAA PTHLGDWLAA HQSTPAVGTP PSGAPSPYLG GLSWRSNREV AAQQASKRRL LAGIDALPAL TPAAQAARAR LSAMIAARAA TGRVIVARSD ARWLQANPAH DPYLEAGDVV TIPDRPSSVA VVRADGSICT VAHVQDVEAL PYVLACAPDA APDLAWIAQP DGTVSESKVA MWNRDVQDAP APGSWIWAPD RGSRWPPALS RALAEFMATQ GVSGLADDGS PLPAPPIAPV HQTAFPSGAP GRSAAFPVTG GDWGTAGILQ TPTARMNDAG EASLSMSHVS PYTRLNFTLQ PLDWLEIGFR YTDVSNQPYG PVSLSGTQSY KDKSIDAKLR LWRESAYLPD VAVGFRDIAG SGLFSGEYLV ASKRTGPFDW SVGLGWGYVG ARGNLRNPLA VVSRRFDDRT NSATPNGGEL GYSSWFRGRV SPFGGVQYQT PHERLILKAE YDGNDYRHEP FGQVLKARSP FNFGAVYRAT RNIDLSLGFE RGARVMFGVS LHGNLKRASM PKLGNPPAPP VTQPAANAGP PPPAADPASG DAQAATAQAS RIGRASPSPF DRDWSGTVAQ LQAQTHWHVR SIRALGMDLV VEFDDVDAFY LQDPLERIAT ILNRDAPLNV RTFHVVALVH GVPVADYQVQ RTQWFASRTR ALTPSEAAPD TALGRPLTRQ SIDMLPSLFE QRPKAFVASV GPGYRQTLGG PNGFLLYQIS ADAYGELRLP GGAWLGGELN VGLVDNYGKF TYTADSKLPR VRTYLREYLT TSRVTLPLLQ LTKMGRLGND QFYSVYGGLL ESMFAGVGAE WLYRPADSRL AIGVDVNAVR QRGFRQDFSM RDYRTLTGHV TAYWNTGWQG IQINLSVGQY LAKDKGATLD ISRRFRNGVV IGAYATKTNI SAAQFGEGSF DKGIYLTIPF DAMMTRSSGS VANLRWNPVT RDGGAKLDRK YPLYDLTDMG ERRSLWYAPP DGALSP
|
| |