Gene Information |
Locus tag | BURPS1710b_A0550 |
Symbol | |
ID | 3692610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 744424 |
End bp | 746364 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637730804 |
Product | hypothetical protein |
Protein accession | YP_335709 |
Protein GI | 76819789 |
COG category | |
COG ID | |
TIGRFAM ID | |
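The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 744424
end_bp = 746364
gene_length_bp = 1941
protein_length_aa = 646


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(gene_length_bp)
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length reported in the record.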
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.210469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGATG CGGCGCGGCA GCGGCGCCGT CGTCGGGCGG TCGAGTCATC GAGTCATCGA GTCATCGAGT CATCGAGTCA TCGAGTCATC GAGTCATCGA TCTGTCGAAT CCATTTACGT TCACTTTCCA TTGATGTTCA CCTTCTGTCT CGCAATGATG AATCACGCTT TGAATCACGC TTCGGATCAA GCCGATCGTC CCAACGCGCC GCCTTCGACA CGAAGCGCCG CCGGCTCCGA CGCACCGTCT TCGACTCGTG CTCTCGACGA CGCCGGTGCG CTGCCGCCGA CTCGCACGGC ACGCCGTTCC GATACGCCGC CATCAGCTCG ACACGCACCG CCCCACTCGC CCCCCAGTTC ACTCGGGGAC TTCCGCGACG GCGCGCACCG CTCGGCATCC GGCGATTTTC CGGGCGCAAG GAGCGACGCC CGAACGCACA CCGCGCAAAT CGAGCGGTTC GTGAAAGCGC CCTCGCCGAA GCGCGCCCGA GAATGAACGC TCGACGTCCG GCGTTCGGCC TGATTGCGTC GCACGCGTCA CGCCGCCGGG CCGTCGAATC CGTGCGCGCC CACTTTCCGT TCATGTTCAC CCTCTGTCTT GCGATGAAAC ACACTTCGAA CCACTCCGAC CCTTCCGGTG CGCCGTCGTC AGCGCGACGC GCCGCATCCG ATTCGTCCCG CCTCGTCCGC GGCATGCGCT CGCCCCAACG CGGCGGCGCG GCACTGGCGC TGGCCGTCGC CTCGCTCGCC GGATGCGGGG GCGGCGATTC GGGCGAACCC GCTCCACGCG AATTCGCGCC CCCGACAGTG CAACTCGCCT ACCCGACGCA ACCGAACTCG CCCGTCGCAC CAGCGCCCAC CGCGCACGTA TCGAGCGGAC ACACGCCTCC GGCCACGGCG CCCGCCGCGA TGCCCACCGC TTCCCCCACC GCGACGCCCA CCGCTTCACC CACCGCGACG CCTACCGCTT CACCCACCGC GACGGCTGCC GCTTCGCCCT CCGCGCCATC CACTGCGACG CCCAACGCCC CGCCCGCCGC TCCGCCGGCC GTCGTCGCCA CCCGCGTCCC GCCGACGCAC GCGGCGTTGC GCCGCCCGAC GATCGAACTC GAATTCGATC GCGCGATCGA GCCGGGTTCA GTCCCACACA TCGTGCTGCG CGCCGACGAT GGCACGAGCG TCGCCGTCGG CCCGTTGTCG TGGCTGAGCG ATCGCCGGAT CGCGTTCGCG CCGCGCAAAC CGCTCAAGTC GAACAGCCGC TACGAAATCA TGGTGCCCGC CGGCATCAGG AGCACCACGG GCGAACGGTC GGCCCATCCG CTAACGAGCA GCTTCGATAC CGCGCCCGTT ACGCCGCCGC GCGGCCTGCC CAATCTCGAC GGCGCCTCGT GCTTCATCAA CACGGCGCTG CAATTGGCGG TTCACTCGTC GGCGCTCGAC GACATTCTGT CGAACGAAGC CGTCCCGCCC GCCGTCCGCA CGCTGCTCGA AGACTACGAC GCCGCATCGG CTGACGCACT CGACGCGCAG TTGGCCGCCG CGGTCGCCGC GCTGCGCGCC ACGCCGGAGG TCCCGGACAG CGGGCCGGGA CAAACGCTGG AAGTGATGCA AGCGTTGCGG ATGCCGTTAT ATGACACGAG CAGCGCGAAC AACGCAAAGA ACAACGCCGA CGCCATACGT CATGCGCCGC CCAACACCAA GGCGTTCTTT CTGAACTCCT ATCCACCGCT TTCCTACGCG GATCTGCCGA ACCACGACCG GCTCGTCGCG TTCGACTACA GCACGGGCGG TCACTATGTC 
GCTTATGTGA AGCGGGATGG AATCTGGTAT CGAATCGACG ATGCCCAGGT CAGCGCCGTC AACGAACAGG ACTTGCTTGC CCTGCCGGCG TTCAACCCCG CTAACGGCAG CGTGTCGATC GAAATCGCGA TCTATCGATG A
|
Protein sequence | MLDAARQRRR RRAVESSSHR VIESSSHRVI ESSICRIHLR SLSIDVHLLS RNDESRFESR FGSSRSSQRA AFDTKRRRLR RTVFDSCSRR RRCAAADSHG TPFRYAAISS TRTAPLAPQF TRGLPRRRAP LGIRRFSGRK ERRPNAHRAN RAVRESALAE ARPRMNARRP AFGLIASHAS RRRAVESVRA HFPFMFTLCL AMKHTSNHSD PSGAPSSARR AASDSSRLVR GMRSPQRGGA ALALAVASLA GCGGGDSGEP APREFAPPTV QLAYPTQPNS PVAPAPTAHV SSGHTPPATA PAAMPTASPT ATPTASPTAT PTASPTATAA ASPSAPSTAT PNAPPAAPPA VVATRVPPTH AALRRPTIEL EFDRAIEPGS VPHIVLRADD GTSVAVGPLS WLSDRRIAFA PRKPLKSNSR YEIMVPAGIR STTGERSAHP LTSSFDTAPV TPPRGLPNLD GASCFINTAL QLAVHSSALD DILSNEAVPP AVRTLLEDYD AASADALDAQ LAAAVAALRA TPEVPDSGPG QTLEVMQALR MPLYDTSSAN NAKNNADAIR HAPPNTKAFF LNSYPPLSYA DLPNHDRLVA FDYSTGGHYV AYVKRDGIWY RIDDAQVSAV NEQDLLALPA FNPANGSVSI EIAIYR
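The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be normalized and its GC fraction computed is shown below; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
first_blocks = "GTGCTCGATG CGGCGCGGCA"
cleaned = clean_sequence(first_blocks)
print(len(cleaned), gc_fraction(cleaned))
```

Applying the same two helpers to the complete Gene sequence field would give a value to compare against the GC content reported in the record.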
|