Gene Information |
Locus tag | BURPS668_A2142 |
Symbol | |
ID | 4888086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2082514 |
End bp | 2083911 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640132078 |
Product | hypothetical protein |
Protein accession | YP_001063135 |
Protein GI | 126443193 |
COG category | |
COG ID | |
TIGRFAM ID | |
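The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes two things the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2082514
end_bp = 2083911
gene_length_bp = 1398
protein_length_aa = 465

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced and ends with one untranslated stop codon,
# so the protein has one residue fewer than the number of codons.
coding_codons = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.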
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.264049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCTC GACGTCCGGC GTTCGGCCTG ATTGCGTCGC ACGCGTCACG CCGCCGGGCC GTCGAATCCG TGCGCGCCCA CTTTCCGTTC ATGTTCACCC TCTGTCTTGC GATGAAACCC ACTTCGAACC ACTCCGACCC TTCCGGTGCG CCGTCGTCAG CGCGGCGCGC CGCATCCGAT TCGTCCCGCC TCGTCCGCGG CATGCGCTCG CCCCAACGCG GCGGCGCGGC ACTGGCGCTG GCCGTCGCCT CGCTCGCCGG ATGCGGGGGC GGCGATTCGG GCGAACCCGC TCCACGCGAA TTCGCGCCCC CGACAGTGCA ACTCGCCTAC CCGACGCAAC CGAACTCGCC CGTCGCACCA GCGCCCACCG CGCACGTATC GAGCGGACAC ACGCCTCCGG CCACGGCGCC CGCCGCGATG CCCACCGCTT CACCCACCGC GACGGCTGCC GCTTCGCCCT CCGCGCCATC CACTGCGACG CCCAACGCCC CGCCCGCCGC TCCGCTGGCC GTCGTCGCCA CCCGCGTCCC GCCGACGCAC GCGGCGTTGC GCCGCCCGAC GATCGAACTC GAATTCGATC GCGCGATCGA GCCGGGTTCA GTCCCACACA TCGTGCTGCG CGCCGACGAT GGCACGAGCG TCGCCGTCGG CCCGTTGTCG TGGCTGAGCG ATCGCCGGAT CGCGTTCGCG CCGCGCAAGC CGCTCAAGTC GAACAGCCGC TACGAAATCA TGGTGCCCGC CGGCATCAGG AGCACCACGG GCGAACGGTC GGCCCATCCG CTAACGAGCG GCTTCGATAC CGCGCCCGTC ACGCCGCCGC GCGGCCTGCC CAATCTCGAC GGCGCCTCGT GCTTCATCAA CACGGCGCTG CAATTGGCGG TTCACTCGTC GGCGCTCGAC GACATTCTGT CGAACGAAGC CGTCCCGCCC GCCGTCCGCA CGCTGCTCGA AGACTACGAC GCCGCATCGG CTGACGCGCT CGACGCGCAG TTGGCCGCCG CGGTCGCCGC GCTGCGCGCC ATGCCGGAGG TCACGGACAG TGGGGCGGGA CGAACGCTGG AAGTGATGCA CGCGTTGCGG ATGCCGCTAT ACGACGCGAG CAGCGCGAAC AACGCAACGA ACAACGCCGA CGCCATACGT CATGCGCCGC CCAACACCAA GGCGTTCTTT CTGAACTCCT ATCCACCGCT TTCCTACGCG GATCTGCCGA ACCACGACCG GCTCGTCGCG TTCGACTACA GCACGGGCGG TCACTATGTC GCTTATGTGA AGCGGGATGG AATCTGGTAT CGAATCGACG ATGCCCAGGT CAGCGCCGTC AACGAACAGG ACTTGCTTGC CCTGCCGGCG TTCAACCCCG ACGGCAGCGT GTCGATCGAA ATCGCGATCT ATCGATGA
|
Protein sequence | MNARRPAFGL IASHASRRRA VESVRAHFPF MFTLCLAMKP TSNHSDPSGA PSSARRAASD SSRLVRGMRS PQRGGAALAL AVASLAGCGG GDSGEPAPRE FAPPTVQLAY PTQPNSPVAP APTAHVSSGH TPPATAPAAM PTASPTATAA ASPSAPSTAT PNAPPAAPLA VVATRVPPTH AALRRPTIEL EFDRAIEPGS VPHIVLRADD GTSVAVGPLS WLSDRRIAFA PRKPLKSNSR YEIMVPAGIR STTGERSAHP LTSGFDTAPV TPPRGLPNLD GASCFINTAL QLAVHSSALD DILSNEAVPP AVRTLLEDYD AASADALDAQ LAAAVAALRA MPEVTDSGAG RTLEVMHALR MPLYDASSAN NATNNADAIR HAPPNTKAFF LNSYPPLSYA DLPNHDRLVA FDYSTGGHYV AYVKRDGIWY RIDDAQVSAV NEQDLLALPA FNPDGSVSIE IAIYR
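The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, not the database's own method, of how the GC content field could be recomputed from the displayed gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first three blocks of the gene sequence rather than the full 1398 bp.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_percent(displayed: str) -> float:
    """Return the percentage of G and C bases in a displayed nucleotide sequence."""
    seq = clean_sequence(displayed)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence above, used only as a short demo.
prefix = "ATGAACGCTC GACGTCCGGC GTTCGGCCTG"
print(len(clean_sequence(prefix)))
print(round(gc_percent(prefix), 1))
```

Passing the full Gene sequence value to `gc_percent` in the same way would give a figure to compare against the GC content field.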