Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2148 |
Symbol | |
ID | 4886833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2085732 |
End bp | 2086787 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640132085 |
Product | AraC-type DNA-binding domain-containing proteins |
Protein accession | YP_001063142 |
Protein GI | 126443771 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGA TGCACGCGCC CCGCGGCGGC ACGCTGCTGC AATTCTTTTC GACCGACGAC ATGCCGCTCG CGCGCGCGGC GGCGTTCTGG AGCGCGCACG TGTTCCAGTG CGAGGACGTG CGCGCGTCGT CCGCGCGCGC GTTTCACGGG CACGGCTTTC TGTGCCGCTG CGAGCGCGGC CGGTTCGTTC GCTTTCGCGG CGCGTCGCTC GACACGCGCA TCGGCGCGGC GTGGCTGAGC GCCGCGCCAG CCGATGCGTA CGTGACGATC TGCGCGCTGC ATGCGGGCGA GTGCACGGTC GAAGCGCCCG GCTTGCCGGA TGTGCGCTTT CGTGCGAACG AGCTGTTCAT GCTGGACGGC GGGCAGCCGA TGCGCGTGCG CTGGAGCGAG CCGTGTTTCA GCGCGCTCAG GCTGCCGCGC GCATCGGTGG GGCGCACGCT CGGCCAGGCG GCGATGGACG CGTCGCCGGG CGCGGCTTCG TTGCAGGAGG CGCGGCTCGC GCCGTTTCTC GCGGCGGAGC TCGCGCTGAT CGGCGGTCGC GGCCCGACGC TGTCGTCGGA CGAGCTCGAT TACATGCTCG CGCGCGCAGC CGAGCTCGGC CGCACGCTGC TTCAGGCGGC GCTGTCGTCG CGCGCGCGGC GCGGCGCGCC CGCGCGCGCC GACCGGCTGC AGGCCGCGTA TCGCTACATC GAGCAGCATC TTCATCTGCC GACGCTCACA CCCGAGCGGA TCGCCGACGC GATCCATTGC TCGCGCACGC AGCTCTATCG GCTGTTCCGC CATGAATCGC AGACGGTGAA GGCGGCGCTG CGCGACGCGC GGCTGAACCG CAGCCTCGGC TATCTCGAGC AGCCCGAGGT TACGCTCAGC ATCGGCGAGA TCGCGCATGC TTGCGGTTTT CCCGATCAGT CGACGTTCGG CAAGCTGTTT CGCCGGCGCT TCGGCAGAAC GCCCGGCGAG GTGCGCCGCG CCGCGCGGGG GCGTTGCAAT GAAACCGTGT TGCCCGACTG CGCGGAAAGC GGCGACGCGG CGGATGTGCA AACGCCGCGG CGGTGA
|
Protein sequence | MAKMHAPRGG TLLQFFSTDD MPLARAAAFW SAHVFQCEDV RASSARAFHG HGFLCRCERG RFVRFRGASL DTRIGAAWLS AAPADAYVTI CALHAGECTV EAPGLPDVRF RANELFMLDG GQPMRVRWSE PCFSALRLPR ASVGRTLGQA AMDASPGAAS LQEARLAPFL AAELALIGGR GPTLSSDELD YMLARAAELG RTLLQAALSS RARRGAPARA DRLQAAYRYI EQHLHLPTLT PERIADAIHC SRTQLYRLFR HESQTVKAAL RDARLNRSLG YLEQPEVTLS IGEIAHACGF PDQSTFGKLF RRRFGRTPGE VRRAARGRCN ETVLPDCAES GDAADVQTPR R
|
| |