Gene Information |
Locus tag | BTH_II1891 |
Symbol | |
ID | 3845344 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2290614 |
End bp | 2291684 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637839192 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_440085 |
Protein GI | 83716702 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGA TCCGCTCGGC GGTGCCGCCG CCGCCGCTGC CCGAGATCGT CGAGGGGCAG CGCTACGTGT CTGCGCAGCG CGACGTCGCG CTCGCCGACA CGCTGTTCGT CGATTGCCAC TTCGAGCGCG TCGAATGGAC CGGCTGCCGG CTGTCGAACC TGCGCTTCGT GAACTGCACG TTCGATGCGA ACCGCTTCGA TCGATGCGAG CTCGAGAAGC TCTCGTACGA ATCGAGCCGG GTCCGCGAGG GCGCGTGGAC GCAAAGCGCG TTGCAGCGCG TGTCGTTCAA CGAGTGCGAG ATCGACGGGG GCGCGTGGGC GGGCTGCCTG CTGAAGGACG TCGTGTGCTC GCAGTCGAAG GGCGGCGCCT GGACGTTCGA CGCCGTGCGC GGCGCGCACG TGTCGCTCGT CGCGGGCGAA TACGCGGGCG TCACGCTGCG CGGCGGCCAC TGGAGCGACA CGTCGTGGAT CGGCAGCCGG CTCGTCGATC TGCGGCTCGA ATCGGTCGGG CTCGAGAATC TGATCGCCGG GCAAAGCGGC TTCGAGCGCG CGGTGCTCGT CGAATGCCGC GGGATCAACG TACGCTGGAT CGATTCCCGG ATCGAGCGGA TGACCGTTCA AGGCTGCGAG CTGAAGCAGG CTGCCTGGTC GCACAGCACA TGGGCGACGG GCGAGATCCA CGCGAGCCGG CTGCCGATCG CGAGCTTCGA TCACGCGAGC GTCAACGGCC TGACGGTGAC GAACAGCGAA TTGCCGCAGG CGATCTTCGA CAGCGCGAGC GTGGCGGACA GCGCGCTGCA AGGCGTGCGC GCGCCGCGCA TCGCGTTGCG CGACGCATGG CTCACGCGCG TGAACCTCGC GGGCGCGCAA TTGCAGCAGC TCGACGCGCG CGGCGTGCGT CTGGAGCGCG TCGACTTGCG CGGCGCCGAT TGCCGCAGCG GCAATCTGGT CGGCCAGCTT CGCCAGACGT GGGCGGCGGC CGATACGCGG GACGCGGTTT TCGAGGAAGC CACGAGTGCC GACGATCGGC TCTGGTGGCA GCGAGTGCAA CCCGGAGCAA GAGGAGTTTG A
|
Protein sequence | MSKIRSAVPP PPLPEIVEGQ RYVSAQRDVA LADTLFVDCH FERVEWTGCR LSNLRFVNCT FDANRFDRCE LEKLSYESSR VREGAWTQSA LQRVSFNECE IDGGAWAGCL LKDVVCSQSK GGAWTFDAVR GAHVSLVAGE YAGVTLRGGH WSDTSWIGSR LVDLRLESVG LENLIAGQSG FERAVLVECR GINVRWIDSR IERMTVQGCE LKQAAWSHST WATGEIHASR LPIASFDHAS VNGLTVTNSE LPQAIFDSAS VADSALQGVR APRIALRDAW LTRVNLAGAQ LQQLDARGVR LERVDLRGAD CRSGNLVGQL RQTWAAADTR DAVFEEATSA DDRLWWQRVQ PGARGV
|
| |
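The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(2290614, 2291684)  # Start bp and End bp from the record
    print(length)                  # 1071, matching "Gene Length"
    print(protein_length(length))  # 356, matching "Protein Length"
```

Passing the Gene sequence string from the Sequence section to `gc_percent` gives a value that can be compared against the GC content field; the 10-base spacing does not need to be removed first.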