Gene Information |
Locus tag | BMA10247_A2007 |
Symbol | |
ID | 4889792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | - |
Start bp | 1934899 |
End bp | 1935969 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640148271 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001079183 |
Protein GI | 126445740 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.630648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGA TCCGCTCGGC GGTGCCGCCG CCGCCGCTGC CCGAAGTCGT CGAGGGGCAG CGTTACGCGA CGCCGCAGCG CGGCGTCACG CTGGCCGACA CGATGTTCGT CGATTGCCAC TTCGAGCGCG TCGAATGGAC CGGCTGCCGG CTCGCGAACC TGCGCTTCGT GAACTGCACG TTCGACGCGA ACCGCTTCGA TCGCTGCGAG CTCGAGAAGC TCTCGTACGA TTCGAGCCGG ATTCGCGCGG GCGCGTGGAC GCAGAGCGCG CTGCAGCGCG TGTCGTTCAA CGAATGCGAG CTCGACGGCG GCACGTGGAC GGGCAGCCTG GTGAAAGACG TCGTATGCAC GCAGTCGAAG GGCGGCGCGT GGACGTTCGA CGCGGTGCGC GGCGCGCACG TGTCGCTCGT CGCGGGCGAT TACGCGGGCG TCACGCTGCG CGGCGGCCGC TGGAGCGATA CGTCGTGGAT CGGCAGCCGG CTCGCCGACC TGCGGCTCGA ATCGGTCGAG CTCGAGAACC TGATCGCCGG GCAAAGCGGC TTCGAGCGCG TGGTGCTCGT CGAGTGCCGC GGCGTGAACG TGCGCTGGAT CGATTCGCGG ATCGAGCGGA TGACCGTGCA CGGCTGCGAG CTGAAGCAGG CGGCGTGGTC GCACAGCACA TGGGCGACGG GCGAGATTCA CGCGAGCCGG CTGCCGATCG CGAGCTTCGA TCATGCGAGC GTCAACGGGC TGACGGTGAC GAACAGCGAA TTGCCGCAGG CGATCTTCGA TAGCGCGAGC GTCGCGGACA GCGCGCTGCA AGGCGTGCGC GCGCCGCGCA TCGCATTGCG CGACGCATGG CTCACGCGCG TGAACCTGTC GGGCGCGCAG ATGCAGCAGC TCGATGCGCG CGGCGTGCAT CTGGAGCGCG TCGACCTGCG CGGCGCCGAT TGCCGCGGCG GCAACCTGAT CGGCCAACTG AGCCACACGT GGGCGGCGGC CGATACGCGG GACGCGATTT TCGAAGAAGC CACGAGCGCC GACGACCGGC TCTGGTGGCA GCGAGTTCAA CCCGGAGCAA GAGGAGTTTG A
|
Protein sequence | MSKIRSAVPP PPLPEVVEGQ RYATPQRGVT LADTMFVDCH FERVEWTGCR LANLRFVNCT FDANRFDRCE LEKLSYDSSR IRAGAWTQSA LQRVSFNECE LDGGTWTGSL VKDVVCTQSK GGAWTFDAVR GAHVSLVAGD YAGVTLRGGR WSDTSWIGSR LADLRLESVE LENLIAGQSG FERVVLVECR GVNVRWIDSR IERMTVHGCE LKQAAWSHST WATGEIHASR LPIASFDHAS VNGLTVTNSE LPQAIFDSAS VADSALQGVR APRIALRDAW LTRVNLSGAQ MQQLDARGVH LERVDLRGAD CRGGNLIGQL SHTWAAADTR DAIFEEATSA DDRLWWQRVQ PGARGV
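The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1934899
end_bp = 1935969
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so a CDS of
# N codons encodes N - 1 amino acid residues.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length, and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, expected_protein_length, record_is_consistent())
```

Under these assumptions the reported values agree: the coordinate span equals the gene length, and the gene length is a whole number of codons that yields the stated protein length.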