Gene Information |
Locus tag | BMA10229_A2823 |
Symbol | |
ID | 4791036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | - |
Start bp | 2849863 |
End bp | 2851083 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_001028772 |
Protein GI | 124386264 |
COG category | |
COG ID | |
TIGRFAM ID | |
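
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2849863
end_bp = 2851083
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the strand field only affects which end is the start codon, not the span.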
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0821891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCGT CCGCCCCGCT TCCCCCGCCC GTTCCCCCGC CCGCCGACAC GGCGTCGCCC CTGCAAGCGG CAAGCCCGGT CGCGTGCGAC ATCTTCTGTG CGGTCGTCGA CAACTTCGGC GACATCGGCG TGTGCTGGCG TCTCGCGCGC CAGCTCGCGC TCGAGCACGG CTGGCAGGTG CGGATCTTCG TCGACGCGCT CGCGACGTTC GCGCGCCTGC AGCCGGCCGC GTTGCCCGAC GCCGCGCAGC AGACCGTCGA CGGCATCGTC GTCGAGCACT GGCGCGCGCC CGCGCACGCG GGCGACACGC TCGAGATCGC CGACATCGTG ATCGAGGCGT TCGCCTGCGA GCTGCCGGGC GCGTATGTCG CCGCGATGGC GCGCCGCGCG CGGCCGCCCG TCTGGATCAA CCTCGAATAC CTGAGCGCCG AGGACTGGGT CGGCGAATTC CATCTGCGCC CGTCGCCGCA TCCGCGCTAC CCGCTCACGA AGACGTTCTT CTTCCCTGGC CTCGGGCCCG GCACGGGCGG CGTGCTGAAG GAGCGCGATC TCGACGCGCG CCGCGCCGCG TTCGAAACCG GCGACGATGC GCGCCGCACG TGGTGGCAAA ACGTCGCGGG CGCGCCGATA CCCGCTCCGG ACACCACCGT CGTGTCGCTC TTCGCGTACG AGAATCCGGC GCTCGACGCG CTGCTCGAAC AGTGGCGCGA CGGCCGCGAG CCGGTCGCGC TGCTCGTGCC CGAAGGCAGG ATCTCGGCGC GCGTCGCGCG CTTCTTCGGG GCCGGCGCGT TCGGCGCCGG CGCGCACGCG GCGCGCGGCA GCCTCGTCGC ACACGGTCTC GCCTTCGTCG CGCAGCCCGA CTACGACCGG CTGCTGTGGG CGAGCGACGT GAACTTCGTG CGCGGCGAGG ATTCGTTCGT CCGCGCGCAA TGGGCGCGCC GGCCGTTCGT CTGGCAGATC TATCCGCAGG CCGACGACGC GCATCTGCCG AAGCTCGACG CGGCGCTCGC GCACGTCACC GCACGTGTCG ATCACGCGAC GCGCGCGGCG ACCGAGCGCT TCTGGCACGC CTGGAACGGC GCGGGCACGC CCGATTGGAC CGATTTCTGG CGGCACCGCG CGGCGCTCGC CGCGCGCGCC GCGAGTTGGG CGGACGAGCT CGCGGCCGTC GGCGACCTCG CCGGAAATCT GGCGAATTTT GCAAAAACTC AGTTAAAATA A
|
Protein sequence | MTSSAPLPPP VPPPADTASP LQAASPVACD IFCAVVDNFG DIGVCWRLAR QLALEHGWQV RIFVDALATF ARLQPAALPD AAQQTVDGIV VEHWRAPAHA GDTLEIADIV IEAFACELPG AYVAAMARRA RPPVWINLEY LSAEDWVGEF HLRPSPHPRY PLTKTFFFPG LGPGTGGVLK ERDLDARRAA FETGDDARRT WWQNVAGAPI PAPDTTVVSL FAYENPALDA LLEQWRDGRE PVALLVPEGR ISARVARFFG AGAFGAGAHA ARGSLVAHGL AFVAQPDYDR LLWASDVNFV RGEDSFVRAQ WARRPFVWQI YPQADDAHLP KLDAALAHVT ARVDHATRAA TERFWHAWNG AGTPDWTDFW RHRAALAARA ASWADELAAV GDLAGNLANF AKTQLK
|
| |
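
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. It is an illustration under assumptions: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not state how its GC percentage was calculated or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Join display blocks by removing all whitespace, and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in `dna` that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three display blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGACGTCGT CCGCCCCGCT TCCCCCGCCC")
print(len(prefix))
print(round(gc_fraction(prefix), 2))
```

To check the full record, pass the complete "Gene sequence" value to `clean_sequence` and compare the result of `gc_fraction` with the reported percentage.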