Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_A3120 |
Symbol | |
ID | 4793933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | + |
Start bp | 3156840 |
End bp | 3159410 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001029061 |
Protein GI | 124384467 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATGC GCCGCTCGGG CGGCGCGCAA GGCGGAGGCG CGATGCGGGC GAAGTGCGGC GCGTTCGCGC TCGGCGTGGT CGCGCTGCAG CAGCAGGCGG CGTTGCCGGG CGCGGCGGCA TGGGCGGGCG GCGCGCTCGC GTTCGGCTTG TGCGTGTGGC TCGCGCTCGT GTGGCGCGGC GATGCGCGCG CGCGAGCGGC CGGGTTCTGC GCATGCTGCT GCGCGGCGGC GCTCGCGGGC TTCGGCTACG CCGCGGCGCG CGCGCAGTGG CGGCTCGCCG ATACGCTGCC CGCGCAGTGG GAAGGGCGCG ACATCGTCGT GACGGGCGCC GTGCGCGGGC TGCCGTCGCG CGACGCGAAC GGCGTGCGTT TCCTGTTCGA CGTCGACGCA AACGATGCGC GCATTGCGCG TTTTCCGGCG ACGCTGTCGC TTGGCTGGTA CGCGTTCGGC CGCCCGGGCG CGCAGCCGCC CGAACTCGTG CCGGGCGACC GGTGGCGGCT GCGCGTGCGT CTGAAACGCC CGCACGGCAA TGCGAATTTC GGCGTGCGCG ACGCGGAGGC GGCGTGGCTC GCGCGCGGCA TCCGGGCGCT CGGCTACGTG TCGGCGCCGC GCGATGCGCG GCGGCTCGCG GGGCGTGCAT CCGGCGTCGC GGCCGCGCTC GACCGGCTGC GCGCGCGGCT GCGCGGGCGC ATCGCCGAGG CGCTCGGCGA CGCCGCGCAT CGCGGGATCG TCGTCGCGCT CGCGATCGGC GCGCAGGACG ATATCGCCGA CGACGACCGG CGCATCCTGC GCGATACCGG CACGAGCCAT CTCGTCGCGA TTTCCGGGCT GCACGTCGGG ATGGTCGGGG GCCTGTGCGC GTGGCTCGCG GGCGGGCTCT GGCGGCGCTC GGGCTACGTC GGACGCGACT GGCCGCTCGT CGTGCCCGCG CAGAAAGTCG CGGCGCTCGG CGCGATCGTC GGCGGCGCCG GCTATGCGGC GCTCGCGGGC TTCAACGTGC CCGTGCAGCG CGCGTGGTGG ATGCTTGCCG CCGCGGGCGT CGCGTATCTG AGCGCTCGCT CGCTCGCGCC TTCGTCGGTG CTGGCGGCCG CGCTCGGCTG CGTGCTGCTC GTCGATCCGT GGGCGGTGAC GTCGGCGGGG TTCTGGCTGT CGTTCTGCGC GGTCGCGGCG ATCCTGGCCG TATCGTCGGG GTGGCGCGCC CCGCGCGATC TCGACGAGAC GCGCGGCGCG CGCCACGCGC TCGGCGGCGT GGGTCACGAA GGCGCGCCGC CGTCGCATCG CGCGCGGTGG CGCGCCGCGT GCGGATTCGC GTGGGGGTGG GCGGCGCGCG CGTTCGGCCG GCTCGCCCGG CGCGTGCGCG ATGCCGCGCG GGCGCAGTTC GCGGTGACGA TCGCGCTCGC GCCGCTCACC GTGCTCTGGT TCGCGCAGAT CCCGCTCGCC GGCCCGCTCG CGAACGCGTT CGCGATTCCG TGGGTCGGCT CGCTCGTCAC GCCGATCGTG CTCGCGGGCA TCGTGTTGCC GGCGCCGCTC GACGCGTCCG CATACGCGCT CGCGCATACA CTCGTTCAGG CGCTGATGCG GCTGCTCGAC GCGACGGCCG GCGCGGGGCG CACGGTCTGG ATGCTGCCGG CGCCGGACGG CTTCGCGCTC GCCGCGGCGG CGGTGGGCGT CGCGTGGGCG TTGATGCCGC GCGGCTGGCC GGTGCGCTGC GCGGCGCCGC TCGCGTGGCT GCCGCTCGTC GCGCCGGCGC CGCTTGCGCC GCCCGACGGC GCGTTCAGGC TGACGGCGCT CGACGTCGGG CAGGGCTCCG CGGTGCTGAT CGAGACCGCG CGGCACACGC TGCTGTTCGA CGCGGGCCCG GGGCCGGAGG CGTCGAACGC GGGCGAGCGG ATCGTCGTGC CGTTCTTGCG CGCGCGAGGC GTGCGCGCGC TCGACGCGCT CGTCGTGAGC CACGCGGATT CGGATCACGC GGGCGGCGCA CCCGCAGTAC TCGGATCGAT CGCCGTCGCG CAGATGGCGG GCGGGCTGCC GCCGTCGAAC CGCCTGTGGC GCGCCGCGCG CGCGGCCGGC GTGGCCGACG CGCTGCCGTG CGTGGCGGGG CAGCGCTGGC GCTGGGACGG CGTCGAGTTC GACGCGCTAT GGCCGGCCGG CGGCCCGCGC GCGGGCGGCG CGACGAACGC GCAGTCGTGC GTGCTGCGCG TATCGGCGGG CGGGCGCGCG GCGCTCCTGA CGGGCGATGT CGATGCGTGC TCCGAGCGCG CGCTCGTCGC CGGATCGCGC GGCGCGCTCG CCGCGCAGGT GCTCGTCGTG CCGCACCACG GCAGCCGCAC GTCGTCGACT GAGCCTTTCC TCGATTCGGT CAAACCGCGC ATTGCAATAT TTCAGGTAGG CTACGCCAAC AGGTTTCACC ATCCGCATCC GACCGTCTGG GCGCGCTATG CCGGGCGCGG CATCGAGTTG CCGCGCACCG ACCGCGACGG CGCCGTGCGC GTCGACGTGA CATCGAGCGG CGCGCTTGCC GAGCCGGTGC GGTATCGGGA CGCGCACCGG CGCTACTGGA TGGGCCGTTG A
|
Protein sequence | MDMRRSGGAQ GGGAMRAKCG AFALGVVALQ QQAALPGAAA WAGGALAFGL CVWLALVWRG DARARAAGFC ACCCAAALAG FGYAAARAQW RLADTLPAQW EGRDIVVTGA VRGLPSRDAN GVRFLFDVDA NDARIARFPA TLSLGWYAFG RPGAQPPELV PGDRWRLRVR LKRPHGNANF GVRDAEAAWL ARGIRALGYV SAPRDARRLA GRASGVAAAL DRLRARLRGR IAEALGDAAH RGIVVALAIG AQDDIADDDR RILRDTGTSH LVAISGLHVG MVGGLCAWLA GGLWRRSGYV GRDWPLVVPA QKVAALGAIV GGAGYAALAG FNVPVQRAWW MLAAAGVAYL SARSLAPSSV LAAALGCVLL VDPWAVTSAG FWLSFCAVAA ILAVSSGWRA PRDLDETRGA RHALGGVGHE GAPPSHRARW RAACGFAWGW AARAFGRLAR RVRDAARAQF AVTIALAPLT VLWFAQIPLA GPLANAFAIP WVGSLVTPIV LAGIVLPAPL DASAYALAHT LVQALMRLLD ATAGAGRTVW MLPAPDGFAL AAAAVGVAWA LMPRGWPVRC AAPLAWLPLV APAPLAPPDG AFRLTALDVG QGSAVLIETA RHTLLFDAGP GPEASNAGER IVVPFLRARG VRALDALVVS HADSDHAGGA PAVLGSIAVA QMAGGLPPSN RLWRAARAAG VADALPCVAG QRWRWDGVEF DALWPAGGPR AGGATNAQSC VLRVSAGGRA ALLTGDVDAC SERALVAGSR GALAAQVLVV PHHGSRTSST EPFLDSVKPR IAIFQVGYAN RFHHPHPTVW ARYAGRGIEL PRTDRDGAVR VDVTSSGALA EPVRYRDAHR RYWMGR
|
| |