Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2581 |
Symbol | |
ID | 4885063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2549763 |
End bp | 2552333 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640128509 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001059607 |
Protein GI | 126439024 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATGC GCCGCTCGGG CGGCGCGCAA GGCGGAGGCG CGATGCGGGC GAAGTGCGGC GCGTTCGCGC TCGGCGTGGT CGCGCTGCAG CAGCAGGCGG CGTTGCCGGG CGCGGCGGCA TGGGCGGGCG GCGCGCTCGC GTTCGGCTTG TGCGTGTGGC TCGCGCTCGT GTGGCGCGGC GATGCGCGCG CGCGAGCGGC CGGGTTCTGC GCATGCTGCT GCGCGGCGGC GCTCGCGGGC TTCGGCTACG CCGCGGCGCG CGCGCAGTGG CGGCTCGCCG ATACGCTGCC CGCGCAGTGG GAAGGGCGCG ACATCGTCGT GACGGGCGCC GTGCGCGGGC TGCCGTCGCG CGACGCGAAC GGCGTGCGTT TCCTGTTCGA CGTCGACGCA AACGATGCGC GCATTGCGCG TTTTCCGGCG ACGCTGTCGC TTGGCTGGTA CGCGTTCGGC CGCCCGGGCG CGCAGCCGCC CGAACTCGTG CCGGGCGACC GGTGGCGGCT GCGCGTGCGT CTGAAACGCC CGCACGGCAA TGCGAATTTC GGCGTGCGCG ACGCGGAGGC GGCGTGGCTC GCGCGCGGCA TCCGGGCGCT CGGCTACGTG TCGGCGCCGC GCGATGCGCG GCGGCTCGCG GGGCGTGCAT CCGGCGTCGC GGCCGCGCTC GACCGGCTGC GCGCGCGGCT GCGCGGGCGC ATCGCCGAGG CGCTCGGCGA CGCCGCGCAT CGCGGGATCG TCGTCGCGCT CGCGATCGGC GCGCAGGACG ATATCGCCGA CGACGACCGG CGCATCCTGC GCGATACCGG CACGAGCCAT CTCGTCGCGA TTTCCGGGCT GCACGTCGGG ATGGTCGGGG GCCTGTGCGC GTGGCTCGCG GGCGGGCTCT GGCGGCGCTC GGGCTACGTC GGACGCGACT GGCCGCTCGT CGTGCCCGCG CAGAAAGTCG CGGCGCTCGG CGCGATCGTC GGCGGCGCCG GCTATGCGGC GCTCGCGGGC TTCAACGTGC CCGTGCAGCG CGCGTGGTGG ATGCTTGCCG CCGCGGGCGT CGCGTATCTG AGCGCTCGCT CGCTCGCGCC TTCGTCGGTG CTGGCGGCCG CGCTCGGCTG CGTGCTGCTC GTCGATCCGT GGGCGGTGAC GTCGGCGGGG TTCTGGCTGT CGTTCTGCGC GGTCGCGGCG ATCCTGGCCG TATCGTCGGG GTGGCGCGCC CCGCGCGATC TCGACGAGAC GCGCGGCGCG CGCCACGCGC TCGGCGGCGT GGGTCACGAA GGCGCGCCGC CGTCGCATCG CGCGCGGTGG CGCGCCGCGT GCGGATTCGC GTGGGGGTGG GCGGCGCGCG CGTTCCGCCG GCTCGCCCGG CGCGTGCGCG ATGCCGCGCG GGCGCAGTTC GCGGTGACGA TCGCGCTCGC GCCGCTCACC GTGTTGTGGT TCGCGCAGAT CCCGCTCGCC GGCCCGCTCG CGAACGCGTT CGCGATTCCG TGGGTCGGCT CGCTCGTCAC GCCGATCGTG CTCGCGGGCA TCGTGCTGCC GGCGCCGCTC GACGCGTCCG CATACGCGCT CGCGCATGCA CTCGTTCAGG CGCTGATGCG GCTGCTCGAC GCGACGGCCG GCGCGGGGCG CACGGTCTGG ATGCTGCCGG CGCCGGACGG CTTCGCGCTC GCCGCGGCGG CGGTGGGCGT CGCGTGGGCG TTGATGCCGC GCGGCTGGCC GGTGCGCTGC GCGGCGCCGC TCGCGTGGCT GCCGCTCGTC GCGCCGGCGC CGCTTGCGCC GCCCGACGGC GCGTTCAGGC TGACGGCGCT CGACGTCGGG CAGGGCTCCG CGGTGCTGAT CGAGACCGCG CGGCACACGC TGCTGTTCGA CGCGGGCCCG GGGCCGGAGG CGTCGAACGC GGGCGAGCGG ATCGTCGTGC CGTTCTTGCG CGCGCGAGGC GTGCGCGCGC TCGACGCGCT CGTCGTGAGC CACGCGGATT CGGATCACGC GGGCGGCGCA CCCGCAGTAC TCGGATCGAT CGCCGTCGCG CAGATGGCGG GCGGGCTGCC GCCGTCGAAC CGCCTCTGGC GCGCCGCGCG CGCAGCCGGC GTGGCCGACG CGCTGCCGTG TGTGGCGGGG CAGCGCTGGC GCTGGGACGG CGTCGAGTTC GACGCGCTAT GGCCGACCGG CGGCCCGCGC GCGGGCGGCG CGACGAACGC GCAGTCGTGC GTGCTGCGCG TATCGGCGGG CGGGCGCGCG GCGCTCCTGA CGGGCGATGT CGATGCGCGC TCCGAGCGCG CGCTCGTCGC CGGATCGCGC GGCGCGCTCG CCGCGCAGGT GCTCGTCGTG CCGCACCACG GCAGCCGCAC GTCGTCGACT GAGCCTTTCC TCGATTCGGT CAAACCGCGC ATTGCAATAT TTCAGGTAGG CTACGCCAAC AGGTTTCACC ATCCGCATCC GACCGTCTGG GCGCGCTATG CCGGACGCGG CATCGAGTTG CCGCGCACCG ACCGCGACGG CGCCGTGCGC GTCGACGTGA CATCGAGCGG CGCGCTTGCC GAGCCGGTAC GGTATCGGGA CGCGCACCGG CGCTACTGGA TGGGCCGTTG A
|
Protein sequence | MDMRRSGGAQ GGGAMRAKCG AFALGVVALQ QQAALPGAAA WAGGALAFGL CVWLALVWRG DARARAAGFC ACCCAAALAG FGYAAARAQW RLADTLPAQW EGRDIVVTGA VRGLPSRDAN GVRFLFDVDA NDARIARFPA TLSLGWYAFG RPGAQPPELV PGDRWRLRVR LKRPHGNANF GVRDAEAAWL ARGIRALGYV SAPRDARRLA GRASGVAAAL DRLRARLRGR IAEALGDAAH RGIVVALAIG AQDDIADDDR RILRDTGTSH LVAISGLHVG MVGGLCAWLA GGLWRRSGYV GRDWPLVVPA QKVAALGAIV GGAGYAALAG FNVPVQRAWW MLAAAGVAYL SARSLAPSSV LAAALGCVLL VDPWAVTSAG FWLSFCAVAA ILAVSSGWRA PRDLDETRGA RHALGGVGHE GAPPSHRARW RAACGFAWGW AARAFRRLAR RVRDAARAQF AVTIALAPLT VLWFAQIPLA GPLANAFAIP WVGSLVTPIV LAGIVLPAPL DASAYALAHA LVQALMRLLD ATAGAGRTVW MLPAPDGFAL AAAAVGVAWA LMPRGWPVRC AAPLAWLPLV APAPLAPPDG AFRLTALDVG QGSAVLIETA RHTLLFDAGP GPEASNAGER IVVPFLRARG VRALDALVVS HADSDHAGGA PAVLGSIAVA QMAGGLPPSN RLWRAARAAG VADALPCVAG QRWRWDGVEF DALWPTGGPR AGGATNAQSC VLRVSAGGRA ALLTGDVDAR SERALVAGSR GALAAQVLVV PHHGSRTSST EPFLDSVKPR IAIFQVGYAN RFHHPHPTVW ARYAGRGIEL PRTDRDGAVR VDVTSSGALA EPVRYRDAHR RYWMGR
|
| |