Gene Information |
Locus tag | Bcep18194_A5246 |
Symbol | |
ID | 3750458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 2299262 |
End bp | 2300482 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637763545 |
Product | hypothetical protein |
Protein accession | YP_369484 |
Protein GI | 78066715 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
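The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(2299262, 2300482)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).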
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGAC GTGATTTTCT GACGCTGACG GGCGCTGCGG CCGCGGCCGG CGTGTCGATG TGGCAGGCGC CTGCGATGGC GGCTTCCGTG GCGCAGGCGG GGCGCCAACC GGCGCCCGGG TATGCGAACG TGCTGATCCT CGTCGAGTTG AAGGGCGGCA ACGATGGCCT CAACACGGTG GTGCCGTATG CGGACCCGCT GTATTACCAG TTCCGGCGCA GCATCGGCAT CAAGCGCGAG CAGGTGCTGC AACTCGACGC ACACACGGGG CTGCATCCGT CGCTTGCGCC GCTGATGCCG CTGTGGCGCG ACGGGCAGGT CGCGGTCGTG CAGGGCGTCG GCTATCCGCA GCCGAACCTG TCGCACTTTC GCTCGATCGA GATCTGGGAT ACCGCGTCGC GCTCGGATCA ATACCTGCAC GAAGGCTGGC TCACGCGCAC GTTCGCGCAA GCACCCGTAC CGCCCGGTTT CGCGGCCGAC GGCGTCGTGC TCGGCAGCGC CGAGATGGGG CCGCTGTCGA ACGGTGCGCG TGCGATCGCG CTCGTCAATC CCGCGCAGTT CATCCGTGCG GCCCGGCTTG CCGAGCCGTC GTCGCTGCGC GAACAGAACC CTGCGCTCGC CCACATCATC GACGTCGAGA ACGACATCGT GAAGGCGGCC GACCGGCTGC GCCCGCGCGG CGGGATGCGT GAATTCCGGA CAGCCTTTCC GGCCGGCGCG TTCGGCACGT CGGTCAAGAC CGCGATGCAG GTGCTGGCCG CATGCGAAGC GTCCGGGCCC GGCGCGCAGG ATGGTGTCGC GGTGCTGCGT CTGACGCTCA ACGGCTTCGA TACGCACCAG AACCAGCCGG GGCAGCAGGC TGCACTGCTC AAGCAGTTCG CGGAAGGGAT GAGCGCGATG CGCGGCGCGT TGATCGAGCT CGGCCGCTGG AACCAGACGC TCGTGATGAC GTATGCGGAA TTCGGGCGGC GCGTGCGCGA GAACCAGAGC AACGGCACCG ATCACGGCAC GGCCGCGCCG CATTTCGTGA TGGGCGGCCG CGTGGCCGGC GGGCTGTACG GTGCGCCGCC GGCGCTTGGG CGGCTCGACG GCAACGGCAA TCTGCCGGTC GCGGTCGATT TCCGCCAGCT CTACGCGACC GTGCTCGGGC CGTGGTGGGG GCTCGACGCG ACCCGCGTGC TGCAGCAGCG CTTCGATACG CTGCCGTTGT TGAAGGCGTG A
|
Protein sequence | MNRRDFLTLT GAAAAAGVSM WQAPAMAASV AQAGRQPAPG YANVLILVEL KGGNDGLNTV VPYADPLYYQ FRRSIGIKRE QVLQLDAHTG LHPSLAPLMP LWRDGQVAVV QGVGYPQPNL SHFRSIEIWD TASRSDQYLH EGWLTRTFAQ APVPPGFAAD GVVLGSAEMG PLSNGARAIA LVNPAQFIRA ARLAEPSSLR EQNPALAHII DVENDIVKAA DRLRPRGGMR EFRTAFPAGA FGTSVKTAMQ VLAACEASGP GAQDGVAVLR LTLNGFDTHQ NQPGQQAALL KQFAEGMSAM RGALIELGRW NQTLVMTYAE FGRRVRENQS NGTDHGTAAP HFVMGGRVAG GLYGAPPALG RLDGNGNLPV AVDFRQLYAT VLGPWWGLDA TRVLQQRFDT LPLLKA
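The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: the `translate` helper and `CODON_TABLE` name are hypothetical, and it assumes the gene sequence is already given on the coding strand (it begins with ATG and ends with TGA). Translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may initiate translation, so the standard assignments are used here; alternative start codons are not handled.

```python
from itertools import product

# Build the codon-to-amino-acid map in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACCGAC GTGATTTTCT GACGCTGACG"))
```

Applied to the first 30 bases of the gene sequence, this reproduces the first ten residues of the listed protein sequence.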