Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_3887 |
Symbol | |
ID | 4095744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | - |
Start bp | 1046649 |
End bp | 1048535 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638017181 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_623749 |
Protein GI | 107026238 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3540] Phosphodiesterase/alkaline phosphatase D |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACTG GCGGCGCGCG CCGCGCGATC GTACGTATCG GCCGCCATGC ATGCGGGGTT GCGGTACTGT CACAACGCGG CAAAACTGCC CTTCGACAAT GCAGGCGACC GGGACCCCGC TCGTTTCCGG CCCACGCCCC GGCCGTGCTT GCGCATGGCC GCGCGGTTCC CCTCGCGCCC CATCAAACGA ATACACCGAC CATGTCGAAC CAGGACACTT TCCCTGACCA GCCGAACGAG CCGGCCGCCT CCGTGTCGCG TCGCGGCTTC CTGAAACTCG CCGGCGTCTC CAGCCTCGCC ACTGCTGCCG GCGGGCTTGC GGCCGCCCGC GCCGCCGCGT CGAATCCGGA CGGCACGCCC GAGCAAGTTC ACCTGACGTG GGGCAACGAC CCGACGTCCG AAGTCGTGAT CTCGTGGGCG TCGCTCGCCC CGGCCGTCAA TCCGCGCGCG CGCATCGTCG CCGACGGCGA GCCTGCGCGC ACCGTGCACG GCGTCCAGCG CCTGTACACG GACGGCCTGA ACGGCGAGAC CGTATTCACG TACCACGCGC GCGTGCACGG ACTGAAGCCG GATACGCGCT ACCGCTACGA GATCACGGCC GACAACGACA GCAACGCCGC GCAGCCGTTC TCCGCGAATT TCTCGACTGC GCCGCGCGGC CGGGCGCCGT TCCGTTTCAC GAGCTACGGC GATCTCGCGA CGCCGAACGG CGCGTGGGTG CTGTCGTCGC CGCAGAGCCG CTTCGCGGTG CAGGCCGTCG AACAGTTCCA GCCGCTGTTT CACCTGCTGA ACGGCGACCT CTGCTATGCG AACCTGAACC CGGCGCACCA GCCCGAGGTG TGGCGCGATT TCGGCAACAA CAACCAGACG TCGGCTGCGA ACCGCCCGTG GATGCCGTGC CCCGGTAATC ACGAGATCGA ATTCAACAAC GGTCCGCAGG GGCTCGACTC GTACCTCGCA CGCTATACGC TGCCCGAGAA CGGCACGCAT TTCCCGGGCC GCTGGTACAG CTTCCGCGTG AGCTCCGTGC TGTTCGTGTC GCTCGACGCC GACGACGTCG TGTACCAGGA TGCCGCCGCG TTCGTCGGCG GCCCGGCGCC GCTCGTGCCG GCCGCGAGCA CCGGCCGCCC GCCGATCGAG CCCGGCACGT CGTTCTACGT GCGCGGCTAC AGCAACGGCG AGCAGACGCG CTGGCTCGAA CGCACGCTGC GTCACGCCGC GCACGACGAC GACATCGACT GGATCGTCGT GCAGATGCAT CAGGACGCGC TCAGTTCGTC GAAAACGGGC AACGGCTCCG ACAAGGGCAT TCGCGAAGCG TGGCTGCCGC TGTTCGACCG TTACGGCGTC GACCTCGTGC TGTGCGGCCA CGATCACGAC TACGAGCGCA GCTACCCGGT GCGCGGTTGC AATCACCGGG CGGGCGTCGA TGCGAAAACC GGTGAAGTGG TCGAAACGCT GCAGCCGCGC CCGGTCGGCT CGAACGATCC GGATCGCACG AAGTTCGATA CGAGCCACGG CACGATCCAC CTGATCCTCG GCGGCGGCGG CACCAGCGCG CCGCTCGACG TGTACGGCGA AAACCCGTCG ACCGGCTTTG CGCAGGCGAA GGTGTTCACG AAGCCGAACC GGCCGATGCC GGGCACCGCG CCGAACACGT TCGTGCGTCA GCCGGCCGAT GCGCTCGAGG ATGCGATCTG GTCCGCGCGT CGCGATACGG GCACCGGCTA CGGGATCGCG GTGTTCGACC ACGATCCGGG CAAGCCGGGC GGCCAAACGA CGATCACGAT GCGCTACTAC CACGCGCCGG GCGCCGACCA GCATCCGACC GCGCAGTACG AGCTGTTCGA GACGATCGAG TTGAGCAAGA AGCGGCGCGA GCGGTGA
|
Protein sequence | MSTGGARRAI VRIGRHACGV AVLSQRGKTA LRQCRRPGPR SFPAHAPAVL AHGRAVPLAP HQTNTPTMSN QDTFPDQPNE PAASVSRRGF LKLAGVSSLA TAAGGLAAAR AAASNPDGTP EQVHLTWGND PTSEVVISWA SLAPAVNPRA RIVADGEPAR TVHGVQRLYT DGLNGETVFT YHARVHGLKP DTRYRYEITA DNDSNAAQPF SANFSTAPRG RAPFRFTSYG DLATPNGAWV LSSPQSRFAV QAVEQFQPLF HLLNGDLCYA NLNPAHQPEV WRDFGNNNQT SAANRPWMPC PGNHEIEFNN GPQGLDSYLA RYTLPENGTH FPGRWYSFRV SSVLFVSLDA DDVVYQDAAA FVGGPAPLVP AASTGRPPIE PGTSFYVRGY SNGEQTRWLE RTLRHAAHDD DIDWIVVQMH QDALSSSKTG NGSDKGIREA WLPLFDRYGV DLVLCGHDHD YERSYPVRGC NHRAGVDAKT GEVVETLQPR PVGSNDPDRT KFDTSHGTIH LILGGGGTSA PLDVYGENPS TGFAQAKVFT KPNRPMPGTA PNTFVRQPAD ALEDAIWSAR RDTGTGYGIA VFDHDPGKPG GQTTITMRYY HAPGADQHPT AQYELFETIE LSKKRRER
|
| |