Gene Information |
Locus tag | BamMC406_1850 |
Symbol | |
ID | 6179107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010551 |
Strand | - |
Start bp | 2062894 |
End bp | 2064105 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641681609 |
Product | hypothetical protein |
Protein accession | YP_001808548 |
Protein GI | 172060896 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00703093 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAACCGAC GTGATTTTCT GACGCTCACG GGCGCCGCGG CCGCGGCGGG CGTGTCGCTG TGGCAGCCGG CCGCGCACGC GGCTTCGATG CCCGCGGCGG GGCGGCCCGG CTATGCGAAC GTGCTGATCC TCATCGAGCT GAAGGGCGGC AACGACGGCC TCAACACGGT GGTGCCGTAT GCGGATCCGC TCTACTACCA GTTCCGCCGC GGCATCGGCA TCAAGCGCGA TCAGGTGCTG CAGCTCGATG CGCATACGGG GCTGCACCCG GCGCTCGCGC CGCTGATGCC GCTGTGGCGC GACGGGCAGG TCGCGATCGT GCAGGGCGTC GGCTATCCGC AGCCGAACCT GTCGCATTTC CGTTCGATCG AGATCTGGGA CACCGCATCG CGCTCGGACC AGTACCTGCA CGAAGGCTGG CTCACACGCA CGTTCGCGCA GGCGCCGGTG CCGCCGGGCT TCGCGGCGGA CGGCGTGGTG CTCGGCAGCG CCGAGATGGG ACCGCTCGCG AACGGCGCGC GCGCGATCGC GCTCGTCAAT CCCGCCCAGT TCATCCGCGC GGCGCGGCTC GCCGAGCCGT CGTCGCTGCG TGAACGGAAC CCGGCGCTCG CACACATCAT CGACGTCGAG AACGACATCG TGAAGGCGGC CGACCGGCTG CGCCCGCGCG GCGGGATGCG CGAGTTCAGG ATGGCCTTTC CGGCCGGTAC GTTCGGCACC TCGGTGAAGA CCGCCATGCA GGTGCTCGCG GCATGCGAAA CGTCGGGGCC CGGCGCGCAG GACGGCGTCG CGGTGCTGCG GCTCACGCTC AACGGATTCG ACACGCACCA GAACCAGCCG GGACAGCACG CCGCGCTGCT CAAGCAGTTC GCGGAAGGGA TGAGTGCGAT GCGCGACGCG CTGATCGAGC TGGGGCGCTG GAACCAGACG CTCGTGATGA CGTATGCGGA ATTCGGGCGG CGCGTGCGCG AGAACCAGAG CAACGGCACC GATCACGGTA CCGCCGCGCC GCATTTCGTG ATGGGCGGCC GCGTGGCCGG CGGGCTGTAC GGTGGGGCGC CCGCGCTCGG GCGCCTCGAC GGCAACGGCA ACCTGCCGGT CGCGGTCGAT TTCCGTCAGC TGTATGCGAC CGTGCTCGGG CCGTGGTGGG GACTCGATGC GACGCGCGTG CTGCAGCAGC GCTTCGACAT GCTGCCGCTG CTCAAGGTGT GA
Protein sequence | MNRRDFLTLT GAAAAAGVSL WQPAAHAASM PAAGRPGYAN VLILIELKGG NDGLNTVVPY ADPLYYQFRR GIGIKRDQVL QLDAHTGLHP ALAPLMPLWR DGQVAIVQGV GYPQPNLSHF RSIEIWDTAS RSDQYLHEGW LTRTFAQAPV PPGFAADGVV LGSAEMGPLA NGARAIALVN PAQFIRAARL AEPSSLRERN PALAHIIDVE NDIVKAADRL RPRGGMREFR MAFPAGTFGT SVKTAMQVLA ACETSGPGAQ DGVAVLRLTL NGFDTHQNQP GQHAALLKQF AEGMSAMRDA LIELGRWNQT LVMTYAEFGR RVRENQSNGT DHGTAAPHFV MGGRVAGGLY GGAPALGRLD GNGNLPVAVD FRQLYATVLG PWWGLDATRV LQQRFDMLPL LKV
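
The lengths listed in this record can be cross-checked against its coordinates and sequences. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions agree with the values shown (1212 bp, 403 aa). The function names are hypothetical helpers introduced for illustration.

```python
def ungroup(seq_field: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-character blocks."""
    return "".join(seq_field.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq_field: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) DNA sequence."""
    dna = ungroup(seq_field)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(2062894, 2064105)
    print(length, expected_protein_length(length))
```

Passing the full Gene sequence field to `ungroup` and taking `len` of the result should reproduce the listed gene length, and `gc_fraction` on the same field can be compared with the listed GC content.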