Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_2001 |
Symbol | |
ID | 6178871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010551 |
Strand | + |
Start bp | 2232267 |
End bp | 2233487 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641681767 |
Product | cupin 4 family protein |
Protein accession | YP_001808698 |
Protein GI | 172061046 |
COG category | [S] Function unknown |
COG ID | [COG2850] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.484534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000254865 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCAACC GGTCTCAAGC CGAACCGCGG GATGCCGCTG CCGCCCCGCT CGGCGCGCCT CCTTCCGATC TTCCCACACC GCTGCTCGGC GGTCTCTCGC CGGCGCAATT CATGCGCCGC TACTGGCAGA AAAAGCCGCT GTTGATCCGT CAGGCGATCC CCGGCGTGAA GCCGCCGGTC ACGCGCGACG CACTGTTCGA ACTGGCAGCC GACTATGACG CCGAATCGCG ACTGATCACC CATTTTCGTA ACAAGTGGCA ACTCGCGCAC GGCCCATTCG AGCCGGGCGC GCTGCCCGCC GTGTCGCGCA AATCCTGGTC CCTGCTCGTG CAGGGGCTCG ACCTGCACGT CGACGCGGCC CGCGCGCTGC TCGACCGCTT CCGCTTCATC CCGGACGCGC GCCTCGACGA CCTGATGATT TCGTACGCGA CGGACGGCGG CGGCGTCGGC CCGCACTTCG ACTCGTATGA CGTGTTCCTG CTGCAGGTCG AGGGCCGGCG CCGCTGGCGC ATCGGCGCGC AGAAGGATCT GTTGCTGCAA CCGGACGTGC CGCTGAAGAT CCTCGAGCAT TTCGAGCCGA GCGACGAGTG GGTGCTGGAG CCGGGCGACA TGCTGTACCT GCCGCCGCAC ATCGCGCACG ACGGCGTGGC CGAAGGAGAA TGCATGACCT GCTCGATCGG CTTTCGGGCA CCGTCCGCCG GCGAGTTGGG CGCGCAATTC CTGTACTACC TCGCGGAACG CGGCGGGCTG CGTGACGGGC GCGGCGACGC GCTCTACCGC GACCCGAAGC AGCCGGCGGT CGACACGCCG GCGCAACTGC CGCCGGCGAT GGTCGATCGC GTCGCCGAGA TCGTCGATGC GATCCGCTGG CGCAAGCGCG ACGTCGCCGA ATTCCTGGGC TGCTATCTGA GCGAGCCGAA ATCTAGCGTG GTATTCGAGC CGCCGGCACG CCCGCTGTCC GAGGCGGCGT TCGTCACGCA GGCTTCTCGC CGTGGCGTAT ATCTCGACAG AAGGGCCGCA TTGATGTATA ACGCGCGATC GTACTTCATT AATGGTGAAG AAGAACCGCT CGAGCAAGCC GGCGAATGGC TGCCCGAACT GGCCAATCTG CGCCATATGG AGGCGAAACG GTTTGTAACA CTCTCCCGGG CTCCCTCCAT GACAGCCTTG CTGCACGAGT GGTATTGTGC GGGCTGGATA CGGGTCGGAA ACCGGATTTA G
|
Protein sequence | MRNRSQAEPR DAAAAPLGAP PSDLPTPLLG GLSPAQFMRR YWQKKPLLIR QAIPGVKPPV TRDALFELAA DYDAESRLIT HFRNKWQLAH GPFEPGALPA VSRKSWSLLV QGLDLHVDAA RALLDRFRFI PDARLDDLMI SYATDGGGVG PHFDSYDVFL LQVEGRRRWR IGAQKDLLLQ PDVPLKILEH FEPSDEWVLE PGDMLYLPPH IAHDGVAEGE CMTCSIGFRA PSAGELGAQF LYYLAERGGL RDGRGDALYR DPKQPAVDTP AQLPPAMVDR VAEIVDAIRW RKRDVAEFLG CYLSEPKSSV VFEPPARPLS EAAFVTQASR RGVYLDRRAA LMYNARSYFI NGEEEPLEQA GEWLPELANL RHMEAKRFVT LSRAPSMTAL LHEWYCAGWI RVGNRI
|
| |