Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0799 |
Symbol | bioF |
ID | 6144826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 802229 |
End bp | 803383 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641615687 |
Product | 8-amino-7-oxononanoate synthase |
Protein accession | YP_001742879 |
Protein GI | 170680137 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes |
TIGRFAM ID | [TIGR00858] 8-amino-7-oxononanoate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000322202 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTGGC AGGAGAAAAT CAACGCGGCG CTCGATGCGC GGCGTGCTGC CGATGCCCTG CGTCGCCGTT ATCCAGTGGC GCAAGGAGCC GGACGCTGGC TGGTGGCGGA CGATCGCCAT TATCTGAACT TTTCCAGTAA CGATTATTTA GGTTTAAGCC ATCATCCGCA AATTATCCGT GCCTGGAAGC TGAGTGCGGA GCAATTTGGC GTCGGTAGCG GCGGCTCCGG TCACGTCAGC GGTTATAGCG TGGCGCATCA GGCGCTGGAA GAAGAACTGG CCGAGTGGCT GGGCTATTCG CGGGCACTGC TGTTTATCTC TGGTTTTGCC GCTAACCAGG CAGTTATTAC CGCGATGATG GCGAAAGAGG ACCGTATTGT TGCCGACCGG CTTAGCCATG CCTCATTGCT GGAGGCTGCA AGTTTAAGCC CGTCGCCGCT TCGCCGTTTT GCTCATAACG ATGTCACTCA TCTGGCGCGA CTGCTTGCTT CCCCCTGTCC GGGGCAGCAA CTGGTAGTGA CAGAAGGCGT GTTCAGCATG GACGGCGATA GTGCGCCACT GGAGGAAATC CAGCAGGTAA CGCAACAGCA CGATGGCTGG TTGATGGTCG ATGACGCCCA CGGCACGGGC GTTATCGGGG AGCAGGGGCG TGGCAGCTGC TGGCTGCAAA AGGTAAAACC AGAATTGCTG GTGGTGACTT TTGGCAAAGG ATTTGGCGTC AGCGGGGCAG CGGTGCTTTG CTCCAATACG GTGGCGGATT ATCTGCTGCA ATTCGCCCGC CATCTTATCT ACAGCACCAG TATGCCGCCC GCTCAGGCGC AGGCATTACG TGCGTCGCTG GCGGTCATTC GCAGTGATGA GGGTGATGCA CGGCGCGAAA AACTGGCGGC ACTCATTACG CGTTTTCGTG CCGGAGTGCA GGATTTGCCG TTTACGCTTG CTGATTCATG GAGCGCCATC CAGCCATTGA TCGTCGGTGA TAACAGCCGT GCGTTACAAC TGGCAGAAAA ACTGCGCCAG CAAGGTTGCT GGGTCACGGC GATTCGCCCG CCAACCGTAC CTGCTGGTAC TGCGCGACTG CGCTTAACAC TAACCGCCGC GCATGAAATG CAGGATATCG ACCGTCTGCT GGAGGTGCTG CATGACAACG GTTAA
|
Protein sequence | MSWQEKINAA LDARRAADAL RRRYPVAQGA GRWLVADDRH YLNFSSNDYL GLSHHPQIIR AWKLSAEQFG VGSGGSGHVS GYSVAHQALE EELAEWLGYS RALLFISGFA ANQAVITAMM AKEDRIVADR LSHASLLEAA SLSPSPLRRF AHNDVTHLAR LLASPCPGQQ LVVTEGVFSM DGDSAPLEEI QQVTQQHDGW LMVDDAHGTG VIGEQGRGSC WLQKVKPELL VVTFGKGFGV SGAAVLCSNT VADYLLQFAR HLIYSTSMPP AQAQALRASL AVIRSDEGDA RREKLAALIT RFRAGVQDLP FTLADSWSAI QPLIVGDNSR ALQLAEKLRQ QGCWVTAIRP PTVPAGTARL RLTLTAAHEM QDIDRLLEVL HDNG
|
| |