Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0804 |
Symbol | moaA |
ID | 6143236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 808738 |
End bp | 809727 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615692 |
Product | molybdenum cofactor biosynthesis protein A |
Protein accession | YP_001742884 |
Protein GI | 170682341 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2896] Molybdenum cofactor biosynthesis enzyme |
TIGRFAM ID | [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00174086 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAC AACTGACTGA TGCATTTGCG CGTAAGTTTT ACTACTTGCG CCTGTCGATT ACCGATGTGT GTAACTTTCG TTGCACCTAC TGCCTGCCGG ATGGCTACAA ACCGAGCGGC GTCACCAATA AAGGCTTTCT TACCGTCGAT GAAATTCGCC GGGTTACGCG CGCCTTCGCC AGTCTGGGCA CCGAAAAAGT CCGTCTGACG GGCGGTGAGC CGTCTTTACG CCGCGACTTT ACCGATATCA TCGCCGCTGT GCGAGAAAAC GACGCTATCC GCCAGATTGC TGTCACCACC AATGGTTACC GTCTGGAACG CGATGTGGCG AACTGGCGCG ATGCTGGACT TACTGGCATC AACGTCAGCG TTGATAGTCT GGACGCCCGC CAGTTTCATG CCATTACCGG GCAGGATAAA TTCAACCAGG TCATGGCAGG AATCGATGCT GCATTTGAGG CCGGTTTTGA GAAGGTCAAA GTCAATACCG TGCTGATGCG TGATGTTAAT CATCATCAGC TCGACACCTT TCTGAACTGG ATCCAGCATC GCCCTATCCA GCTGCGTTTC ATCGAATTGA TGGAAACGGG CGAGGGCAGC GAGCTCTTCC GTAAGCATCA CATCTCTGGT CAGGTTCTGC GTGACGAGCT ACTGCGTCGC GGCTGGATCC ACCAATTACG TCAACGCAGC GACGGTCCCG CGCAAGTCTT TTGTCATCCG GATTACGCCG GAGAGATTGG CCTTATCATG CCGTATGAAA AAGACTTCTG CGCCACTTGC AACCGCCTGC GCGTTTCCTC CATTGGTAAA CTCCATCTCT GCCTGTTTGG TGAAGGCGGC GTTAACCTGC GCGATCTGCT GGAAGACGAT ACCCAGCAAC AGGCGCTGGA AGCGCGTATT TCAGCGGCGC TGCGGGAGAA AAAACAGACC CATTTCCTGC ATCAAAACAA CACCGGTATT ACGCAAAACT TATCGTACAT TGGTGGCTAA
|
Protein sequence | MASQLTDAFA RKFYYLRLSI TDVCNFRCTY CLPDGYKPSG VTNKGFLTVD EIRRVTRAFA SLGTEKVRLT GGEPSLRRDF TDIIAAVREN DAIRQIAVTT NGYRLERDVA NWRDAGLTGI NVSVDSLDAR QFHAITGQDK FNQVMAGIDA AFEAGFEKVK VNTVLMRDVN HHQLDTFLNW IQHRPIQLRF IELMETGEGS ELFRKHHISG QVLRDELLRR GWIHQLRQRS DGPAQVFCHP DYAGEIGLIM PYEKDFCATC NRLRVSSIGK LHLCLFGEGG VNLRDLLEDD TQQQALEARI SAALREKKQT HFLHQNNTGI TQNLSYIGG
|
| |