Gene EcSMS35_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0804 
SymbolmoaA 
ID6143236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp808738 
End bp809727 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content54% 
IMG OID641615692 
Productmolybdenum cofactor biosynthesis protein A 
Protein accessionYP_001742884 
Protein GI170682341 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00174086 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAC AACTGACTGA TGCATTTGCG CGTAAGTTTT ACTACTTGCG CCTGTCGATT 
ACCGATGTGT GTAACTTTCG TTGCACCTAC TGCCTGCCGG ATGGCTACAA ACCGAGCGGC
GTCACCAATA AAGGCTTTCT TACCGTCGAT GAAATTCGCC GGGTTACGCG CGCCTTCGCC
AGTCTGGGCA CCGAAAAAGT CCGTCTGACG GGCGGTGAGC CGTCTTTACG CCGCGACTTT
ACCGATATCA TCGCCGCTGT GCGAGAAAAC GACGCTATCC GCCAGATTGC TGTCACCACC
AATGGTTACC GTCTGGAACG CGATGTGGCG AACTGGCGCG ATGCTGGACT TACTGGCATC
AACGTCAGCG TTGATAGTCT GGACGCCCGC CAGTTTCATG CCATTACCGG GCAGGATAAA
TTCAACCAGG TCATGGCAGG AATCGATGCT GCATTTGAGG CCGGTTTTGA GAAGGTCAAA
GTCAATACCG TGCTGATGCG TGATGTTAAT CATCATCAGC TCGACACCTT TCTGAACTGG
ATCCAGCATC GCCCTATCCA GCTGCGTTTC ATCGAATTGA TGGAAACGGG CGAGGGCAGC
GAGCTCTTCC GTAAGCATCA CATCTCTGGT CAGGTTCTGC GTGACGAGCT ACTGCGTCGC
GGCTGGATCC ACCAATTACG TCAACGCAGC GACGGTCCCG CGCAAGTCTT TTGTCATCCG
GATTACGCCG GAGAGATTGG CCTTATCATG CCGTATGAAA AAGACTTCTG CGCCACTTGC
AACCGCCTGC GCGTTTCCTC CATTGGTAAA CTCCATCTCT GCCTGTTTGG TGAAGGCGGC
GTTAACCTGC GCGATCTGCT GGAAGACGAT ACCCAGCAAC AGGCGCTGGA AGCGCGTATT
TCAGCGGCGC TGCGGGAGAA AAAACAGACC CATTTCCTGC ATCAAAACAA CACCGGTATT
ACGCAAAACT TATCGTACAT TGGTGGCTAA
 
Protein sequence
MASQLTDAFA RKFYYLRLSI TDVCNFRCTY CLPDGYKPSG VTNKGFLTVD EIRRVTRAFA 
SLGTEKVRLT GGEPSLRRDF TDIIAAVREN DAIRQIAVTT NGYRLERDVA NWRDAGLTGI
NVSVDSLDAR QFHAITGQDK FNQVMAGIDA AFEAGFEKVK VNTVLMRDVN HHQLDTFLNW
IQHRPIQLRF IELMETGEGS ELFRKHHISG QVLRDELLRR GWIHQLRQRS DGPAQVFCHP
DYAGEIGLIM PYEKDFCATC NRLRVSSIGK LHLCLFGEGG VNLRDLLEDD TQQQALEARI
SAALREKKQT HFLHQNNTGI TQNLSYIGG