Gene EcSMS35_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0852 
SymbolmoeA 
ID6145071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp856808 
End bp858043 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content57% 
IMG OID641615740 
Productmolybdopterin biosynthesis protein MoeA 
Protein accessionYP_001742932 
Protein GI170681896 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTTA CCACCGGATT GATGTCGCTC GACACCGCGC TTAATGAGAT GCTTTCTCGC 
GTTACCCCAC TGACCGCCCA GGAAACGCTG CCACTGGTTC AGTGTTTTGG TCGTATTCTG
GCGAGCGATG TCGTTTCGCC ACTGGATGTC CCGGGGTTTG ATAACTCCGC TATGGACGGC
TACGCGGTGC GTTTAGCCGA TATTGCCTCC GGGCAACCGC TGCCCGTTGC CGGTAAATCC
TTTGCCGGTC AGCCGTACCA TGGTGAATGG CCTGCGGGTA CCTGCATTCG TATTATGACC
GGTGCGCCGG TGCCAGAAGG CTGCGAAGGG GTGGTGATGC AGGAGCAGAC TGAACAAACG
GACAATGGCG TGCGTTTTAC TGCTGAAGTG CGTAGCGGGC AAAATATTCG CCGTCGCGGT
GAAGATATCT CTGCAGGTGC GGTTGTTTTC CCGGCGGGGA CTCGCCTGAC TACCGCTGAA
CTGCCAGTGA TTGCTTCGCT AGGGATTGCC GAAGTTCCGG TGATTCGTAA AGTGCGTGTA
GCGCTTTTTT CTACCGGTGA TGAACTCCAG TTGCCCGGTC AGCCGCTGGG CGACGGCCAA
ATCTACGATA CCAACCGTCT CGCCGTACAC CTGATGTTAG AACAGTTGGG ATGCGAGGTA
ATCAACTTAG GGATTATCCG CGACGATCCC CATGCCCTGC GCGCCGCATT TATTGAAGCC
GACAGCCAGG CGGATGTGGT GATCAGTTCC GGCGGTGTTT CAGTGGGTGA AGCGGATTAC
ACCAAAACCA TTCTTGAAGA GCTGGGGGAG ATCGCCTTCT GGAAGCTGGC GATTAAACCA
GGTAAACCGT TCGCGTTCGG TAAACTCAGC AATAGCTGGT TCTGCGGCCT GCCGGGCAAC
CCGGTTTCAG CAACGCTGAC CTTCTATCAA CTGGTACAGC CTTTGCTGGC AAAACTAAGC
GGCAATACCG CCAGCGGCCT GCCCGCGCGC CAGCGTGTGC GCACGGCATC GCCTCTGAAG
AAATCGCCAG GACGCCTTGA TTTCCAGCGC GGCGTGCTGC AACGCAACGC CGATGGCGAA
CTGGAAGTGA CGACCACCGG ACATCAGGGT TCACATATAT TTAGCTCCTT TAGCCTCGGC
AACTGCTTTA TCGTGCTGGA ACGCGATCGC GGCAATGTGG AAGTGGGCGA ATGGGTGGAA
GTAGAACCGT TTAACGCGTT GTTCGGAGGC CTGTAA
 
Protein sequence
MEFTTGLMSL DTALNEMLSR VTPLTAQETL PLVQCFGRIL ASDVVSPLDV PGFDNSAMDG 
YAVRLADIAS GQPLPVAGKS FAGQPYHGEW PAGTCIRIMT GAPVPEGCEG VVMQEQTEQT
DNGVRFTAEV RSGQNIRRRG EDISAGAVVF PAGTRLTTAE LPVIASLGIA EVPVIRKVRV
ALFSTGDELQ LPGQPLGDGQ IYDTNRLAVH LMLEQLGCEV INLGIIRDDP HALRAAFIEA
DSQADVVISS GGVSVGEADY TKTILEELGE IAFWKLAIKP GKPFAFGKLS NSWFCGLPGN
PVSATLTFYQ LVQPLLAKLS GNTASGLPAR QRVRTASPLK KSPGRLDFQR GVLQRNADGE
LEVTTTGHQG SHIFSSFSLG NCFIVLERDR GNVEVGEWVE VEPFNALFGG L