Gene EcHS_A0835 details


Gene Information

Locus tag: EcHS_A0835
Symbol: moaA
ID: 5592046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 843647
End bp: 844636
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 54%
IMG OID: 640920007
Product: molybdenum cofactor biosynthesis protein A
Protein accession: YP_001457574
Protein GI: 157160256
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2896] Molybdenum cofactor biosynthesis enzyme
TIGRFAM ID: [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial
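
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions match the figures in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(843647, 844636)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```

Both results agree with the Gene Length and Protein Length fields.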


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.344402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCAC AACTGACTGA TGCATTTGCG CGTAAGTTTT ACTACTTGCG CCTGTCGATT 
ACCGATGTGT GTAACTTTCG TTGCACCTAC TGCCTGCCGG ATGGCTACAA ACCGAGCGGC
GTCACCAATA AAGGCTTTCT TACCGTCGAT GAAATTCGCC GGGTTACGCG CGCCTTCGCC
AGTCTGGGCA CCGAAAAAGT GCGCCTGACA GGAGGAGAGC CGTCTTTACG CCGCGACTTT
ACCGATATCA TCGCCGCTGT GCGGGAAAAC GACGCTATCC GCCAGATTGC GGTCACCACC
AATGGTTACC GTCTGGAACG CGATGTGGCG AACTGGCGCG ATGCGGGACT TACTGGCATT
AACGTCAGTG TCGACAGTCT GGACGCCCGC CAGTTTCACG CTATTACCGG GCAGGATAAA
TTCAACCAGG TCATGGCAGG GATTGATGCT GCATTTGAGG CCGGTTTTGA GAAGGTCAAA
GTCAATACCG TGCTGATGCG TGATGTTAAT CATCACCAGC TCGACACCTT TCTGAACTGG
ATCCAGCATC GCCCTATCCA GCTGCGTTTC ATCGAACTGA TGGAAACGGG CGAGGGCAGT
GAGCTCTTCC GTAAACATCA CATCTCTGGT CAGGTTCTGC GTGACGAGCT ACTGCGTCGC
GGCTGGATCC ACCAATTACG TCAACGCAGC GACGGTCCCG CGCAAGTCTT TTGCCATCCG
GATTACGCCG GAGAGATTGG CCTTATCATG CCGTATGAAA AAGACTTCTG CGCCACTTGC
AACCGCCTGC GCGTTTCCTC CATTGGTAAA CTCCATCTCT GCCTGTTTGG TGAAGGCGGC
GTTAACCTGC GCGATCTGCT GGAAGACGAT GCCCAGCAAC AGGCGCTGGA AGCGCGTATT
TCAGCGGCGC TGCGGGAGAA GAAACAGACC CATTTCCTGC ATCAAAACAA CACCGGTATT
ACGCAAAACT TATCGTACAT TGGCGGCTAA
 
Protein sequence
MASQLTDAFA RKFYYLRLSI TDVCNFRCTY CLPDGYKPSG VTNKGFLTVD EIRRVTRAFA 
SLGTEKVRLT GGEPSLRRDF TDIIAAVREN DAIRQIAVTT NGYRLERDVA NWRDAGLTGI
NVSVDSLDAR QFHAITGQDK FNQVMAGIDA AFEAGFEKVK VNTVLMRDVN HHQLDTFLNW
IQHRPIQLRF IELMETGEGS ELFRKHHISG QVLRDELLRR GWIHQLRQRS DGPAQVFCHP
DYAGEIGLIM PYEKDFCATC NRLRVSSIGK LHLCLFGEGG VNLRDLLEDD AQQQALEARI
SAALREKKQT HFLHQNNTGI TQNLSYIGG
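
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and measured for GC content. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence above (the last row is in frame
# because the 960 bases before it are a multiple of 3).
print(translate("ATGGCTTCAC AACTGACTGA TGCATTTGCG"))  # MASQLTDAFA
print(translate("ACGCAAAACT TATCGTACAT TGGCGGCTAA"))  # TQNLSYIGG
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.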