Gene EcolC_2862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2862 
SymbolmoaA 
ID6065234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3128368 
End bp3129390 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content54% 
IMG OID641602268 
Productmolybdenum cofactor biosynthesis protein A 
Protein accessionYP_001725817 
Protein GI170020863 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0131262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCGC CTCCCGTATC TGGAAAGGTG TACATGGCTT CACAACTGAC TGATGCATTT 
GCGCGTAAGT TTTACTACTT GCGCCTGTCG ATTACCGATG TGTGTAACTT TCGTTGCACC
TACTGCCTGC CGGATGGCTA CAAACCGAGC GGCGTCACCA ATAAAGGCTT TCTTACCGTC
GATGAAATTC GCCGGGTTAC GCGCGCCTTC GCCAGTCTGG GCACCGAAAA AGTGCGCCTG
ACAGGAGGAG AGCCGTCTTT ACGCCGCGAC TTTACCGATA TCATCGCCGC TGTGCGGGAA
AACGACGCTA TCCGCCAGAT TGCGGTCACA ACCAATGGTT ACCGTCTGGA ACGCGATGTG
GCGAACTGGC GCGATGCGGG ACTTACTGGC ATTAACGTCA GTGTCGACAG TCTGGACGCC
CGCCAGTTTC ACGCTATTAC CGGGCAGGAT AAATTCAACC AGGTCATGGC AGGGATTGAT
GCTGCATTTG AGGCCGGTTT TGAGAAGGTC AAAGTCAATA CCGTGCTGAT GCGTGATGTT
AATCATCACC AGCTCGACAC CTTTCTGAAC TGGATCCAGC ATCGCCCTAT CCAGCTGCGT
TTCATCGAAC TGATGGAAAC GGGCGAGGGC AGCGAGCTCT TCCGTAAGCA TCACATCTCT
GGTCAGGTTC TGCGTGACGA GCTACTGCGT CGCGGCTGGA TCCACCAATT ACGTCAACGC
AGCGACGGTC CCGCGCAAGT CTTTTGCCAT CCAGATTACG CCGGAGAGAT TGGCCTTATC
ATGCCGTATG AAAAAGACTT CTGCGCCACT TGCAACCGCC TGCGCGTTTC CTCCATTGGT
AAACTCCATC TCTGCCTGTT TGGTGAAGGC GGCGTTAACC TGCGCGATCT GCTGGAAGAC
GATACCCAGC AACAGGCGCT GGAAGCGCGT ATTTCAGCGG CGCTGCGGGA GAAGAAACAG
ACCCATTTCC TGCATCAAAA CAACACCGGT ATTACGCAAA ACTTATCGTA CATTGGCGGC
TAA
 
Protein sequence
MTSPPVSGKV YMASQLTDAF ARKFYYLRLS ITDVCNFRCT YCLPDGYKPS GVTNKGFLTV 
DEIRRVTRAF ASLGTEKVRL TGGEPSLRRD FTDIIAAVRE NDAIRQIAVT TNGYRLERDV
ANWRDAGLTG INVSVDSLDA RQFHAITGQD KFNQVMAGID AAFEAGFEKV KVNTVLMRDV
NHHQLDTFLN WIQHRPIQLR FIELMETGEG SELFRKHHIS GQVLRDELLR RGWIHQLRQR
SDGPAQVFCH PDYAGEIGLI MPYEKDFCAT CNRLRVSSIG KLHLCLFGEG GVNLRDLLED
DTQQQALEAR ISAALREKKQ THFLHQNNTG ITQNLSYIGG