Gene SbBS512_E2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2571 
SymbolmoaA 
ID6268942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2371556 
End bp2372545 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content54% 
IMG OID641726553 
Productmolybdenum cofactor biosynthesis protein A 
Protein accessionYP_001881033 
Protein GI187733362 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0000120862 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCAC AACTGACTGA TGCATTTGCG CGTAAGTTTT ACTACTTGCG CCTGTCGATT 
ACCGATGTGT GTAACTTTCG TTGCACCTAC TGCCTGCCGG ATGGCTACAA ACCGAGCGGC
GTCACCAATA AAGGCTTTCT TACCGTCGAT GAAATTCGCC GGGTTACGCG CGCCTTCGCC
AGTCTGGGCA CCGAAAAAGT GCGCCTGACA GGAGGTGAGC CGTCTTTACG CCGCGACTTT
ACCGATATCA TCGCCGCTGT GCGGGAAAAC GACGCTATCC GCCAGATTGC GGTCACCACC
AATGGTTACC GTCTGGAACG CGATGTGGCG AACTGGCGCG ATGCGGGACT TACTGGCATT
AACGTCAGTG TCGACAGTCT GGACGCCCGC CAGTTTCACG CTATTACCGG GCAGGATAAA
TTCAACCAGG TCATGGCAGG GATTGATGCT GCATTTGAGG CCGGTTTTGA GAAGGTCAAA
GTCAATACCG TGCTGATGCG TGATGTTAAT CATCACCAGC TCGACACCTT TCTGAACTGG
ATCCAGCATC GCCCTATCCA GCTGCGTTTC ATCGAACTGA TGGAAACGGG CGAGGGCAGC
GAGCTCTTCC GTAAGCATCA CATCTCTGGT CAGGTTCTGC GTGACGAGCT ACTGCGTCGC
GGCTGGATCC ACCAATTACG TCAACGCAGC GACGGTCCCG CGCAAGTCTT TTGCCATCCA
GATTACGCCG GAGAGATTGG CCTTATCATG CCGTATGAAA AAGACTTCTG CGCCACTTGC
AACCGCCTGC GCGTTTCCTC CATTGGTAAA CTCTATCTCT GCCTGTTTGG TGAAGGCGGC
GTTAACCTGC GCGATCTGCT GGAAGACGAT ACCCAGCAAC AGGCGCTGGA AGCGCGTATT
TCAGTGGCGC TGCGGGAGAA GAAACAGACC CATTTCCTGC ATCAAAACAA CACCGGTATT
ACGCAAAACT TATCGTACAT TGGCGGCTAA
 
Protein sequence
MASQLTDAFA RKFYYLRLSI TDVCNFRCTY CLPDGYKPSG VTNKGFLTVD EIRRVTRAFA 
SLGTEKVRLT GGEPSLRRDF TDIIAAVREN DAIRQIAVTT NGYRLERDVA NWRDAGLTGI
NVSVDSLDAR QFHAITGQDK FNQVMAGIDA AFEAGFEKVK VNTVLMRDVN HHQLDTFLNW
IQHRPIQLRF IELMETGEGS ELFRKHHISG QVLRDELLRR GWIHQLRQRS DGPAQVFCHP
DYAGEIGLIM PYEKDFCATC NRLRVSSIGK LYLCLFGEGG VNLRDLLEDD TQQQALEARI
SVALREKKQT HFLHQNNTGI TQNLSYIGG