Gene Nmul_A1497 details

Gene Information

Locus tag: Nmul_A1497
Symbol:
ID: 3785374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1712233
End bp: 1713522
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 52%
IMG OID: 637811585
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_412192
Protein GI: 82702626
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
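
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1712233
end_bp = 1713522
gene_length_bp = 1290
protein_length_aa = 429

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
# Assumption: the last codon is a stop codon and contributes no residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, remainder, residues)  # 1290 0 429
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.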


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAAA ATGGCATGAA ACTGGAATCT CTGGCATTGC ATTACGGATA TAAATCAGAA 
GCTTCGACGA AGGCCGCGGC AGTACCGATT TATCAGACGA CTTCCTATAC ATTTGACAAT
ACGCAGCATG GGGCCGACCT GTTTGACCTG AAGGTGGCGG GAAATATTTA TACCCGGATA
ATGAATCCAA CCAATGCAGT GCTGGAGCAG CGTCTGGCAG AGATGGAGGG CGGAGTGGGT
GGGCTGGCAG TTGCTTCAGG TATGGCGGCA ATCACTTACG CCATCCAGTG CATCGCCAAT
GCCGGCGACA ACATTGTGAG CACCAGCCAA CTGTATGGCG GCACGTACAA TCTGTTTGCG
CATACTTTCC CCAGACAGGG GATCGAGGTG CGCATGGCCT CACATGAGGA TTTCGATGGG
ATTGAGCAAC GTATAGATGC CAAGACCCGT GCAATATTTT GTGAATCGAT TGGAAATCCC
TCGGGAAATG TTATCGATAT CGAGAAGATC GCTGACATCG CTCATCGGCA CGGGGTGCCC
CTCATCGTCG ACAGTACGGT TTCCACCCCC TATCTATGTC AACCTTTCAA GCTCGGTGCG
GATATTGTCG TGCATTCCCT GACGAAATAT ATCGGTGGAC ATGGTACAAC AGTCGGAGGA
ATGATTATCG ATTCCGGACA ATTCGACTGG GCAAAGCATA AGGACCGCTT TCCCTTGTTG
AACGAGCCTG ATCCTTCATA TCATGACGTT GTTTATACAG AAGCATTTGA AGCCGCGGCC
TTTATCGGAC GCTGCCGGGT TGTGCCATTG CGCAACACGG GGGCAGCGCT CGCGCCCCAC
AGCGCATTTT TGATACTGCA GGGATTGGAA ACGCTGGGCT TGAGAATGGA GCGGCACTGC
GAGAATGCAT TGAAGGTGGC GCAGTATCTG GAAGCGCATC CCGTCGTCGA GCGCGTCAAT
TACGCGGGTT TGCCCAGCAG TAAATATCAT GAACTGTGCA ACAGGATCAG CAAAGGGAAA
GCATCCGGCC TGTTGAGCTT TGAGATCAAG GGTGGAGCGG TGGCCGGCAG CGAGTTTATC
GATGCCCTGA AGATGATCCT GCGGCTGGTC AATATCGGCG ATGCCAAATC GCTAGCCTGT
CATCCTGCAT CAACCACTCA CAGACAGTTG AGCCCTGATG AATTAAAGTC TGCGGGAGTT
TCACCGGGTC TCGTGAGGAT ATCGGTCGGA ATTGAGCATA TTGACGACAT CATCGGCGAT
ATCGCCCAGG CGCTTGATGC AATCACCTGA
 
Protein sequence
MEENGMKLES LALHYGYKSE ASTKAAAVPI YQTTSYTFDN TQHGADLFDL KVAGNIYTRI 
MNPTNAVLEQ RLAEMEGGVG GLAVASGMAA ITYAIQCIAN AGDNIVSTSQ LYGGTYNLFA
HTFPRQGIEV RMASHEDFDG IEQRIDAKTR AIFCESIGNP SGNVIDIEKI ADIAHRHGVP
LIVDSTVSTP YLCQPFKLGA DIVVHSLTKY IGGHGTTVGG MIIDSGQFDW AKHKDRFPLL
NEPDPSYHDV VYTEAFEAAA FIGRCRVVPL RNTGAALAPH SAFLILQGLE TLGLRMERHC
ENALKVAQYL EAHPVVERVN YAGLPSSKYH ELCNRISKGK ASGLLSFEIK GGAVAGSEFI
DALKMILRLV NIGDAKSLAC HPASTTHRQL SPDELKSAGV SPGLVRISVG IEHIDDIIGD
IAQALDAIT
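
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons). The helper `translate` and the names `first_block` and `last_block` are hypothetical, and only the first and last 30 bases of the gene sequence are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_block = "ATGGAAGAAA ATGGCATGAA ACTGGAATCT"
last_block = "ATCGCCCAGG CGCTTGATGC AATCACCTGA"

print(translate(first_block))  # MEENGMKLES
print(translate(last_block))   # IAQALDAIT
```

The two outputs correspond to the first ten and last nine residues of the protein sequence, with the terminal TGA read as the stop codon.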