Gene Nmul_A0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0221 
SymbolmetX 
ID3784598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp235602 
End bp236735 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID637810293 
Producthomoserine O-acetyltransferase 
Protein accessionYP_410921 
Protein GI82701355 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATGC AAGATTCGAA TTCATTCAGT ACTGTTACTC CACAGGTGGC GCGTTTTGAC 
ACGCCTTTGC ATTTGAAAAG CGGCGCGGTG CTCGACAGCT ACGAGCTGGT GTACGAAACA
TACGGGGAGC TCAACGCGGC GCGGTCGAAC GCCGTACTGG TATGCCACGC CCTTTCAGGG
AATCATCATC TCGCAGGCCT TTACGACGAT AACCCCAAGA GTGCCGGCTG GTGGAACAAC
ATGATCGGGC CGGGCAAATC GATCGATACC CAGAAATTTT TCTTAATCGG GGTAAACAAC
CTGGGCGGTT GTCATGGATC CACCGGACCG GCGAGTATTG ATGTCAGGAC TGGAAAGTGT
TACGGCCCGA ATTTTCCGGT TGTGACAGTG GAAGACTGGG TTCAGACACA AGTCCGCCTT
GCCGATTATC TGGGTATCGA TCAGTTTGCC GCCGTGGCTG GCGGTAGTCT TGGCGGAATG
CAGGCTTTGC AGTGGACACT TGATTTTCCC GAGAGGGTGC GCCACGCGCT GGTTATCGCC
GCGGCTGCAA AGTTGACTGC GCAGAACATC GCATTCAACG ATGTGGCACG CCAAGCTATC
ATTACCGATC CTGATTTCCA TGGCGGCGAC TATTATTCAC ACGGTGTCAT TCCGCGGAGA
GGATTACGCC TGGCGCGCAT GCTGGGACAT ATCACCTACC TCTCGGACGA CTCGATGGCG
GCTAAATTCG GCCGGGAACT GCGGAATGGA GCGCTTGCCT TCGGTTACGA CGTGGAGTTC
GAAATAGAAT CGTATCTTCG TTATCAGGGA GATAAATTCG CTAGCCAGTT CGATGCGAAC
ACGTATCTGC TGATGACAAA GGCATTGGAC TATTTTGATC CTGCCTTTCC GCACAACAAC
GACCTCAGCG CCGCATTCCG ATTCGCCAGG GCTAATTTCC TGGTGCTGTC GTTTACTACC
GACTGGCGTT TTTCCCCGGA GCGCTCGCGC GCCATCGTAA GGGCGCTGCT GGACAACGAA
CTGAACGTCA GTTATGCCGA AATTACATCC AGTCATGGCC ACGACTCGTT CCTCATGGAG
GATCGGCATT ATCACAGGCT GGTGCGGGCT TACATGGATA ACGTGGTCGT ATGA
 
Protein sequence
MLMQDSNSFS TVTPQVARFD TPLHLKSGAV LDSYELVYET YGELNAARSN AVLVCHALSG 
NHHLAGLYDD NPKSAGWWNN MIGPGKSIDT QKFFLIGVNN LGGCHGSTGP ASIDVRTGKC
YGPNFPVVTV EDWVQTQVRL ADYLGIDQFA AVAGGSLGGM QALQWTLDFP ERVRHALVIA
AAAKLTAQNI AFNDVARQAI ITDPDFHGGD YYSHGVIPRR GLRLARMLGH ITYLSDDSMA
AKFGRELRNG ALAFGYDVEF EIESYLRYQG DKFASQFDAN TYLLMTKALD YFDPAFPHNN
DLSAAFRFAR ANFLVLSFTT DWRFSPERSR AIVRALLDNE LNVSYAEITS SHGHDSFLME
DRHYHRLVRA YMDNVVV