Gene Namu_4433 details


Gene Information

Locus tag: Namu_4433
Symbol:
ID: 8450060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4918449
End bp: 4919762
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 67%
IMG OID: 645043480
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_003203708
Protein GI: 258654552
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
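
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 4918449
end_bp = 4919762
gene_length_bp = 1314
protein_length_aa = 437

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints: 1314 0 437
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.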


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.367833
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.781269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCCC AGTGGTCCTT TGAGACCCGC CAGATCCATG CCGGCCAGAC CCCCGACCCG 
ACCACCAAGG CCCGGGCGCT GCCGATCTAC CAGACGACGT CCTACGCGTT CGATTCGTCC
GAACACGGCC GCAAGCTGTT CGCCCTCGAA GAGCTGGGCA ACATCTACAC GCGGATCATG
AACCCCACCC AGGCCGTGGT GGAGGACCGG ATCAACTCCC TCGAGGGCGG CGTCGGTGCG
CTGCTGGTGG CCTCCGGGCA GTCGGCCGAG ACGTTGGCCA TCCTGACCCT GGCCGAGGCC
GGGGACCAGA TCGTCTCCTC GCCGCGCCTG TACGGCGGAA CCTACAACCT GTTCCACTAC
ACGCTGCCCA AGATGGGCAT CACCGTCGAC TTCGTCGAGA ACCCCGACGA TCCGGAGTCG
TGGCGGGCCG CGGCCAAGCC GAACACGAAG GCGTTCTACG GCGAGTCGAT CTCCAACCCG
GCGCTGGACG TCCTGGACTT CGCCGCCGTG TCCGCGGTTG CGCACGAGGT CGGGGTGCCG
CTGATCGTCG ACAACACCGT GCCCAGCCCG TATCTCATCC GGCCGATCGA GCACGGCGCG
GACATCGTGG TGCATTCGGC GACCAAGTAT CTGGGCGGTC ACGGCACCGC GATCGGTGGC
GTCATCGTCG ACTCGGGCAA CTTCGACTGG GTCGCCAACG CCGAGCGCTT CCCGAACTTC
AACACCCCCG ACCCCAGCTA CAACAACCTC ACCTGGGGGG TCGACCTGGG ACCGGAGGGA
CTGTTCAAGT CCAACGTCGC CTTCATCTTC AAGGCCCGGC TGCAGGGGCT GCGCGACATC
GGCCCGGCGA TCAGCCCGTT CAATGCCTTC CTGATCTCCC AGGGTGTGGA GACCCTTTCG
CTGCGGGTGC AGCGGCACAA CGACAATGCG GCCCGGGTCG CCGAATTCCT GTCCGGCCGG
GACGAGGTCG AATCGGTCTC CTACCCCGGT CTGGCGTCCA GCCCCTGGCA CCACCTGCAG
CAGAAGTACG CGCCGCTGGG CGGTGGCCCG ATCGTCACCT TCGAGATCAA GGGCGGGGTC
GAGGCGGGAC AGACGTTCAC CGACGCGCTG GAGCTGTTCA CCAACCTGGC CAACATCGGT
GACGTGCGCT CGCTGGTGAT CCACCCGGCG TCGACCACGC ACGCGCAGCT GGCGCCGGCC
GAGCAGCTGA CCACCGGCGT CACCCCAGGC CTGATCCGGT TGGCCGTCGG TATCGAGCAC
ATCGACGACA TCCTGGCCGA CCTGGAGGCC GGCTTCCGGG CCGCCAAGGG GTGA
 
Protein sequence
MSSQWSFETR QIHAGQTPDP TTKARALPIY QTTSYAFDSS EHGRKLFALE ELGNIYTRIM 
NPTQAVVEDR INSLEGGVGA LLVASGQSAE TLAILTLAEA GDQIVSSPRL YGGTYNLFHY
TLPKMGITVD FVENPDDPES WRAAAKPNTK AFYGESISNP ALDVLDFAAV SAVAHEVGVP
LIVDNTVPSP YLIRPIEHGA DIVVHSATKY LGGHGTAIGG VIVDSGNFDW VANAERFPNF
NTPDPSYNNL TWGVDLGPEG LFKSNVAFIF KARLQGLRDI GPAISPFNAF LISQGVETLS
LRVQRHNDNA ARVAEFLSGR DEVESVSYPG LASSPWHHLQ QKYAPLGGGP IVTFEIKGGV
EAGQTFTDAL ELFTNLANIG DVRSLVIHPA STTHAQLAPA EQLTTGVTPG LIRLAVGIEH
IDDILADLEA GFRAAKG
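
As a rough consistency check between the two sequences, the start and end of the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal illustration, not the database's own procedure. It assumes the gene sequence is given on the coding strand in reading frame 1, and it uses the codon assignments of NCBI translation table 11, which are the same as the standard code for all 64 codons. The helper name `translate` is hypothetical, and only the first 60 and last 15 bases of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, ignoring spaces; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, as printed in the record.
first_60 = "ATGTCCTCCC AGTGGTCCTT TGAGACCCGC CAGATCCATG CCGGCCAGAC CCCCGACCCG"
# Last 15 bases of the gene sequence (last five codons).
last_15 = "GCCGCCAAGGGGTGA"

print(translate(first_60))  # prints: MSSQWSFETRQIHAGQTPDP
print(translate(last_15))   # prints: AAKG*
```

The first 20 residues and the final four residues agree with the protein sequence in the record, and the gene sequence ends in the stop codon TGA.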