Gene Namu_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1781 
Symbol 
ID8447384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1956442 
End bp1957497 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID645040908 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_003201160 
Protein GI258652004 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.000215592 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.125783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCG TCCTCACCCC CACCGACGTC CACCGCGCAC CCGAACCGCT TGAGCAACGA 
ATCCGCCGTC ACCCGCACAC GTTTCGCGTG CTCTCGGGCG ATCGACCCAC CGGTGCGCTG
CATCTGGGCC ACTATCTGGG CACTTTGCGC AACCGGGTCC AGCTGCAGAA CCTCGGCGTG
CCCGTCGTCG TCGTCATCGC CGACTATCAG GTCATCACGG ACCGCTCGGA TCTCGGGCCG
GTCCGCGACC GGGTGCGCAC CCTGGTCGCC GAATACCTGG CCGCCGGGCT CGATCCGGCC
CGCAGCGTGA TCTTTCCGCA CTCCGCGGTC GCGGCCCTGA ACCAGCTGAT GCTGCCATTT
CTGTCGCTGG TCACCGACGC CGAACTGCGC CGCAACCCCA CCGTCAAGGC CGAGGCACTG
GCATCCCGGC GGCCGTTGGG CGGGCTGCTG CTGACCTACC CGGTGCATCA GGCGGCCGAC
ATCCTGGGGG TGGGCGGCAC GGTCGTGCCG GTGGGGCGTG ACCAGCTCCC GCACCTGGAG
CTGACCCGGG TCATCGCCCG GCGGTTCAAC GAGCGCTACG GGCCGGTGTT CGCCCTGCCC
GAGCCGTTGC TGAGCGGCAC GCCGAACCTG CTGGGTACCG ACGGCGCGAA GATGTCCAAG
ACCCGCGGCA ACACGATCGC CCTGGGTGAC ACCGCGGACC GGACCGCAGC GATCGTCCGG
GCCGCGCAGA CCGACTCGAC CCGCCGGATC ACCTTCGAAC CGACCAGTCG ACCCCAGGTT
GCCAACCTGC TGGCGATCAT CGGCGAGATC ACCGGTCGCG ACCCGGCAGC GGTCGCCGAC
GAGATCGGGG ACGGGGGAGC GGCCGAGCTC AAACGGCAGG CCATCGAGAC CATCAACGAG
GAGCTGGCAC CGCTGCGGCG CCGCCGCGCC GAGCTGCTCG CCGATCCGGT CCAGTTGGAC
GGGGTGCTGC TCGACGGCAT CGCGGCGGCG ACCGCCGTGG CCGGGGACAC CCTGGCTCGG
GTGCGCAGTG CGATGGGGAT GGACTACCTG CGATGA
 
Protein sequence
MTAVLTPTDV HRAPEPLEQR IRRHPHTFRV LSGDRPTGAL HLGHYLGTLR NRVQLQNLGV 
PVVVVIADYQ VITDRSDLGP VRDRVRTLVA EYLAAGLDPA RSVIFPHSAV AALNQLMLPF
LSLVTDAELR RNPTVKAEAL ASRRPLGGLL LTYPVHQAAD ILGVGGTVVP VGRDQLPHLE
LTRVIARRFN ERYGPVFALP EPLLSGTPNL LGTDGAKMSK TRGNTIALGD TADRTAAIVR
AAQTDSTRRI TFEPTSRPQV ANLLAIIGEI TGRDPAAVAD EIGDGGAAEL KRQAIETINE
ELAPLRRRRA ELLADPVQLD GVLLDGIAAA TAVAGDTLAR VRSAMGMDYL R