Gene Namu_2200 details


Gene Information

Locus tag: Namu_2200
Symbol:
ID: 8447811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2429328
End bp: 2431085
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 71%
IMG OID: 645041322
Product: prolyl-tRNA synthetase
Protein accession: YP_003201566
Protein GI: 258652410
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
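
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(2429328, 2431085)
print(length_bp)                  # 1758
print(protein_length(length_bp))  # 585
```

Both results agree with the Gene Length (1758 bp) and Protein Length (585 aa) fields of this record.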


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000481191
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000820909
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGATCACAC GGCTGTCCAC CCTGTTCCTG CGCACCCTGC GGGAGGACCC GGCGGATGCG 
GAGGTGCCCA GCCATCGGTT GCTGGTGCGC GCCGGTTACA TCCGGCGGGC GGCGCCGGGC
ATCTATACCT GGTTGCCGCT GGGCTACCGG GTGCTGCGCA ACGTGGAGCG CATCGTCCGC
GAGGAGATGG ACGCGATCGG CGCGCAGGAG GTGCACTTCC CGGCCCTGCT GCCGCGGGAG
CCCTACGAGG CGACCGGCCG CTGGACCGAG TACGGCGACA ACCTGTTCCG GCTCCAGGAC
CGCAAGCAGG GCGACTACCT GCTCGGGCCC ACCCACGAGG AGATGTTCAC CCTCCTGGTC
AAGGACATGT ACTCGTCCTA CAAGGGTCTG CCGCTCTCGC TCTACCAGAT CCAGACCAAG
TACCGGGACG AGGCCCGGCC CCGGGCCGGC ATCCTGCGCG GTCGCGAGTT CGTGATGAAG
GACTCCTATT CCTTCGACCT GGACGACGCC GGGCTGGCGG CCTCCTACCA GCGGCATCGG
GACGCCTACA TCCGGATCTT CGACCGGCTC GGCCTGCGGT ACGTCATCGT CGCCGCCATG
TCTGGCGCGA TGGGCGGGTC GGCCTCCGAG GAGTTCCTGG CCGACTGCGT CAACGGCGAG
GACACCTACG TCCGCTCACC GGCCGGCTAC GCGGCCAACG TGGAGGCGGT CACCACCCCG
GTGCCGGCGG CCATCCCGCT GACGGACCAG CCGGCCGCCC ACGTCGAGGA CACCCCCGAC
ACACCGACCA TCGACACCCT GGTCGCGCAC AGCAACGCCG CCCACCCCCG GGCCGACCGG
CCGTGGACGG GTGCGGACAC CCTCAAGAAC GTGCTGGTGA TGCTGCGGAA CCCGGACGGA
TCGCGCGAGC CGCTGGCCAT CGGGTTGCCC GGCGACCGGG AGGTCGACCT CAAGCGGCTC
GAGGCGCAGG TCGCGCCGGC CGAGGTCGAG CCGTTCGACG AGGTCGAGTT CGCCAAGCAC
CCGTCGCTGG TCAAGGGCTA CATCGGGCCG GGCGTGCTCG GCTCGACCGG CACCTCTGGC
ATCCGCTACC TACTCGACCC GCGGGTGGTG CCCGGAACCG CGTGGATCAC CGGCGCGAAC
GAGCCGGGCC GGCACGTCTA CGACCTGGTC GCGGGCCGCG ACTTCACCGC CGACGGCACC
GTCGAGGCCG CCGAGGTGCG CGAGGGTGAC CAGTCGCCCG ACGGATCCGG ACCGCTGACC
CTGGCCCGGG GCATCGAGAT GGGCCACATC TTCCAGTTGG GCCGCAAGTA CGCGCAGGCG
CTGGGCCTGC AGGTGCTGGA CGAGAACGGC AAGCTGGTCA CCGTCACCAT GGGCTCCTAC
GGCATCGGCG TGTCCCGGGC CGTCGCCGCC ATCGCCGAGT CGTCCTACGA CGACAAGGGG
CTGATCTGGC CGCGGGAGGT CGCGCCGGCC GACGTGCACG TGGTGATCGC CGGCAAGTCG
GCCGAGATCG TGGACACCGC GGAGCTCATC GCGGCCGGCC TGGACGCCGC CGGGATCACG
GTCATGCTGG ACGACCGGAA TGCCTCGGTC GGGGTCAAGT TCGCCGATGC CGAGCTGATC
GGGGTGCCGA CCATCTGTGT GGTCGGCCGG GGCGTGGCCA ACGGAGTCGT CGAGGTGCGC
GACCGGGCGA GCGGGGACAA GACCGAGGTG CCGCTGGTCG AGGTCGTCGA GCAGTTGCGA
GCCCGCGTCC GCGGCTGA
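
The GC content field of this record can be recomputed from a sequence listed in this format. The sketch below is one straightforward way to do that in Python, assuming the sequence is pasted in as a string with the same space and line-break grouping; `gc_percent` is a hypothetical helper name, and how the database itself rounds the value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (for example the spaces between
    10-base groups) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Small demonstration on the first two 10-base groups of the gene sequence.
print(gc_percent("GTGATCACAC GGCTGTCCAC"))
```

Passing the full gene sequence to `gc_percent` gives the value to compare against the GC content field.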
 
Protein sequence
MITRLSTLFL RTLREDPADA EVPSHRLLVR AGYIRRAAPG IYTWLPLGYR VLRNVERIVR 
EEMDAIGAQE VHFPALLPRE PYEATGRWTE YGDNLFRLQD RKQGDYLLGP THEEMFTLLV
KDMYSSYKGL PLSLYQIQTK YRDEARPRAG ILRGREFVMK DSYSFDLDDA GLAASYQRHR
DAYIRIFDRL GLRYVIVAAM SGAMGGSASE EFLADCVNGE DTYVRSPAGY AANVEAVTTP
VPAAIPLTDQ PAAHVEDTPD TPTIDTLVAH SNAAHPRADR PWTGADTLKN VLVMLRNPDG
SREPLAIGLP GDREVDLKRL EAQVAPAEVE PFDEVEFAKH PSLVKGYIGP GVLGSTGTSG
IRYLLDPRVV PGTAWITGAN EPGRHVYDLV AGRDFTADGT VEAAEVREGD QSPDGSGPLT
LARGIEMGHI FQLGRKYAQA LGLQVLDENG KLVTVTMGSY GIGVSRAVAA IAESSYDDKG
LIWPREVAPA DVHVVIAGKS AEIVDTAELI AAGLDAAGIT VMLDDRNASV GVKFADAELI
GVPTICVVGR GVANGVVEVR DRASGDKTEV PLVEVVEQLR ARVRG
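
The protein sequence follows from the gene sequence through translation table 11. Note that the gene starts with GTG, which is read as methionine (M) when it serves as the start codon. The Python sketch below illustrates this. It is not the database's own pipeline: `translate_cds` is a hypothetical helper, the codon table is built from the standard genetic code (whose amino acid assignments table 11 shares), and the start codon set is deliberately limited to three common bacterial start codons as a simplifying assumption.

```python
from itertools import product

# Standard genetic code in NCBI order (first base varies slowest, third fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any start codon initiates with methionine
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
print(translate_cds("GTGATCACAC GGCTGTCCAC CCTGTTCCTG"))  # MITRLSTLFL
```

The output matches the first ten residues of the protein sequence, and translating the final gene line (`GCCCGCGTCC GCGGCTGA`) on its own yields `ARVRG`, the last five residues.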