Gene Ndas_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3598 
Symbol 
ID9247467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4310862 
End bp4312610 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content72% 
IMG OID 
Productprolyl-tRNA synthetase 
Protein accessionYP_003681504 
Protein GI297562530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.393405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTACTGC GGATGTCGAC CCTGTTCCTG CGCACCCTGC GTGAGGACCC GGCGGACGCC 
GAGGTGCCGA GCCACAAGCT GCTGGTCCGG GGCGGGTTCG TGCGCCGGGC CGCACCCGGC
GTCTACACCT GGCTGCCCCT GGGCAAGATC GTCCTGGAGA ACGTCGCCCG GATCGTGCGC
GAGGAGATGG ACGCCATCGG CGCCCAGGAG GTGCTCCTGC CCGCGCTGCT GCCCCGCGAG
TACTACGAGG CCACCGGGCG CTGGGAGGAG TACGGCGACA CCCTGTTCCG CCTCAAGGAC
CGCAAGGGCG CCGACTACCT GCTCGGCCCC ACCCACGAGG AGCTGTTCAC ACTCCTGGTC
AAGGGGGAGT ACTCCTCCTA CAAGGACTTC CCGGTCACGC TGTACCAGAT CCAGGAGAAG
TTCCGCGACG AGGCGCGTCC CCGCGCGGGC GTGCTGCGCG GCCGCGAGTT CCACATGAAG
GACTCCTACT CCTTCGACAT CGACGACGAG GGCCTGCGCG CGTCCTACGC CGACCACCGC
GCCGCCTACA TCCGCGTCTT CGACCGGCTG GGCCTGGAGT ACGTGATCGT GTCGGCCACG
TCGGGCGCCA TGGGCGGATC GGCCTCGGAG GAGTTCCTGG CCGTCGCCCC GACCGGCGAG
GACACCTTCG TGCGCAGCAC GGAGTCCGAC TACGCCGCCA ACGTCGAGGC CGTGGCCGTC
CCGGCCCCCG AGGCGCTGCC GGTCGAGGGG CTGCCCGAGG CCGCGGTCCA CCACACCCCG
GACACCGCCA CCATCCAGAC CCTGGTGGAC TTCCTCAACG GCGCCGGGCT GGGCCGCGAC
TTCAGCGAGG CCGACACCCT CAAGAACGTC CTGGTCAAGA CCCGCGCGCC CGGTGCGAAG
GAGTGGGAGC TGCTGGCCGT CGGCCTGCCC GGCGACCGTG AGGTGGACTT CAAGCGCCTG
GAGGCGGCGC TGGAGCCCGC CGAGGTCGCC CTGCTGGAGG AGGCCGACTT CGCGGCCAAC
CCCTTCCTGG TCAAGGGCTA CATCGGCCCC CGCGCGCTGC TGGACAACAA GGTCCGCTAC
CTGGTCGACC CCCGGGTGGT CACCGGCACC TCCTGGGTGA CCGGTGCGGA CGAGGCCGAC
CACCACGTCG TCGGACTGGT CGCGGGCCGC GACTTCGTCC CCGACGGCAC CATCGACGTC
GCCGAGGTGC GCGACGGCGA CCCCTCGCCC GACGGCAGGG GGACCCTGTA CACCGCGCGC
GGCATCGAGA TCGGCCACAT CTTCCAGCTG GGCCGCAAGT ACACCGACGC CTTCCAGGTG
GACGCCCTGG GCCCCGACGG CAAGCCCCGG CGGATCACCA TGGGCTCCTA CGGCATCGGT
GTCTCGCGCG CCGTCGCCGC GGTCGTCGAG CAGTCCCACG ACGACAAGGG CGTCGTCTGG
CCGCGCGAGG TGGCGCCCGC CGACGTGCAC GTGGTCGGCA CCGGCAAGGG AGAGCAGATC
GAGGAGGCGC TGCGGATCGC CCGGGAGCTG GAGGCCAGGG GCCTGCGCGT CCTCGTGGAC
GACCGCAAGG GCGTCTCGCC CGGCGTCAAG TTCACCGACG CCGAACTCCT GGGCGTCCCC
ACCGGCGTCA TCGTCGGCCG CGGCCTCAAG GACGGCCTGG TGGAGCTGCG CGACCGCGCC
ACCGGCGACC GCGAGGAGGT CGCCCTGGCC GAGATCGTGG ACCGCGCCGT CGCCGCCTGC
CGCGCGTAG
 
Protein sequence
MLLRMSTLFL RTLREDPADA EVPSHKLLVR GGFVRRAAPG VYTWLPLGKI VLENVARIVR 
EEMDAIGAQE VLLPALLPRE YYEATGRWEE YGDTLFRLKD RKGADYLLGP THEELFTLLV
KGEYSSYKDF PVTLYQIQEK FRDEARPRAG VLRGREFHMK DSYSFDIDDE GLRASYADHR
AAYIRVFDRL GLEYVIVSAT SGAMGGSASE EFLAVAPTGE DTFVRSTESD YAANVEAVAV
PAPEALPVEG LPEAAVHHTP DTATIQTLVD FLNGAGLGRD FSEADTLKNV LVKTRAPGAK
EWELLAVGLP GDREVDFKRL EAALEPAEVA LLEEADFAAN PFLVKGYIGP RALLDNKVRY
LVDPRVVTGT SWVTGADEAD HHVVGLVAGR DFVPDGTIDV AEVRDGDPSP DGRGTLYTAR
GIEIGHIFQL GRKYTDAFQV DALGPDGKPR RITMGSYGIG VSRAVAAVVE QSHDDKGVVW
PREVAPADVH VVGTGKGEQI EEALRIAREL EARGLRVLVD DRKGVSPGVK FTDAELLGVP
TGVIVGRGLK DGLVELRDRA TGDREEVALA EIVDRAVAAC RA