Gene Ndas_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4156 
Symbol 
ID9248030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4962163 
End bp4963908 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content71% 
IMG OID 
Productarginyl-tRNA synthetase 
Protein accessionYP_003682057 
Protein GI297563083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.204707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC CGCAGGAAGT ACTCTCGCGA CGGGTCCAGT CCGCCCTCGG CGCCGCCTTC 
GGCCCCGAGT TCGCCGACAC CGACCCCGTC ATCCGCCCGT CCCAGTTCGC CGACTACCAG
GCGAACGCCG CGCTCGCGCT CGCCAAGCGA CTGGGCCGAA AGCCGCGTGA GGTGTCCGCG
GCGATCATGG AGCACCTGGA CGTGGCCGAC GTCTGCCGCG ACGTCGAGGT CAGCGGGCCG
GGCTTCATCA ACCTGACGCT GCGCGAGGAC TGGATCGCCG GGCAGACCCG CCGGCTCCTG
GACGACCCGC GGCTGGGCGT CCCCGAGCAG GAGCGCCAGA ACATCCCGCT GGACTACTCC
GCGCCCAACG TGGCCAAGGA GATGCACGTC GGGCACCTGC GCACCACCGT GGTCGGCGAC
GCGCTCGCGC GCACCCTGGA GTTCCTGGGC CACAACGTCA TCCGCCAGAA CCACATCGGC
GACTGGGGCA CCCCCTTCGG CATGCTCATC GAGCACCTGC TGGAGGTGGG CGAGGACTCC
CCCGAGGCGG ACCTGGTCAC CACCGACCCC AACGCCTTCT ACCAGGCGGC CCGGACCAAG
TTCGACTCGT CGGGCCCCGG CCCCGACGAC TTCGCCGCGC GCGCCCGCCG CCGCGTGGTC
TCCCTCCAGA GCGGCGACGA GGAGACGCTG CGCCTGTGGC GCAAGCTCGT CGGCCTCTCC
AAGGTCTACT TCAACAAGGT GTACGCCAAG CTCGGCGTGA CCCTCACCGA CGAGGACCTG
GCCGGGGAGA GCAGCTACAA CGCCGCGCTC GCCGACGTGT GCGAGGAGCT GGAGGCCAAG
GGCATCGCCG AGATCAGCGA CGGTGCCCTG TGCGTGTTCC TGGAGGGCTT CACCGGCCGC
GAGGACAAGC CCGTCCCGCT GATCGTGCGC AAGAGCGACG GCGGCTACGG CTACGCCACC
ACCGACCTGG CCACCGTCCG CCACCGCGTG GACGACCTCA AGGCCGACCG CATCCTGTAC
GTGGTCGGCG CACCGCAGGC CATGCACTTC AAGATGGTGT GGGCCACCGC CCGCGAGGCC
GGATGGCTGC CGGAGGGCGT GGAGACCGTC CACGTGCAGA TCGGCAACGT GCTGGGCAGC
GACGGCAAGA TCCTGCGCAC CCGCAGCGGC GCCCCGGTGC GCCTCATGGC GCTGCTGGAC
GAGGCGATCG AGCGCGCCGC CGCGGTGGTC GCGCAGAACC GCCCCGATCT GGACGCCGAG
GGCCAGGCCG AGATCGCCCG CCAGGTGGGC ATCGGCGCGG TCAAGTACGC CGACCTGTCG
GTCGCGCACG ACACCGAGTA CACCTTCGAC TTCGACCGGA TGCTGGCCCT GACCGGCAAC
ACCGGCCCGT ACCTGCAGTA CGCCCAGGCC CGGATCCGGT CCATCTTCCG CAAGGGCGGC
CTGGAGCTGT CGCAGGCCAG GGGGGACATC AGCGTCTCCG AGGGCGCCGA GCGCGACCTG
GCGCTCAAGC TGCTGGAGTT CGGCGCGGTG GTGGCGCAGG TCGGCGACCT GCTCCAGCCG
CACCGGCTGT CCACGTACCT GTTCGAGCTG GCGCAGTCCC TGACGGCCTT CTACGAGCAC
TGCCCGGTGC TGACCGCTCC GAACGAGGCC GAGCGCGAGT CGCGCCTGGC GCTGGTCGCG
GTGGCGTTCC GGGTCCTGGT CCAGGGTCTG GACCTGCTGG GCGTGGAGGC TCCCGAGCAC
ATGTAG
 
Protein sequence
MADPQEVLSR RVQSALGAAF GPEFADTDPV IRPSQFADYQ ANAALALAKR LGRKPREVSA 
AIMEHLDVAD VCRDVEVSGP GFINLTLRED WIAGQTRRLL DDPRLGVPEQ ERQNIPLDYS
APNVAKEMHV GHLRTTVVGD ALARTLEFLG HNVIRQNHIG DWGTPFGMLI EHLLEVGEDS
PEADLVTTDP NAFYQAARTK FDSSGPGPDD FAARARRRVV SLQSGDEETL RLWRKLVGLS
KVYFNKVYAK LGVTLTDEDL AGESSYNAAL ADVCEELEAK GIAEISDGAL CVFLEGFTGR
EDKPVPLIVR KSDGGYGYAT TDLATVRHRV DDLKADRILY VVGAPQAMHF KMVWATAREA
GWLPEGVETV HVQIGNVLGS DGKILRTRSG APVRLMALLD EAIERAAAVV AQNRPDLDAE
GQAEIARQVG IGAVKYADLS VAHDTEYTFD FDRMLALTGN TGPYLQYAQA RIRSIFRKGG
LELSQARGDI SVSEGAERDL ALKLLEFGAV VAQVGDLLQP HRLSTYLFEL AQSLTAFYEH
CPVLTAPNEA ERESRLALVA VAFRVLVQGL DLLGVEAPEH M