Gene Ndas_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3040 
Symbol 
ID9246896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3635421 
End bp3636719 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_003680956 
Protein GI297561982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.485663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACA TCATCGACGA ACTCCAGTGG CGCGGCCTGC TCGCGCAGAC GACCGACCTT 
GACGCACTGC GCAAGGCGCT CGCCGATGGT CCGATCACCC TGTATTGCGG ATTCGACCCG
ACGGCGGGCA GCCTGCACGT GGGCCACCTC ACCCAGATCC TGACCCTGGC CCGTTTCCAG
CAGGCAGGCC ACCGCCCGAT CGCGCTGGTC GGCGGCGGTA CCGGTCTCAT CGGCGACCCC
AAGCCCAACG CCGAGCGCCA GCTCAACTCG CTGGAGACCG TGCGGGGCTG GGTCGACAAC
CTGGGCGGGC AGCTGTCCGC GTTCCTGCGC TTCACCCCCG AGGGGGAGCA GCCGGAGCCC
ACCGACGCCG TCCTGGCCAA CAACGCCGAC TGGCTCGGCG AGATCAACGC CATCGAGCTG
CTGCGCGACG TCGGCAAGCA CTTCAGCGTC AACCAGATGC TCGCCCGGGA GACGGTGAGG
AGCCGGCTCG ACGGTGAGGG CATGAGCTAC ACCGAGTTCA GCTACGTGCT CCTGCAGTCC
TACGACTACG TCCAGCTCTA CCGCCGCTTC GGCTGCACGC TGCAGACCGG CGGCTCCGAC
CAGTGGGGCA ACATCACCGC GGGCCTGGAC CTGGTCCGCA GGATGGACGG AAACGAGCCG
CACGGCCAGG CGCACGGGCT GACCACGAAC CTGCTCACCA AGGCCGACGG CACCAAGTTC
GGCAAGACGG AGTCCGGCAG CGTCTGGCTC GACCCCGAGC TGACCTCGCC GTACGCCTTC
TACCAGTTCT GGTTCAACGC CGACGACCGA GACGTGGTCC GCTACCTGAA GACCTTCAGC
TTCCGCGACC GCGACGAGAT CGAGGAACTG GAGCGCCAGA CCGAGGAGCG TCCGCAGGCC
CGCGCCGCCC AGCGCGTCCT GGCCGAGGAC CTCACCGTCC TCGTGCACGG CGAGGAGGAG
TGCCGCAAGG TCAAGGAGGC CAGCCTGGCC CTGTTCGGCC GCGCGGACCT GTCCGACCTG
GACGCCCGCA CGCTGGAGGC CGCCCTGGCC GAGGTTCCCC GGGCCGAGGT CGACGGTCCC
GTCGACGGGC TGCTGGTGAC CGACGCGTTC GCGCGGAGCG GCCTGACGCC GAGCAAGTCC
GCGGCCCGCC GCGCCATCCA GGAGGGCGGG GCCTACGTGA ACAACGTCAA GGTCACCGAC
GTGGAGGCCG TGCTCCGCCC CGAGGACGTG CTCCAGGGAC GCTTCGTCGT GCTGCGCAAG
GGCAAACGCA ACATCGGCGG CTTGGTCCTC AACGGCTGA
 
Protein sequence
MTDIIDELQW RGLLAQTTDL DALRKALADG PITLYCGFDP TAGSLHVGHL TQILTLARFQ 
QAGHRPIALV GGGTGLIGDP KPNAERQLNS LETVRGWVDN LGGQLSAFLR FTPEGEQPEP
TDAVLANNAD WLGEINAIEL LRDVGKHFSV NQMLARETVR SRLDGEGMSY TEFSYVLLQS
YDYVQLYRRF GCTLQTGGSD QWGNITAGLD LVRRMDGNEP HGQAHGLTTN LLTKADGTKF
GKTESGSVWL DPELTSPYAF YQFWFNADDR DVVRYLKTFS FRDRDEIEEL ERQTEERPQA
RAAQRVLAED LTVLVHGEEE CRKVKEASLA LFGRADLSDL DARTLEAALA EVPRAEVDGP
VDGLLVTDAF ARSGLTPSKS AARRAIQEGG AYVNNVKVTD VEAVLRPEDV LQGRFVVLRK
GKRNIGGLVL NG