Gene Ndas_3332 details

Gene Information

Locus tag: Ndas_3332
Symbol:
ID: 9247194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3982521
End bp: 3985016
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: leucyl-tRNA synthetase
Protein accession: YP_003681244
Protein GI: 297562270
COG category:
COG ID:
TIGRFAM ID:
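
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 3982521
end_bp = 3985016

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 2496 bp
print(protein_length, "aa")  # 831 aa
```

Both results match the Gene Length and Protein Length fields reported in the record.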


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCGG CAGACGGCGA CCAGACGACG GGCGACACCT ACGACGCCCG TGCCCTCCAG 
GACAAGTGGC AGGCGCGCTG GGCCGCTGAG CTTCCGTTCC AGGCCGACGA GGACCCCGGG
GACACCCGCC CGCGCTCCTA CATCGTCGAC ATGTTCGCCT ACCCCTCCGG CGACCTGCAC
ATGGGCCACG CGGAGGCCTA CGCCATCGGC GACGTCATCA GCCGCTACCG CCTCCAGCGC
GGCGACAACG TCCTGCACCC CGTCGGCTGG GACTCCTTCG GCCTGCCCGC CGAGAACGCG
GCCATCAAGA ACAACTCCCA CCCCGCCGAG TGGACCTACG CCAACATCGA GACCCAGGCG
GCCTCGTTCC GGCGCTACGG CATCAGCGTG GACTGGTCGC GACGCCTGCA CACCAGCGAC
CCCGAGTACT ACCGCTGGAA CCAGTGGCTG TTCACCCGCT TCTTCGAGCG CGGACTGGCC
TACCGCAAGG ACGGCCACGT CAACTGGTGC CCCCAGGACC AGACCGTGCT GGCCAACGAG
CAGGTCGTAC AGGGCAGGTG CGAGCGCTGC GGAGCCGACG TCGTGCGGCG CAGCCTCAAC
CAGTGGTACT TCCGCATCAC CGACTACGCC CAGCGCCTGC TGGACGACAT GGACCAGCTC
GACGACGGCC GCTGGCCGGA CGAGATCCTG GCCATGCAGC GCAACTGGAT CGGCCGCTCC
ACCGGAGCCG ACGTCCACTT CGCCATCGAG GGCCGCGACG AGCCCGTCAC CGTCTTCACC
ACCCGGCCCG ACACCCTCTA CGGGGCCACC TTCTTCGTCG TGGCCGCCGA CGCCGACCTG
GCCGACGAGC TGTGCGCGCC CGAACAGCGC GAGGCCTTCG ACGCCTACCG CGCCGACGTC
GCCAAGCTCT CCGACATCGA GCGCCAGGCC ACCGAGCGCC CCAAGACCGG CGTGTTCCTG
GGCCGCTACG CGATCAACCC CGTCAACGGC GAGCGCATGC CCGTGTGGGC GGCCGACTAC
GTCCTGGCCG ACTACGGCCA CGGCGCCATC ATGGCCGTGC CCGCGCACGA CCAGCGCGAC
CTGGACTTCG CCCTGGCCTT CGACCTCCCG GTCCGGGTCG TGGTGGAAAC CAGCGAACCC
GACCCCGTCG AGACGGGCAC GGCCACCGCG GGGGAGGGCG TCCTGCGCGA CTCCGGCCCG
CTGGACGGCC TGGCCAAGAC CGAGGCCATC GAGCGCATCA TCGAGATCCT GGCCGAGCGC
GGCACCGGAC AGGGCACGAT CAACTACCGC CTGCGCGACT GGCTGCTGTC CCGCCAGCGC
TACTGGGGCA CCCCCATCCC GATCATCCAC TGCCCCTCCT GCGGCGAGGT GCCCGTCCCC
GACGAGCAGC TGCCAGTGAC CCTGCCCGAA CTCAAGGGCG CCGAGCTGGC CCCCAAGGGC
GTCTCGCCCC TGGCCGCGGC CACCGAATGG GTCAACGTCG ACTGCCCCTC CTGCGGCGGC
CCCGCCGAGC GCGACACCGA CACCATGGAC ACCTTCGTGG ACTCGTCCTG GTACTTCCTG
CGCTACTGCT CGCCGCGCCT GGACACCGCC CCCTTCGACA CCGAGGCGGT GGAGAAGTGG
GGACCGGTGG ACCACTACAT CGGCGGCAAG GAGCACGCCA CGCTCCACCT GATGTACGCC
CGGTTCTTCA CCAAGGTCCT GTACGACATG GGCATGGTGT CCTTCACCGA GCCCTTCCGC
CGCCTGACCA ACCAGGGCCA GGTCATCAAC CAGGGCCGCG CCATGTCCAA GTCCCTGGGC
AACGGCGTGG ACCTGGGCAA GGAGATCGAC GCCTACGGCG TGGACGCGGT CCGGCTGACC
ATGCTCTTCG CCTCCCCGCC CGAGGAGGAC GTGGACTGGG CCGACGTCTC CGTCGTCGCC
GCGCAGAAGT TCCTCAACCG CGCCTACCGG GTGATGAGCG AGGCGGGCGC GGCCAGCGCG
CCGGGAACCG ACCCCGCCAA GGGCGACACC GACCTGCGCC GGACCACGCA CCGCACCGTC
GACCAGATCA CCGCCCTGGT CGAGTCCCGC CGGTTCAACG TGGCCATCGC CCGCACCATG
GAACTGGTCT CGGCCGTCCG CAAGGCCATC GACTCCGGCC CCGGCGCCGC CGACCCGGCC
GTCCGCGAGG CCGCCGAGGC CGTGGCCGTC TCCCTGTCCC TGTTCGCGCC CTACGTGGCC
GAGGAGGGCT GGGAGAAGCT CGGCCGCAGC GGCAGCGTCG CCGTCGGCAA CTGGCCCGAG
GTCGACCCGG CCCTGCTGGT GCAGGAGTCC GTGACCTGCG TGGTGCAGGT GCAGAGCAAG
GTCCGTGACA AGCTGTCCGT GGCGCCCGAC ATCGACCCGG CGGAGCTGGA GCGGCTGGCC
CTGGCCTCGG AGAAGGCACA GGGCTTCATC GGCGACAAGC AGATCCGCAA GGTGGTCGTG
CGCGCGCCCA AGCTGGTGAA CATCGTCGTC GGCTGA
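
The gene sequence is listed in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a rough coding-sequence check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and only the first and last lines of the listing are used as a sample, so the printed GC value describes that sample rather than the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Rough check only: length divisible by 3, ATG start, stop codon at the end.

    Table 11 also permits alternative start codons, which this check ignores.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First and last lines of the listing above, used as a small sample only.
first_line = "ATGACAGCGG CAGACGGCGA CCAGACGACG GGCGACACCT ACGACGCCCG TGCCCTCCAG"
last_line = "CGCGCGCCCA AGCTGGTGAA CATCGTCGTC GGCTGA"

sample = clean_sequence(first_line + "\n" + last_line)
print(len(sample), looks_like_cds(sample), round(gc_fraction(sample), 2))
```

Passing the full listing to `clean_sequence` instead of the two sample lines would allow the result to be compared with the Gene Length and GC content fields of the record.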
 
Protein sequence
MTAADGDQTT GDTYDARALQ DKWQARWAAE LPFQADEDPG DTRPRSYIVD MFAYPSGDLH 
MGHAEAYAIG DVISRYRLQR GDNVLHPVGW DSFGLPAENA AIKNNSHPAE WTYANIETQA
ASFRRYGISV DWSRRLHTSD PEYYRWNQWL FTRFFERGLA YRKDGHVNWC PQDQTVLANE
QVVQGRCERC GADVVRRSLN QWYFRITDYA QRLLDDMDQL DDGRWPDEIL AMQRNWIGRS
TGADVHFAIE GRDEPVTVFT TRPDTLYGAT FFVVAADADL ADELCAPEQR EAFDAYRADV
AKLSDIERQA TERPKTGVFL GRYAINPVNG ERMPVWAADY VLADYGHGAI MAVPAHDQRD
LDFALAFDLP VRVVVETSEP DPVETGTATA GEGVLRDSGP LDGLAKTEAI ERIIEILAER
GTGQGTINYR LRDWLLSRQR YWGTPIPIIH CPSCGEVPVP DEQLPVTLPE LKGAELAPKG
VSPLAAATEW VNVDCPSCGG PAERDTDTMD TFVDSSWYFL RYCSPRLDTA PFDTEAVEKW
GPVDHYIGGK EHATLHLMYA RFFTKVLYDM GMVSFTEPFR RLTNQGQVIN QGRAMSKSLG
NGVDLGKEID AYGVDAVRLT MLFASPPEED VDWADVSVVA AQKFLNRAYR VMSEAGAASA
PGTDPAKGDT DLRRTTHRTV DQITALVESR RFNVAIARTM ELVSAVRKAI DSGPGAADPA
VREAAEAVAV SLSLFAPYVA EEGWEKLGRS GSVAVGNWPE VDPALLVQES VTCVVQVQSK
VRDKLSVAPD IDPAELERLA LASEKAQGFI GDKQIRKVVV RAPKLVNIVV G