Gene Ndas_3608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3608 
Symbol 
ID9247477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4324052 
End bp4325386 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content70% 
IMG OID 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003681514 
Protein GI297562540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATC AGCGCGTCGT CCGACCCACT CCCATCAGCG GTTTCCCCGA GTGGTCCCCT 
CAGGTCCGGG CCGTGGAGCA GCGCTGGCTG GACCATATCC GCGCCGGGTT CGAGTCCTTC
GGCTTCGCGT CGGTGGAGAC GCCGTCCGTG GAGAACCTCG ACGTGCTGAT GGCCAAGGGC
GAGACCTCCC AGGAGGTCTA CACCCTGCAC CGCCTCCAGG CCGACGCCAA GGACGACTCC
GACGCCAGAC TCGGGCTGCA CTTCGACCTC ACCGTGCCCT TCGCCCGGTA CGTCGCCCAG
CACTTCAACG AGCTGGCCTT CCCCTTCAAG CGCTACCAGA TGCAGCGCGT GTGGCGGGGC
GAGCGCCCCC AGGAGGGGCG CTTCCGGGAG TTCACCCAGT GCGACATCGA CGTCATCAAC
GTCGACTCCG TCCCGCTGCA CTTCGACGCC GAGCTGCCGC GCATCGTGCA CCGGGTGCTC
AGCGGGCTGG ACCTGCCCGC ATGGACGCTC AACGTCAACA ACCGCAAGGT CCTCCAGGGC
TTCTACGAGG GCCTGGGCAT CAAGGACCCG ATCGCGGTCA TCCGCGCCGT GGACAAGCTG
CACAAGATCG GCGACGACGG CGTGCGCCAG ATCCTGCTGG CCGACGGCCT GACCGACGAG
CAGGCCGACG CCTGCCTCGC GCTGGCCCGC GTCCGGGGCG CCGGGACCGG CGTCGCCGAG
CGGGTCCGCG GGCTCGGGGT CTCCTCCGAG CTGCTCGACG AAGGGCTGGA GGAGCTCTCC
TTCGTGCTGG AGGACCTGGC CGACCTGCCC GAGGGCAGCG TCGTGGCCGA CCTGTCCATC
GCCCGCGGCC TGGACTACTA CACGGGCACG GTCTACGAGG CCTCCTTCGA CGACGACCCC
GGCTACGGGA GCATCTGCGC GGGCGGGCGC TACGACGACC TGGCCGGGCA GTTCATCCGC
AGGCGGCTGC CGGGCGTGGG CATCTCCATC GGCCTCACCC GCATCTTCGC CAAGCTCGTC
GCCGAGGGCC GGATCGAGGA GGGGCGCGCC TGCCCCACCG ACGTGATGGT CGTGGTGCCC
GGCGCCGAGC GCCGCAGGGA GGCCCTGCTC ACCGGGGAGC TGCTGCGCGG GCGCGGGTTC
AACACCGAGG TGTTCCACTC CACCGCCAAG GTCGGCAAGC AGATCCAGTA CGCCGCCAAG
AAGGGCATCC CCTTCGTGTG GTTCCCGCCC TTCGGCGAGG ACGGCGGCCA CGAGGTCAAG
GACATGGGGT CGGGGGAGCA GGGTCCGGCC GACCCCGCCA CCTGGGCCCC GAAGGGCACG
GAAAGCGCGG AGTAA
 
Protein sequence
MSDQRVVRPT PISGFPEWSP QVRAVEQRWL DHIRAGFESF GFASVETPSV ENLDVLMAKG 
ETSQEVYTLH RLQADAKDDS DARLGLHFDL TVPFARYVAQ HFNELAFPFK RYQMQRVWRG
ERPQEGRFRE FTQCDIDVIN VDSVPLHFDA ELPRIVHRVL SGLDLPAWTL NVNNRKVLQG
FYEGLGIKDP IAVIRAVDKL HKIGDDGVRQ ILLADGLTDE QADACLALAR VRGAGTGVAE
RVRGLGVSSE LLDEGLEELS FVLEDLADLP EGSVVADLSI ARGLDYYTGT VYEASFDDDP
GYGSICAGGR YDDLAGQFIR RRLPGVGISI GLTRIFAKLV AEGRIEEGRA CPTDVMVVVP
GAERRREALL TGELLRGRGF NTEVFHSTAK VGKQIQYAAK KGIPFVWFPP FGEDGGHEVK
DMGSGEQGPA DPATWAPKGT ESAE