Gene Ndas_4954 details

Gene Information

Locus tag: Ndas_4954
Symbol:
ID: 9248842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 95798
End bp: 97921
Gene Length: 2124 bp
Protein Length: 707 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: thymidylate kinase
Protein accession: YP_003682842
Protein GI: 297563869
COG category:
COG ID:
TIGRFAM ID:
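
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 95798
END_BP = 97921
GENE_LENGTH_BP = 2124
PROTEIN_LENGTH_AA = 707


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in an unspliced CDS, minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2124
print(expected_protein_length(GENE_LENGTH_BP))  # 707
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.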


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.768884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.910988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGAT CTGCACCCCT GGGAGCGCCG GCCGAGGCGC GCAACGTTCT TGCGATCACA 
CCCTTCCGAA GGCTGTGGAT CTCTCTCTCC CTCTCCAGCC TGGGCGACTG GCTCAGCCTG
CTGGCACTGG TGTCCCTGGC GACCGTCTTC ACCGCCGACG GATCCCAGCT CGTCCAGTAC
CTGGCCGTCA GCGGCGTCGT GGCGATCAAG CTCGCCCCGT CGATCCTGCT CAGCCCCCTG
GTCGGGTCGG TCGCCGACCG CCTCGACCGC AGGTGGACGA TGGTGGTCGG CGACGTCCTG
CGCGCCGTGC TGTACGTCTC CATCCCGGTC GTGGGCCTCC TCTTCCCCGG CTTCGCCCTG
GAGTGGCTCC TCATCGCCAC CCTCCTGGCC GAGGTCGTCG CCCTGTTCTG GACCCCGGCC
AAGGACGCCA CGGTCCCCAA CCTGGTGCCC CGCAAGCTGC TGGGGCAGGC GAACCGGCTG
AGCCTGCTCA CCGCCTACGG CACCGCGCCC GTCGCGGCGC TGCTGTTCGC CGCGCTCGCC
TCGGTCAGCA ACGTGCTCGG CGCGTTCCTG CCCTCCATGG CCAGCCCCGA GGCCGACGTC
GCCCTCTACC TCAACGGCCT CACCTTCGTC GTCGCGGCGG TCGTCGTCGC CGGGCTGCCG
ATCCCCAGGC ACAAGCCCTC CAAGGACGCG GAGAGCGACA CCCGCGACAG CGGCATCCTG
CGCGCGCTGG GGACCGGTCG GCGCCACGCG GGCGGCACGC CCCGCGTGCG CGGCCTGGTT
CCGGGCATGC TCTGCGTCGT GGCCGCCGGG GGCGTGGTCA TCGGTGTCGG CCGCGTGCAC
GTCGAGGGCC TGGGCGCGGG CAACGCCGGT TTCGGCGTGG TCTTCGCCGC GGTCTTCGCG
GGCATGGCCC TGGGCGTGCT CGCCGGGCCG CGCGTCCTCA AGCAGTTCAG CCGCAGCCGG
CTGTTCGGTC TGAGCATCGC CCTCGCCGGG CTGGCCCTGC TGTTCGCCGG GGCCGTCGCC
GACATGGTGC TCACCGCCGT GCTCACCGCG CTGCTCGGCG TGGGCGCGGG TATCGCCTGC
GTGATCGGCC TGGCGGTGTT CGACCGCGAG GTGGAGGACG AGCACCGGGG TTCCGCCTTC
GCCTTCCTGC ACGGCGCCGC CCGCGTCACC CTGGTCGGCG CCGCCGTGCT CGCCCCGCTG
GCCGCCGGGC TCATCGGCAG TTACCGGATC CCGGTCGGCC CCCTGAGCTA CGACCTGCGC
GGCAGCGGCC TCGTCCTCAT GCTCTCCGGC CTGGCCGTCC TGGTCGTGGC GCTGGTCTGC
TACCGGCGGA TGAACCGCCG GGACGACCCC GAGGCCGGTC CCGGCCTGCT CCCGGAGCTG
TTCGCGGCGC TGCGCGGCGT CGCGATCGCG CCGGAGGAGG ACGAGGAGGC CAGGCTCGCG
GGCGCGTTCA TCGTCGTCGA GGGCGGCGAG GGCGCGGGCA AGTCCACCCA GGTGCGCGAG
CTGACGGTGT GGCTGCGCGA CCAGGGGTTC GAGGTGATCG GCACCCGCCA GCCGGGCGCG
ACCAAGCTCG GCATGCGCCT GCGCGGCCTG CTCCTGGACC GGGAGAACTC GCACATCACC
CCGCGCGCCG AGGTGCTGCT CTACGCGGCC GACAAGGCCG ACCACGTCCA GCAGGAGATC
CTGCCCGCCC TGCGGCGCGG CGCGGTCGTC ATCAGCGACC GCTACGTGGA CTCCCTGCTG
GCCTACCAGG GCTCGGGGCG CGACCTGTCC TCGGACGAGA TCCGCCGGAT CAGCGACTGG
GCCACGCAGG GCCTGGTTCC GGACCTGACG GTGCTGCTCG ACGTGCGGCC GGAGGACGGC
CTGTCCCGCC TGGGCGGCCC GGCCGACCGC ATCGAGGGCG AGCCTGCGGA GTTCCACGAC
CGGGTCCGCC GGGGCTTCCT GGAGCTGGCC AGGGCCGCGC CGGAGCGCTA CCTGGTGCTC
GACGCCCGCG AGCCGCAGGA CAGGATCACC CGCGAGATCC AGCGCCGGGT GCGCTCCCTG
CTGCCCGACC CGGTCCCGAG CAGCGCCGAG GCCGTCACCG GCATGATCCC GGTGATCAGG
AACGACGAGG TCGGACAGGG CTGA
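
A GC content figure such as the one in the Gene Information fields can be recomputed from the gene sequence. This is a minimal sketch, not the database's own method: the helper name `gc_content` is hypothetical, and how the record rounds its percentage is not stated. The function strips the spaces and line breaks used in the 10-base display blocks before counting.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display row of the gene sequence, used here only as sample input;
# paste the full sequence to compare against the record's reported value.
first_row = "ATGAGCAGAT CTGCACCCCT GGGAGCGCCG GCCGAGGCGC GCAACGTTCT TGCGATCACA"
print(f"GC fraction of the first row: {gc_content(first_row):.2f}")
```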
 
Protein sequence
MSRSAPLGAP AEARNVLAIT PFRRLWISLS LSSLGDWLSL LALVSLATVF TADGSQLVQY 
LAVSGVVAIK LAPSILLSPL VGSVADRLDR RWTMVVGDVL RAVLYVSIPV VGLLFPGFAL
EWLLIATLLA EVVALFWTPA KDATVPNLVP RKLLGQANRL SLLTAYGTAP VAALLFAALA
SVSNVLGAFL PSMASPEADV ALYLNGLTFV VAAVVVAGLP IPRHKPSKDA ESDTRDSGIL
RALGTGRRHA GGTPRVRGLV PGMLCVVAAG GVVIGVGRVH VEGLGAGNAG FGVVFAAVFA
GMALGVLAGP RVLKQFSRSR LFGLSIALAG LALLFAGAVA DMVLTAVLTA LLGVGAGIAC
VIGLAVFDRE VEDEHRGSAF AFLHGAARVT LVGAAVLAPL AAGLIGSYRI PVGPLSYDLR
GSGLVLMLSG LAVLVVALVC YRRMNRRDDP EAGPGLLPEL FAALRGVAIA PEEDEEARLA
GAFIVVEGGE GAGKSTQVRE LTVWLRDQGF EVIGTRQPGA TKLGMRLRGL LLDRENSHIT
PRAEVLLYAA DKADHVQQEI LPALRRGAVV ISDRYVDSLL AYQGSGRDLS SDEIRRISDW
ATQGLVPDLT VLLDVRPEDG LSRLGGPADR IEGEPAEFHD RVRRGFLELA RAAPERYLVL
DAREPQDRIT REIQRRVRSL LPDPVPSSAE AVTGMIPVIR NDEVGQG
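
The relationship between the two sequences can be illustrated by translating the start and end of the gene sequence. This is a hedged sketch: it builds the codon table from the standard NCBI amino-acid string, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons (this gene begins with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGAGCAGAT CTGCACCCCT GGGAGCGCCG"))  # MSRSAPLGAP
print(translate("AACGACGAGG TCGGACAGGG CTGA"))        # NDEVGQG
```

Both fragments match the first and last residues of the protein sequence listed above.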