Gene Ndas_0747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0747 
Symbol 
ID9244589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp915815 
End bp917065 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content74% 
IMG OID 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003678698 
Protein GI297559724 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.360686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.792853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACA ATCCCCGCGA GGGCGGAGCG CTCGGCTACA CCGACCAGGA CACCGAGCGC 
ATGGCCCTGG AGAACCGCAA GAACCGGGCC CGCGACCCCT TCGAGCGGGA CCGGGCCCGG
GTCCTGCACA GCGCCGCCCT GCGCCGCCTG GCCGCCAAGA CCCAGGTCGT CCAGCCCGGT
GTGAGCGACT TCCCGCGCAC CCGCCTCACC CACTCCCTGG AGTGCGCCCA GATCGGCCGC
GAACTCGGCC AGGCCCTGGG CTGCGACCCC GACCTGGTGG AGGCGGCCTG CCTGTCCCAC
GACCTGGGCC ACCCGCCCTT CGGCCACAAC GGCGAGCGCG CCCTGGACGA GGCCGCCGCC
GACTGCGGCG GGTTCGAGGG CAACGCCCAG AGCCTGCGCC TGCTCGTGCG CCTGGAGGGC
AAGGTCATCG ACCCCGACGG GCGCAGCGCG GGGCTCAACC TCACCCGCGC CACCCTCGAC
GCCACCGTCA AGTACCCCTG GCTCCGGGGC GAGGGCGGCG ACACCCACAA GTTCAACTGC
TACCCCGACG ACACCGAGGT GTTCGACTGG CTGCGCAGGG ACGCGCCCCC GGGCCGCACC
TGCTTCGAGG CCCAGGTCAT GGACTGGGCC GACGACGTCG CCTACTCCGT GCACGACGTC
GAGGACGCCC TGCACGCCGG GCTGGTGGAC CCCGCGGCCC TGCGCGGCGC CGCCGAGCGC
GCCGAGGTCG TGCGGATCGC CGCCGCCGAC TACTGCGACG CCGACCCGGC CGAACTCGAC
GAGGTCTTCA CCGACCTGAT CGCCCACCCC GCGTGGCCGC GCGAGTTCAC CGGGGACCTC
GCCTCGCTCG CCGCGCTCAA GAACCTCACC AGCGGGCTCA TCGGCCGCTT CTGCCGGGCC
GCGGAGGAGG CCACCCGCGC CGCGTACGGT CCCGGGCGCC TCACCCGTTA CGGCGGCGAC
CTCATCGTGC CCCGCCGCCC CCTGCTGGAG TGCGCCCTGC TCAAGGCGGT CGCCGCCCAC
TTCGTGCTCT CGCGGGAGGA GGCCCGGGTC TACCAGGCCG AGGAACGCCG CCTGATCACC
GAGCTGGTCG GCCTCCTGTG GAAGAACGCC CCCGAGGGCC TGGACCCGCA GTTCCGCGCC
GCCTTCACCG GGGCCGCCGA CGACTCCGCC GCCCTGCGCG TGGTCATCGA CCAGGTCGCC
TCGCTCACCG ACACCTCCGC GACCGCGCTG CACGCCAGCC TCACGGGCTG A
 
Protein sequence
MTNNPREGGA LGYTDQDTER MALENRKNRA RDPFERDRAR VLHSAALRRL AAKTQVVQPG 
VSDFPRTRLT HSLECAQIGR ELGQALGCDP DLVEAACLSH DLGHPPFGHN GERALDEAAA
DCGGFEGNAQ SLRLLVRLEG KVIDPDGRSA GLNLTRATLD ATVKYPWLRG EGGDTHKFNC
YPDDTEVFDW LRRDAPPGRT CFEAQVMDWA DDVAYSVHDV EDALHAGLVD PAALRGAAER
AEVVRIAAAD YCDADPAELD EVFTDLIAHP AWPREFTGDL ASLAALKNLT SGLIGRFCRA
AEEATRAAYG PGRLTRYGGD LIVPRRPLLE CALLKAVAAH FVLSREEARV YQAEERRLIT
ELVGLLWKNA PEGLDPQFRA AFTGAADDSA ALRVVIDQVA SLTDTSATAL HASLTG