Gene Ndas_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4013 
Symbol 
ID9247885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4798965 
End bp4801397 
Gene Length2433 bp 
Protein Length810 aa 
Translation table11 
GC content74% 
IMG OID 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003681916 
Protein GI297562942 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.48932 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCCCA ACCGACCCGG ACGCCTGGTC GCGTCCACGT CCGCCGCGGC GGCGGTGCTC 
CTGGGCCTCG GGCTCACGTC CGCGCCGTCC GCCCTCGCCG ACACCGCCAC CCCGGTGGTC
AGCGAGGTCT ACGGCGGAGG CGGCAACAGC GGCGCGGACC TGCGCCACGA CTTCGTCGAA
CTGGGCAACC CCACCGGCGA GGCCTTCTCC CTCGACGGCT GGAGCGTGCA GTACCTGCCC
GCCAACCCCA GGCCCGCCAC CCAGGTGCAG GTCACGGCAC TGAACGGCGA GATCGGCGCG
GACGGGTACT ACCTCGTGCG CCAGGCGGCG GGCGCCGGGG GCAGCACCGA CCTCCCCGCC
CCCGACGCCA CCGGCGGCAC CAACATGTCC GCCACCTCCG GCACGGTCCT GCTCGTGCGG
GGCACCGACC CCGTCACCTG CCGGACCGCC GCGGAGTGCA CCGCGGACGC GTCCGTCGCC
GACCTCGTCG GCTACGGGGA CGCCGTCGTC CACGAGGGTG CGCCCGCGCC CCGGCTGGGC
AACACCACCT CCGCCTCGCG CGACGACGCC CTCACCGACA CCGGCGACAA CGCGGCGGAC
TTCACGGCCG CGGCCCCCTC GCCCACCAAC ACCGCGGGCC AGACACCCGG CGGGGGCGGC
ACCGACCCCG AGGAACCCGT CGAGCCCGGC GACCTGCGCG TCCACGACGT CCAGGGCACC
ACCCGGACCT CCCCCTACGC CGGACAGCAG GTCACGGGCC TGCCGGGCGT GGTCACGGCG
GTCAACCCCT TCGGCGGCGC GCGGGGGTTC TGGTTCCAGG ACACCGAGGG CGACGGCGAC
CCGCGCACCA GCGAGGGGCT GTTCGTCTTC ACCGGGTCCA CCACCCCGGA CGTGGCCGTC
GGCGACGAGG TCCTGGCCGC CGGACGGGTC AGCGAGTACA GCCCCGCCTC CGGTGCGCAG
ACCATCACCC AGCTCGACCA GGCCCGCTGG ACCGTCCTGT CCTCGGGCAA CGCCGCGCCC
GACCCGGTCC TGCTCGACGA GGACGCCGTC CCCGAGGCGT ACGCGCCCGA ACCCGGCGGC
GACGTCTCCG CGCTCGACCT GGAACCCGCG GAGTACGCCC TGGACTTCTG GGCCGCGCAC
GAGCACATGC TCGTCCGCGT GGAGGACGCG CCCGTCGTCG GCGCCACCGA CGACTACTCC
GCGCTCTGGG TGACCACCAA GCCCGGACAG AACCCCACCG TCAACGGCGG CACCCGCTAC
GGCTCCTACG GCGACCCCAA CTCCGGACGG ATCAAGGTCG AGTCGCTGCT GGACCGCGAC
CAGCACCCCT TCCCCGAGGC CAACGTGGGC GACACCCTCG CCGGGGTCAC CGAGGGACCC
CTCTACTACA GCCGGTTCGG CGGCTACCTC ATCCGCGCCA CCACCCTGGG CGGGCACGTG
CGGGGCGGCC TGGAGCGCAC CGCCGTCCGC GACGCGCGGC GCCACGAGGT CAGCGTCGCC
ACCTACAACG TCGAGAACCT CGGCGGGCGC GACGACCAGG CCAGGTACGA CGCGCTGGCG
GCCGGGATCG TCGAGTCCCT GAACTCGCCC GACATCATCG GCCTGGAGGA GATCCAGGAC
AACACCGGCC CGACCGACGA CGGCGTCGTG GACGCCGACG TCACCCTGGA CCGGCTGGTG
GAGGCCGTGG AGGAGCACGG CGGCCCCGCC TACGAGTGGC GGCAGATCAG CCCCGAGGAC
AAGCAGGACG GCGGCCAGCC CGGCGGCAAC ATCCGCAACG CGTTCCTGTT CGACCCCGAG
CGCGTGCAGT TCGTCGACCG CGAGGGCGGC GACGCGACCA CCGCCGTGGA GGTCGTGGAG
GGCGGGCGCG GCGCCGAGCT GTCGGTGTCC CCGGGCCGGA TCAGCCCGCG GGACGGGGCC
TGGGACTCCA GCCGCAAGCC GCTGGTCGGC CACTTCCGCG CCCTGAACCG GGACGTCTAC
GTGGTCACCA ACCACTTCAA CTCCAAGGGC GGGGACGAGT CCCTGCACGG CGTCCACCAG
CCGCCGCGGC GCACCAGCGA GGCCCAGCGC CACGCCCAGG CCGGGCTCGT GCGCGACTTC
GCCGAGGAGC TGCTGGCCGT GGACCCCGAG GCCAACCTGG TGGTCATGGG CGATCTGAAC
GACTTCCAGT TCTCCCGGAC CCTGGAGATC CTGACCGCCG ACGGGCCGCT GCACAACCCG
ATGACGGACC TGCCCGTGGA GGAGCGCTAC AACTACGTCT TCGACGGCAA CTCCCAGGCG
CTGGACCACA TCCTGGTCAA CCAGGCGCTC GCGGGCCGGG TCGAGTACGA CATCGCGCGG
ATCAACTCCG AGTTCTCCGA CCAGGTCAGC GACCACGACC CCCAGGTGCT GTGGCTGGAC
ACCCGCCGCG GCACCCCGCC CGGGCGCCCC TGA
 
Protein sequence
MSPNRPGRLV ASTSAAAAVL LGLGLTSAPS ALADTATPVV SEVYGGGGNS GADLRHDFVE 
LGNPTGEAFS LDGWSVQYLP ANPRPATQVQ VTALNGEIGA DGYYLVRQAA GAGGSTDLPA
PDATGGTNMS ATSGTVLLVR GTDPVTCRTA AECTADASVA DLVGYGDAVV HEGAPAPRLG
NTTSASRDDA LTDTGDNAAD FTAAAPSPTN TAGQTPGGGG TDPEEPVEPG DLRVHDVQGT
TRTSPYAGQQ VTGLPGVVTA VNPFGGARGF WFQDTEGDGD PRTSEGLFVF TGSTTPDVAV
GDEVLAAGRV SEYSPASGAQ TITQLDQARW TVLSSGNAAP DPVLLDEDAV PEAYAPEPGG
DVSALDLEPA EYALDFWAAH EHMLVRVEDA PVVGATDDYS ALWVTTKPGQ NPTVNGGTRY
GSYGDPNSGR IKVESLLDRD QHPFPEANVG DTLAGVTEGP LYYSRFGGYL IRATTLGGHV
RGGLERTAVR DARRHEVSVA TYNVENLGGR DDQARYDALA AGIVESLNSP DIIGLEEIQD
NTGPTDDGVV DADVTLDRLV EAVEEHGGPA YEWRQISPED KQDGGQPGGN IRNAFLFDPE
RVQFVDREGG DATTAVEVVE GGRGAELSVS PGRISPRDGA WDSSRKPLVG HFRALNRDVY
VVTNHFNSKG GDESLHGVHQ PPRRTSEAQR HAQAGLVRDF AEELLAVDPE ANLVVMGDLN
DFQFSRTLEI LTADGPLHNP MTDLPVEERY NYVFDGNSQA LDHILVNQAL AGRVEYDIAR
INSEFSDQVS DHDPQVLWLD TRRGTPPGRP