Gene Ndas_0627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0627 
Symbol 
ID9244469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp770074 
End bp773061 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content78% 
IMG OID 
Productexonuclease 
Protein accessionYP_003678579 
Protein GI297559605 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.246624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGC ACACCCTCAC CGTCCAGGCG TTCGGCCCGT TCGCGGGCAC CGAGGAGGTG 
GACTTCGACC GCCTCAGCGC GGGCGGCCTC TTCCTCATCC ACGGCCCCAC CGGCGCGGGC
AAGACCACCG TGCTGGACGC GGTGTGCTTC GCCCTGTACG GCAACGTGCC GGGCGCCCGG
GGCAAGGACC GCTCCCCCAA GAGCGACCAC GCCCCGCTGG ACGCCGAGCC CAGGGTGGTC
CTGGAGTTCA CGGTCCAGGG GCGCCGCATC TGGATCGAGC GCAAGCCGCG CTGGGACCGC
CCCAAGAAGC GGGGCACCGG CACGGTGTCG CAGAACACCA AGGTCGTCGT CAAGGAGTTC
ACCGGCGGCA GGTGGGAGGG CGTCACCACC CGCCCCGACG AGGCGGGGCA GTTCGTCGGC
GACCTGCTGG GCCTGACCCT CGCGCAGTTC TGCCAGACCG TCATGCTCCC CCAGGGCGAC
TTCGCGCGGT TCCTGCGCGC CAAGTCCGAG GAGCGCCGCG AGTCCCTGGA GCGCATCTTC
AACACCCGGC TGTTCCGGGA CGTGGAGACG TGGCTGGGAG CGCACGCCAA ACGGCTCGAA
CACGAGGTCC GGACGGCCAA CGCCCAGGTG CGCGCGGTCG CGGACCGCAT CGCCGAGGTC
GGCGGGTCCC GGGCCCCCGA ACCGCCCGAG GAGCTGTCGG CCTGGGCCGC GGAGCTGGCC
TGCGTCACCG CCGCGACCGC ACGCGACGCC GGGCGGGTGG CCGGGGAGTT CACCGGGGCC
CGTGAGGAGG CCCAGCGCGC CCTGTCCGAG GGGCGTGAGC TGCGCGCGCT CCAGGAGCGG
CTGGCCGCCG CCCGCGGGCG CCGCGCCCAG CTGGCCGAGC ACACCGCGTG GCGGGCCGCG
CTCGACGCCC AGCTCGACGA CGCCGGGCGC GCCGAGTCGG TGCTGCCCTT CCTGCGCGCC
CGCGACAACC GCCGCACCGA GCTGGACAAG GCGGAACTCG CCGTCGCCGA CCACCTGGCG
CTGGTGGGCG GGCTGCCCGG CTTCTCCGCG CGCTCCGACG GGGCGGTGCC GCGCGAGGAG
CTGGGCGCCG CCGAGCGGGA GCGCCGGGAC GAACTGGCCC GTCTGGACGC GCTGCGCGCG
GACGCCGAGC GCCGCCGGTC CCTGGGCCGC TCCGTCACCG CGCTCACGGG GCGCCTGGAC
GAGATCGCCG CGCAGCTGGA TCGCCAGCGC GCGCTGGCCG CCTCCCTGCC CGCCCGGGTG
GAGGCGCTCA CCGCCGAGCT GCGGCGCCTC CACGACCAGG CCGGACAGGA GGGGGCCGCC
GAGACCGCCC TGGCCGCCGC CCGCAGGCGC CTCGACGCCG CCGCCGAGTA CGAGCGCCTG
GGCCGGGAGC TCTCGGAGGC GCAGGAGCGC CACCGGGAGG CCGTGGACGG GGCCCAGGCC
GCCCGCGACC GCGCCCTCGA CCTGCGCGAG CGCCGGATCA CCCACATGGC CGCCGAACTG
GCGTCCGGCC TGGTCGACGG GGAGCCGTGC GCGGTGTGCG GGTCGGTGGC GCACCCCTCC
CCCGCCCGGC CCTCCGGCGA GGGCCTGGTC TCCGCCGAGG AGGAGCGGCG GGCCCAGGCG
GCCGCCGACG CCGCGGCCTC GCGCCGCGCG GAGGCCGAGA GCGCCGCGGT GGCCCTGCGG
GAGCGGCGCA CCGCCGCGCG GGAGCGCGCC GAGGACGCCA CCGCCGACGC CGCCCGTCAG
GAGGTGGAGG CGCACACCGC GGCGCTGGCG GAGGCCCGGG CCGCCGCCGG TGAGGCCCGG
CGGGTGGAGG AGGAGCTGGA GCGCACCACC GGCGACCTGG AGCGCGCCCG CACCCGCGAG
GGCGAGCTGG TCCGGCAGGA GGCCCAGGTC GGGGCCGACC GGGACAACGC CGTCCGCGAG
CACGGGAGGC TGACCGCGCT GCTGGACCGG GCGCGCGGGG ACGACGCCGG GCTGGACGAG
CGGATCGCCA GGCTGGGCGG CGAGGCCGAC CTGCTGCGCG CCGCGGCCGA GGCGGCCGTC
AACCGCGACC GCGCGGCCGA CGAACTGCGC GCCGCCGTCG CCGAGGCCGA GCGGCAGCGG
GCCGAGGCCC GCTTCGCCGA CGAGGCCGGG GTGTGGGAGG CCGCGCTGGA CGAGGAGCGG
CGCCGCGCGC TGCGCGAACG CGCCCGCTCC TTCGACGACG CCCTGGCCGC CGTCGAGGCG
TCGCTGGCCG ATCCGGAGCT GACCGCGGCC GGGGAGCTGC CCGTCCCCGA CCTGGAGGAG
CTGGCCCGCG CCGCGCGGAC CGCGTCGGAG GCCGCCGACC GCGCGGTGGC CTGGCGGAGC
ATGCTGGAGG AGCGGGCGCG TCGGCTCGCC TCCCTGCGCG GGGAGCTGGA CCTGCGGCTG
AGGGACTGCG AACCGGCGCT GCGCCGCTAC GCGGTGGCCG AGGGGCTGCG CGGCCTGACC
GCCGGGACCT CCTCCGACAA CGCCGACAAC GTGCGTCTGT CGGCGTACGT TCTGGCGGCC
CGCCTGGAGC AGGTGGTGGC CGCCGCCAAC GACCGCCTGG TGACGATGTC GGACGGCCGG
TACGAGCTGC GGTACACGGT GGACAAGGCC GCCGGGGACG GGCGCGCGCG TTCGGCCGGC
GGCCTGGGCA TGCGGGTGGT GGACGCCTGG ACGGGTGTGG AGCGCGACCC GGCCACGCTC
TCGGGCGGGG AGACGTTCTT CAGCTCGCTG GCGCTGGCCC TGGGGCTGGG CGACGTGGCC
AGTGCCGAGG CGGGCGGCGC CGACATCGAC ACGCTGTTCG TGGACGAGGG GTTCGGCACG
CTGGACGAGG ACACCCTGGA GGAGGTGCTG GACGTCCTGG ACCGGCTGCG GGACGGCGGC
CGGGCGGTGG GTGTGGTGAG CCACGTCGCC GACCTGCGGC AGAGGGTGTC CGCCCGGCTG
AAGGTGGTCA AGACGGCGGC GGGCTCGCGG GTGGAGCACA CCGGCTGA
 
Protein sequence
MRLHTLTVQA FGPFAGTEEV DFDRLSAGGL FLIHGPTGAG KTTVLDAVCF ALYGNVPGAR 
GKDRSPKSDH APLDAEPRVV LEFTVQGRRI WIERKPRWDR PKKRGTGTVS QNTKVVVKEF
TGGRWEGVTT RPDEAGQFVG DLLGLTLAQF CQTVMLPQGD FARFLRAKSE ERRESLERIF
NTRLFRDVET WLGAHAKRLE HEVRTANAQV RAVADRIAEV GGSRAPEPPE ELSAWAAELA
CVTAATARDA GRVAGEFTGA REEAQRALSE GRELRALQER LAAARGRRAQ LAEHTAWRAA
LDAQLDDAGR AESVLPFLRA RDNRRTELDK AELAVADHLA LVGGLPGFSA RSDGAVPREE
LGAAERERRD ELARLDALRA DAERRRSLGR SVTALTGRLD EIAAQLDRQR ALAASLPARV
EALTAELRRL HDQAGQEGAA ETALAAARRR LDAAAEYERL GRELSEAQER HREAVDGAQA
ARDRALDLRE RRITHMAAEL ASGLVDGEPC AVCGSVAHPS PARPSGEGLV SAEEERRAQA
AADAAASRRA EAESAAVALR ERRTAARERA EDATADAARQ EVEAHTAALA EARAAAGEAR
RVEEELERTT GDLERARTRE GELVRQEAQV GADRDNAVRE HGRLTALLDR ARGDDAGLDE
RIARLGGEAD LLRAAAEAAV NRDRAADELR AAVAEAERQR AEARFADEAG VWEAALDEER
RRALRERARS FDDALAAVEA SLADPELTAA GELPVPDLEE LARAARTASE AADRAVAWRS
MLEERARRLA SLRGELDLRL RDCEPALRRY AVAEGLRGLT AGTSSDNADN VRLSAYVLAA
RLEQVVAAAN DRLVTMSDGR YELRYTVDKA AGDGRARSAG GLGMRVVDAW TGVERDPATL
SGGETFFSSL ALALGLGDVA SAEAGGADID TLFVDEGFGT LDEDTLEEVL DVLDRLRDGG
RAVGVVSHVA DLRQRVSARL KVVKTAAGSR VEHTG