Gene Ndas_5408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5408 
Symbol 
ID9249311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp586806 
End bp588710 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content76% 
IMG OID 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003683293 
Protein GI297564320 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.521256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGGG ACCTGCTCGC CGCCGCCGTC GGCACCGTGC TGCTCCTGGA CGCCCTCCGG 
GTGTTCCTGC CGTCCCTGAT CACCGTGTTC GGCCAGGCCG GGACGACCGA TCCCGCGCTG
ATGGGCGCGT TCGCGCTCGC CTGGTTCCTG GCCCCGTTCC CCTTCGTCCC CCTCGCCCGC
CGGATCCCGC CCACGGCCAT GGCGGGCGCC GCCGTCGTGG TGATGGCGGC GGGGCGCCTG
GCGCTCCAGG CCACCGAGGG CGGCGACCCG CAGCTGTACG TCTCCGCGGC CACGGTGGGC
GCGGGGACCG TGTGGCTGGT GTGCACCGCG ATGGCGTCCG CCGTCGACGG ACGCCTGTGC
GGACGCGAGG TGGTGACCGG GGTCGTCGCC GCGATCGCGG CCACCGCGGT CGTACGCTCC
CTCTTCGGCG TGATCGACCT CGTGTGGTGG CCGACGCCCC TGGCGTGGGC GCCGGTCATC
GCGGAGGTCG GGCTCCTCCT GCTCCTGCTG CTCAACGTCC GGGACACCGC CCCGGGGCCC
GCCCGCCCCG TGCCGCCCCG GGCCTGGCTG GTCCTGGGCC CCCTGTTCTT CCTCAGCGGC
CTGTACACCG CCAATCCGGC GGTGGGGCAG ACCCTCGCGG GGTCACCTCT GGGCGCGGCG
GCCGTCGCCA CGGGCGCCGT GCTGTCGGTG GGCCTCGTCC GGCGTCCGCT CCTTCCGGGC
CGGAGCCGGT GGGCGGCCCC CGTCGTGCTC CTGGCCGCGC TGGCGTGGCT GGCCTGGGCG
TCCTCCGACG GCGTGACCCC GGAGGCGGCG GCCCTGCCGT CCGCGGCGGC GCTGGTCGCC
GGTCAGCTCG CCCTGGCCGC CTGCGCGGGC CGGGCCGTGT TCGCCCGGCC CGGGCGGGCG
TCGGCGGCGC GCAGCGGGCT GGCCGCCGCG TCGGGCCTGC TGGTGTTCGT GGTGCTGGTA
TTCGCGTTCT ACTCCGCCTA CGACCTGTAC GTGCCCAACG CGTACGTGCC CTTCCTCGCC
GCGGCCCTGC TGCTGCCCGC CGTGTCCTCC CCGGTTCTGG AGACCGGTGG GCCTCCGCGC
CCTCGCGCAC TCGCGCTGAC CGCGGGGACG GCCGCGCTCA CCCTGGCGGC CACCGCCCTC
TGGCCCGTGT TCGCGGCCCG CACGTTCGCA CCCGCACCCG CGAGCGGGAC CGAGGGGCTG
CGCGTGGCCG CCTACAACGT GCGGATGGGC TTCGGCATGG ACGGCCGGTT CTCCGTCACC
GAGCAGGCCG GGGCGCTGCG CCGCCTGGAC GCCGACGTCG TCGTCCTCAG CGAGGTGGAC
CGGGGCTGGC TGCTCAACGG AGGCAACGAC GTGCTGTCCC GGCTCGCGCT CGAACTCGGC
ATGGCCGCGC ACTGGGGACC GGCCGACGGG CCGCTGTGGG GCGACGCCGT GCTCACCTCC
CTGCCGGTCA CCCGTGAGCG GCGGCACCCG CTCACGCCGA GCGGTCCCAC CGGGGCCCAG
GCGCTGGAGG TGACCGTGGA CCACGGCGGT ACCGGGGTCA CGGTGGTCTC CACCCACGTG
CAGCCCGCCG ACCGCGGCTT CCGCGCGGAG TCCTCCCGGC GCCAGCTGCG CGAGATCGCG
GAGATCGCCC GCCGGGCGCG GGAGCGCGGA ACGCCCGTCG TGGTGGCGGG GGACCTCAAC
ATCGAACCGG ACGACCCCGC GTGGGGCCTG CTCACCGAGC ACGGACTGCT CGACGCGTTC
CGGAACACGC GTCCCTTCCC CACTCTGCCG GGGGAGACCG GCTCCGACCA GCAGATCGAC
CACGTCCTGC ACACCGGGGA CCTGGCGCCG AGCGATCCGG CCAACCCGGA CGTGCCGCAC
TCCGACCACC GGCCGGTGGC CGTCACGCTG ACCCCGGTCG CCTGA
 
Protein sequence
MRRDLLAAAV GTVLLLDALR VFLPSLITVF GQAGTTDPAL MGAFALAWFL APFPFVPLAR 
RIPPTAMAGA AVVVMAAGRL ALQATEGGDP QLYVSAATVG AGTVWLVCTA MASAVDGRLC
GREVVTGVVA AIAATAVVRS LFGVIDLVWW PTPLAWAPVI AEVGLLLLLL LNVRDTAPGP
ARPVPPRAWL VLGPLFFLSG LYTANPAVGQ TLAGSPLGAA AVATGAVLSV GLVRRPLLPG
RSRWAAPVVL LAALAWLAWA SSDGVTPEAA ALPSAAALVA GQLALAACAG RAVFARPGRA
SAARSGLAAA SGLLVFVVLV FAFYSAYDLY VPNAYVPFLA AALLLPAVSS PVLETGGPPR
PRALALTAGT AALTLAATAL WPVFAARTFA PAPASGTEGL RVAAYNVRMG FGMDGRFSVT
EQAGALRRLD ADVVVLSEVD RGWLLNGGND VLSRLALELG MAAHWGPADG PLWGDAVLTS
LPVTRERRHP LTPSGPTGAQ ALEVTVDHGG TGVTVVSTHV QPADRGFRAE SSRRQLREIA
EIARRARERG TPVVVAGDLN IEPDDPAWGL LTEHGLLDAF RNTRPFPTLP GETGSDQQID
HVLHTGDLAP SDPANPDVPH SDHRPVAVTL TPVA