Gene Ndas_0628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0628 
Symbol 
ID9244470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp773061 
End bp774218 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content74% 
IMG OID 
Productnuclease SbcCD, D subunit 
Protein accessionYP_003678580 
Protein GI297559606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.013566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.458746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGTG CCATGCGCCT GCTGCACACC TCGGACTGGC ACCTGGGCCG CTCCTTCCAC 
CGGGAGAACC TCATCGACGC CCAGGCGGCC TTCCTGGACC ACCTCGTCGA CACCATCCGC
GAACACCGGG TCGACGTGGT GGTCGTGGCC GGTGACCTCT ACGACCGCGC CCTGCCGTCG
GTGGACGCGG TCCGGCTCTT CGACCGCGCC CTGGGCCGGA TCCGCGAGAC CGGGGCCCGG
GCCGTCCTCA TCAGCGGCAA CCACGACTCC ATGGCGCGGA TGTCCTTCGC CACCGGCCTC
ATCGACGCCT CCGGCGTCCA CCTGCGCAGC TCCCTGGACG GCGTCGGCAC GCCCGTGGTC
ATCGAGGACG AACACGGACC CGTGGCCTTC TACGGCATCC CCTACCTGGA ACCGGAGATC
GCGCGCCACC ACTGGGACCT GCCCGAGCGG GGGCACGCGG CGGCGCTCGG GCACGCCATG
GACCTGGTGC GCGCCGACCT CGCCGAGCGG CCCGGCACCC GGTCGGTGGT CCTGTCGCAC
GCGTTCGTGA CCGGCGGCGA GCCCTCCGAC AGCGAACGCG ACATCTCGGT GGGCGGTGCC
TCGCACGTGC CGGTCCCGGT CTACGACGGG GTCGACTACG TGGCCCTGGG CCACCTGCAC
GGGCGGCAGA CCATCACCCC GTCCGTGCGC TACTCGGGCA GCCCGCTGGC CTACTCCTTC
TCCGAGGAGC ACCACGTCAA GGGCTACTGG CTGGTGGACC TGGACGCCGA CGGCCTCGCG
GGCGCCGAGT TCGCCGCGGC GCCCGTCCCC CGCCCCCTGG CCCGTATCCG CGGCCGGATC
GAGGACCTGC TCACCCGTCC CGAGTGGGAG TCCCGCACCG GCCACTGGCT CCAGATCACC
CTGACCGACC CGCGCCGCCC CGCCCACCCC ATGGACCGGC TGCGCGAACG CTTCCCCCAC
GTCCTGGTCC TGGACTTCGC CCCCGAGGGC GGTGCCCCCG ACACCCGTCC GATCGCCGCC
TCCGTCGCCG ACCGCAGCGA GCGCCAGGTG GTCGGCGACT TCGTGGAGTG GGCCCGCGGC
ACCCCCGCCA CCCCCGAGGA GGAGGCCCTC GTGAACACCG CCGTCGAGCA GGTCCGGCTC
CAGGAGGGGA CGCGCTGA
 
Protein sequence
MSSAMRLLHT SDWHLGRSFH RENLIDAQAA FLDHLVDTIR EHRVDVVVVA GDLYDRALPS 
VDAVRLFDRA LGRIRETGAR AVLISGNHDS MARMSFATGL IDASGVHLRS SLDGVGTPVV
IEDEHGPVAF YGIPYLEPEI ARHHWDLPER GHAAALGHAM DLVRADLAER PGTRSVVLSH
AFVTGGEPSD SERDISVGGA SHVPVPVYDG VDYVALGHLH GRQTITPSVR YSGSPLAYSF
SEEHHVKGYW LVDLDADGLA GAEFAAAPVP RPLARIRGRI EDLLTRPEWE SRTGHWLQIT
LTDPRRPAHP MDRLRERFPH VLVLDFAPEG GAPDTRPIAA SVADRSERQV VGDFVEWARG
TPATPEEEAL VNTAVEQVRL QEGTR