Gene Ndas_5373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5373 
Symbol 
ID9249276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp552843 
End bp555203 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_003683259 
Protein GI297564286 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.534575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTCA CCCAAGCGGT CGAAGCGGTT CCGGGCATGC CGGGGGACAC CGCTCCCGCG 
CTCATGCCCG TGGACGTCCG GCAACGCGTC TCCGCCCTCG CCAGAACCTC ACGGCTCCTG
GTGGCCTGCG ACTACGACGG CGCCCTCTCC ACGGTCGACC CTGCCGACCG GCGTCCGCTG
CCCGAGGCCC TGCGCGCGCT GCGCGACCTG GCCGACCTGC CGGGCACCGT GTGCGCGGTC
ATCTCCGGCC GCCCCCTGCG CGACCTGGCC GCGCTCTCGC GCCTGCCCGC CGAGGTGCGC
CTCGTGGGCG CCCACGGCAC CGAGTTCGAC ACCGACGTGA CCGTCGACCC CGACCACCGC
AACGCCGACT CGCCCTCGGA CAAGGCGGCC GCCCTGGAGC TGCTGCGCGA CCAGGTCGAG
GCCACCGCCA TCCTCTACAT CGGCGGCGGC GAGGGCGAGG AGCCCGTCTT CATGCGGCTG
ACCGGCGCCG ACGCGGGCGT GCGGGTCGGC GAGGGGCCCA CCGTGGCCTC CCACCGCGTG
GCCGACACCC CGACCGCCGC GGCCCTGCTG TCCGCGCTGG CAGCCGAGCG CCGCTTCTGG
GTCTTCGGCG AGCGCCCCAC GCCCATCGAG CGCATGTCGA TGCTCTCCAA CCAGGGCTCC
GTGGCCCTGG TCGGCCCGGA CGCGCGCCTG CTGTGGTTCT GCCACCCCGA GCCCGACTCG
AACGCCGTGT TCGCCGAGGT CCTGGGCGGC CGCCAGGCGG GCGTGTTCGC CGTCGCCCCC
GCCCACGGAG GCCGTCCGCT GGGCCAGCGC TACCTGCCCG GCACCATGAC CGTGCGCACC
CGCTGGTCGC GCATGGACGT CACCGACTAC CTCGCACACG GCACCCCCCA GGGGCGCACC
GACCTGATCC GGGTGATCAG CGGCGTCACC CCGGCCACGG TGGAGTTCGC CCCCCGTCCC
GACTTCGGCC GCGCGCCGGT GCGCATCACG CCGCAGGAGA ACGGCCTGCT GGTCGAGGGC
GCCGACTTCC CGATGGCCCT GTACTCGCCC GGCGTGGAGT GGGAGATCGA CCACGACGGC
GTCTCCGACC TCGCCCGCGC GGTCGTCCAC CCGCGCTCGG AGCAGCCCGT GGTGCTGGAG
CTGCGCTGCG GCACCGACTC CCTCGTGCAC GGCACGCTGC CCGAGTCCGA GCGTCAGCGG
ACCTCCGGCG AACACTGGTC GCGCTGGCTC GACGGGCTCA CCCTGCCCGA CACCGCCCAG
CAGCTGGCGG CGCGCTCCGC GCTCACCCTG CGCGGGCTGT GCACCCCCTC CGGGGGCGTG
ATGGCCGCCG CGACCACGTC GCTGCCCGAG GAGATCGGCG GCGTACGCAA CTGGGACTAC
CGGTACTGCT GGCTGCGCGA CGGCGCCCTG ACCGTGCAGT CGCTGGTCTC CCTGGGCTCG
ACCGCCGAGG CCGAGGAGTT CCTGGACTGG GTGCACCGGG TCGTGGACTC GCTGCCCGGC
CCGGAGATGC TGCGCCCGCT GTACTCGCTG CGGGGCACGA ACCTGGGCCC GGAGGAGGTC
ATCGAGTCGC TGTCCGGGTA CGCGGGCTCG CGGCCGGTGC GGATCGGCAA CCTGGCCGAC
CACCAGGTGC AGCTGGACGT GTTCGGCCCC GTCGTGGAGC TGATCGAGAA GCTGTCCTCG
GTGCGCGGCA CCCTGGCCGA CCGCGACTGG GACCTGGTGC GGTCGATGGC CGAGGCGGTC
GCCCGCCGCT GGCACGAGCC CGACCACGGC ATCTGGGAGG AGCGCGACGA GCCCCGCCAG
CGCGTCTACT CCAAGGTGAT GTGCTGGGTG ACCCTGGACC GCGCCGTCTC CCTGGCCCGT
GCCTACGGCC GCGAGGTGGA CCCGTCCTGG CCGGACCTGC GCGACGGCAT CGCCGCCGAG
GTGCTGGACA AGGGCTGGAA CGAGGAGGCG CAGGCCTTCA CCACCGCCTA CGACGGCACC
GACCTGGACG CCGCGTCCCT GCACATCGGC CTGTCCGGGC TGATCGACCC GGCCGACGAG
CGCTTCCAGG CCACCGTGAC CGCCGTGGAG GCGGAGCTGC GCAGCGGCCC GACCGTGTAC
CGGTACCACC GCGACGACGG CCTGCCGGGC GGCGAGGGCG GCTTCCACCT GTGCACCGCG
TGGCTGATCG AGGCGTACCT GCTGACCGGC CGCCGCGCGG AGGCGGACGA GCTGTTCAAG
CACCTGGTGG ACTGCGCCGG ACCGACCGGG CTCATCCCGG AGGAGTTCGA CCCGGTCACC
GAGCGGGCGC TGGGCAACCA CCCGCAGGCG TACTCGCACC TGGGCCTGAT CCGCTGCGCC
CAGCTGCTCG ACCGCCTCTG A
 
Protein sequence
MTVTQAVEAV PGMPGDTAPA LMPVDVRQRV SALARTSRLL VACDYDGALS TVDPADRRPL 
PEALRALRDL ADLPGTVCAV ISGRPLRDLA ALSRLPAEVR LVGAHGTEFD TDVTVDPDHR
NADSPSDKAA ALELLRDQVE ATAILYIGGG EGEEPVFMRL TGADAGVRVG EGPTVASHRV
ADTPTAAALL SALAAERRFW VFGERPTPIE RMSMLSNQGS VALVGPDARL LWFCHPEPDS
NAVFAEVLGG RQAGVFAVAP AHGGRPLGQR YLPGTMTVRT RWSRMDVTDY LAHGTPQGRT
DLIRVISGVT PATVEFAPRP DFGRAPVRIT PQENGLLVEG ADFPMALYSP GVEWEIDHDG
VSDLARAVVH PRSEQPVVLE LRCGTDSLVH GTLPESERQR TSGEHWSRWL DGLTLPDTAQ
QLAARSALTL RGLCTPSGGV MAAATTSLPE EIGGVRNWDY RYCWLRDGAL TVQSLVSLGS
TAEAEEFLDW VHRVVDSLPG PEMLRPLYSL RGTNLGPEEV IESLSGYAGS RPVRIGNLAD
HQVQLDVFGP VVELIEKLSS VRGTLADRDW DLVRSMAEAV ARRWHEPDHG IWEERDEPRQ
RVYSKVMCWV TLDRAVSLAR AYGREVDPSW PDLRDGIAAE VLDKGWNEEA QAFTTAYDGT
DLDAASLHIG LSGLIDPADE RFQATVTAVE AELRSGPTVY RYHRDDGLPG GEGGFHLCTA
WLIEAYLLTG RRAEADELFK HLVDCAGPTG LIPEEFDPVT ERALGNHPQA YSHLGLIRCA
QLLDRL