Gene Ndas_5261 details

Gene Information

Locus tag: Ndas_5261
Symbol:
ID: 9249159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 425041
End bp: 427461
Gene Length: 2421 bp
Protein Length: 806 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_003683147
Protein GI: 297564174
COG category:
COG ID:
TIGRFAM ID:
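
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Illustrative consistency check for the fields in this record.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, excluding the terminal stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    length = gene_length(425041, 427461)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values listed above, both results agree with the Gene Length and Protein Length fields.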


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGGAC CCTCGGAACA CGTCGCGGAC CGGGAGGCCG CCGACGACGG TTGGCGCGAC 
CCCTCCCTCC CCGACGCGTC GCGCGTCGAG CGACTGCTGG GATCGATGAC CCTGGAGGAG
AAGGTCGCCC AGCTTCACGG TGTGTGGGTG AGCGCGGACG CCTCCGGCGA GGCCGTCGCA
CCGCACCAGC ACGACCTCAC CCAGGAACCC CCCGCCTGGG AGAAGGTCAT CGAGCACGGT
CTCGGCCAGC TCACCCGGCC CTTCGGCACC GCCCCGGTCG ACCCCGCCGC CGGCCGGGCG
TCGCTGGCCC GCAGCCAGCG GGAGATCATG GCCGCCAACC GGTTCGGCGT CCCGGCCCTG
GCCCACGAGG AGTGCCTGGC CGGGTTCGCC GCCTGGACCG CGACGATCTA CCCGGTCCCC
CTGGCCTGGG GCGCCAGCTT CGACCCCGAC CTGGTCGAGC GGATGGCCGC GCAGATCGGC
GCGAGCATGC GCCGGGTCGG CGTCCACCAG GGCCTGGCCC CCGTGCTGGA CGTGTCCCGC
GACCCGCGCT GGGGCCGCAC CGAGGAGACC ATCGGCGAGG ACCCCCACCT CGTGGCGACC
GTGGGCACCG CCTACGTGCG CGGCCTCCAG TCCGCCGGGA TCGTGGCCAC CCTCAAGCAC
TTCACGGGCT ACTCGGCCTC CCGCGGCGGC CGCAACCTCG CGCCGGTCTC GATCGGTCCG
CGCGAGTTCG CCGACGTCCT GCTGCCCCCC TTCGAGATGG CCGTGCGCGA CGGAGGCGCC
GGGTCGGTCA TGTCCGCCTA CAACGACAAC GACGGAGTGC CCGCCGCCGC GGACACGCGC
CTGCTCACCG GCCTGCTGCG GGACCAGTGG GGCTTCGAGG GCACCGTGGT GGCCGACTAC
TTCGGCGTCG CCTTCCTCCA GACCCTGCAC CGCGTCGCCG ACTCCGCCGA GCGGGCCGGC
GCCCTGGCCC TGACCGCGGG CGTGGACGTC GAACTGCCCA CCGTGCACTG CTACGGCGAC
CGGCTCACCG CCCTGGTCCG CTCGGGCGAG GTGCCCGAGG AGCTGGTCGA CCGGGCCGCC
CGGCGCGTGC TGACGCAGAA GTGCCAGCTG GGCCTGCTCG ACGCCGGCTG GTCCCCGGAG
CCCGAGGACC CCGCCGTCCC CGTGGACCTG GACCCCGCCG AACACCGCGC GCTGGCCCGC
GAGCTCGCCG AGCGCTCGGT GGTCCTGCTC TCCAACACCG ACGACGCGCT CCCCCTGGCC
GACACCGGGG ACCTGGCCCT GGTCGGCCCG CTCGCCGACA CCGCCGACGC CGTGCTCGGC
TGCTACTCCT TCCCCGCCCA CGTGGGCAGG CGCCACCCCG GCACCGCCGT CGGCGTGGAG
ATACCCACCC TGCTGGAGTC CCTGGGCGCC GAACTGCCCG GCGTCCGCGT CGAGCACCGC
GCCGGGTGCT CCGTGGACGG CGACTCCACC GAAGGCTTCG CCGAGGCCGT GTCGGCGGCC
GCACGCGCCC GGGTGTGCGT GGCCGTGGTG GGCGACCGCT CCGACCTGTT CGGCAGGGGT
ACCTCCGGCG AGGGCTGCGA CGTCGAGGAC CTGCGCCTGC CCGGGGTCCA GCAGGAGCTC
CTGGAGGCCC TGGCCGACAC CGGCACCCCC GTGGTCGCGG TGGTGGTGTC CGGACGGCCC
TACGCGCTGG GCCCGGTCGC CGACCGGCTG GCCGCCGTCG TCCAGGCCTT CCTGCCCGGC
GAGGAGGGCA TGCCCGCCGT GGCGGGGGTG CTCTCGGGCC GGGTCAACCC CAGCGGGCGC
CTGCCGGTGT CCGTGCCGCG TTCCTCCGGC GGCCAGCCCG TCACCTACCT CGGCCCCGAC
CTGGCCCACC GCAGCGAGGT CAGCTCGGTG GACCCGACCC CGCGCCACCC CTTCGGCCAC
GGCCTGTCCT ACACCCGGTT CGTCTGGGAG GACCCGCGCG TGGACGCGGG CGCCGTCCGC
CCGGAGGAGG CCACCCGGGT GGGCACCGAC GGCGAGGTCA CCGTCGGCTG CACCGTCCGC
AACGTCGGCG GCTCCGCCGG GACCGAGGTC GTCCAGCTCT ACCTGCACGA CCCCGTCGCC
CAGGTGGCCA GGCCCCGCAG ACAGCTGGTC GGCTACGCGC GGGTGCACCT GGAGTCCGGG
GAGGCGCGGG CGGTGGACTT CTCCGTCCAC GCCGACCTGG CCTCCTACAC CGGTCCGGAC
GGGCGGCGCG TCGTCGAGCC CGGCCGCCTG GAGCTGCTGC TGTCCGCGTC CAGCGAGGAC
GTCCGGCACA CCGTGCCGGT CCTGCTCACC GGGCCCACGC GCGTCGTGGA CCACACCCGG
CGCCTGGCCT GCGGGGTCCA GCTCGACCCC GTCGACCAGT CCGGCGGGGC CGACCGGGAG
GAGGTCGCCG CAGGCGGCTG A
 
Protein sequence
MTGPSEHVAD REAADDGWRD PSLPDASRVE RLLGSMTLEE KVAQLHGVWV SADASGEAVA 
PHQHDLTQEP PAWEKVIEHG LGQLTRPFGT APVDPAAGRA SLARSQREIM AANRFGVPAL
AHEECLAGFA AWTATIYPVP LAWGASFDPD LVERMAAQIG ASMRRVGVHQ GLAPVLDVSR
DPRWGRTEET IGEDPHLVAT VGTAYVRGLQ SAGIVATLKH FTGYSASRGG RNLAPVSIGP
REFADVLLPP FEMAVRDGGA GSVMSAYNDN DGVPAAADTR LLTGLLRDQW GFEGTVVADY
FGVAFLQTLH RVADSAERAG ALALTAGVDV ELPTVHCYGD RLTALVRSGE VPEELVDRAA
RRVLTQKCQL GLLDAGWSPE PEDPAVPVDL DPAEHRALAR ELAERSVVLL SNTDDALPLA
DTGDLALVGP LADTADAVLG CYSFPAHVGR RHPGTAVGVE IPTLLESLGA ELPGVRVEHR
AGCSVDGDST EGFAEAVSAA ARARVCVAVV GDRSDLFGRG TSGEGCDVED LRLPGVQQEL
LEALADTGTP VVAVVVSGRP YALGPVADRL AAVVQAFLPG EEGMPAVAGV LSGRVNPSGR
LPVSVPRSSG GQPVTYLGPD LAHRSEVSSV DPTPRHPFGH GLSYTRFVWE DPRVDAGAVR
PEEATRVGTD GEVTVGCTVR NVGGSAGTEV VQLYLHDPVA QVARPRRQLV GYARVHLESG
EARAVDFSVH ADLASYTGPD GRRVVEPGRL ELLLSASSED VRHTVPVLLT GPTRVVDHTR
RLACGVQLDP VDQSGGADRE EVAAGG