Gene Ndas_4565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4565 
Symbol 
ID9248446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5409324 
End bp5411195 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_003682458 
Protein GI297563484 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGG GCACCGCGAG CATCGGCGAC CACGGCTTCC TCTCGGACTG CCACACCGCC 
GCGCTCACCA CACCCGACGG CACCGTCGAC TGGCTGTGCG TTCCCCGGTT CGACGGGCCC
GCCCTCGTCT CGGGGATCCT GGACCCGCGC GGAGGCGGAT GGACCCTGGA GGTCGAGGGG
GCGAGCCCGG CCGGACGCGC CTACGTCGAC GACACGCTCG TCCTGGAGAC GCTCTGGCGG
GGTACGGACA CCGAGGTGGC CGTCCGCGAC CTGCTCGCGG TGCGAAGGCC GGAGGAGGGC
GGGGCGGGCC TGTACCGGCA GGGTCTCCTC CTGCGGGTCG TCGAGTGCCG CTCCGGGAGC
ACGTCCGTGC GCTCGCGGCT CGACGCCAGA CCCGACTTCG CGCGTGCCGA ACCCGTGTGG
GAGCGGGTGG ACGGCGGACT GCGCGAGGCC TCGGGGCCGA TGCTGTCGGG TTCGCCCGCG
CCCGCGCTCG CGCGGGACGG CGTGCCGGAG TACCGCGTGG AGCTGGCCGA GGGGGACACC
GCCGTGTTCG CCCTGGACTA CCTGGAGGGC GGGCGCCGCG TCGGACTGGG GGAGGGCCGG
GCGCTGCTGC GGGAGACCCT GGACGCCTGG CGGGAGTGGT CCGGCCGGAC CGACTACGAC
GGGGTCGGCG CCACGCACGT GCGCCGCAGC GCCCTCACCC TGCGCGGTCT GCTCCACGAG
GAGAGCGGCG CCCTGATCGC CGCACCCACC ACGTCCCTGC CCGAGTGGCC GGGAGGCCCG
CGCAACTGGG ACTACCGCTA CGTCTGGCAC CGCGACGCCG CGCTCGTCGT CCTGGCCTTC
CTTCGGCTCG GGCACGCCGA GGAGGCGGGG CACTACCTGC GCTTCCTGCT GCGCATGTGC
GGTCAGCCGA TCGACTGGGT CCCCCCGGTG CAGGCGGTCG ACGAACAGCC GCCGCCTGAG
GAGGAGACCC TGGACCACCT CGCCGGACAC GCCGGGTCCA GACCGGTCCG CGTCGGCAAC
GACGCCTACT CACAGCACCA GCTGGACGTG TACGGGCACG TGCTCGACGC CGCGCTGTCC
TACGAGGAGG CCACCGGCGG GCTCGGGCGC GGGGACGTCG AACAGCTCTC CTCGATGGTC
GACGCGGCGT GCCGGGTCTG GCGCGAGCCG GACGAGGGCA TGTGGGAGGT GCGGTCGCGG
CCGCGGCACT GGACGAGTTC CAAGGTCTAC GCCTGGGTGT GCCTGGACCG CGGGATCCAG
CTGGCCACCG AGTCCGGCAA GGCGGGCGGG GACGTCCCGC TGGACAAGTG GCGCAAGGAG
CTGGACGCCG TGCGCCAGGA GGTCCTGGAC CGGGGCTACG ACGCGGAGGC CGGGACCTTC
ACGCAGTCCT ACTGCTCGTC CCACGTGGAC GGGTCGCTGC TGAGGATCCC GCTCCTGGGC
TTCCTGGAGG GGACCGACCC GCGCGTGCTC GCGACCCTGG AACGGGTGGA CGCGGAGCTG
GGCGGGGAGG GCGGGCTCGT CCACAGGTAC GACCCCGGGA CGACCGACGA CGGACTGGGC
ACCCCGGAGG GCGCCTTCCT CCTCTGCTCC TTCGACATGG TCTCCGCCCT GGTGCTCGCC
GGGCGGACCG AGGAGGCGCG GCGGAGGTTC GAGGAACTGT GCGGGAGCTC GGGAGAGCTC
GGCCTGCACG CGGAGGAGAT GGCCGCCGAC GGCACCATGC TGGGCAACTT CCCCCAGGCC
TTCACCCACC TCGCGCTGAT CGAGGCGGCC GTCAACCTCG ACCAGGCGGG GGACGGGGAG
GCGCTGCACT CGTGGGTGCG CGACAGGTCG AGCGGCGCGA CACGACGAAG GAGGACTGGA
GCAGATGGCT GA
 
Protein sequence
MAQGTASIGD HGFLSDCHTA ALTTPDGTVD WLCVPRFDGP ALVSGILDPR GGGWTLEVEG 
ASPAGRAYVD DTLVLETLWR GTDTEVAVRD LLAVRRPEEG GAGLYRQGLL LRVVECRSGS
TSVRSRLDAR PDFARAEPVW ERVDGGLREA SGPMLSGSPA PALARDGVPE YRVELAEGDT
AVFALDYLEG GRRVGLGEGR ALLRETLDAW REWSGRTDYD GVGATHVRRS ALTLRGLLHE
ESGALIAAPT TSLPEWPGGP RNWDYRYVWH RDAALVVLAF LRLGHAEEAG HYLRFLLRMC
GQPIDWVPPV QAVDEQPPPE EETLDHLAGH AGSRPVRVGN DAYSQHQLDV YGHVLDAALS
YEEATGGLGR GDVEQLSSMV DAACRVWREP DEGMWEVRSR PRHWTSSKVY AWVCLDRGIQ
LATESGKAGG DVPLDKWRKE LDAVRQEVLD RGYDAEAGTF TQSYCSSHVD GSLLRIPLLG
FLEGTDPRVL ATLERVDAEL GGEGGLVHRY DPGTTDDGLG TPEGAFLLCS FDMVSALVLA
GRTEEARRRF EELCGSSGEL GLHAEEMAAD GTMLGNFPQA FTHLALIEAA VNLDQAGDGE
ALHSWVRDRS SGATRRRRTG ADG