Gene Ndas_2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2447 
Symbol 
ID9246297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2901007 
End bp2902440 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycoside hydrolase family 10 
Protein accessionYP_003680373 
Protein GI297561399 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGTCA CCAAGGGGCG AAGACGGCTC CGAGGAGGGG CCGCGGCCGC GGTCGCCCTC 
GCCCTCAGCC TGGTCACGGG CGGCCTGACC GCCGCCGCCC ACGCCGAGGA CACCGCGCTC
ACGGCGCAGG AGGCGGCCCG GGCCCAACAG GCCACCCTGC GCGAACTCGC CGACCAGCGC
GGGCTGCGCA TGGGCACCGC GGTCGCGCCG CAGCACCTGA ACCAGGCGGC CTACGCGCAG
ACCGCCGCGA CCGAGTTCAA CTCCGTCACC CACGAGAACG ACCTCAAGTG GGAGACCGTC
CAGCCCCAGC CCGGACAGTT CAACTGGACC AACGCCGACC GCATCGTGGA CTTCGCCCAG
CAGAACGACC AGCTGATCCA CGGCCACACG CTGGTGTGGC ACAGCCAGCT GCCCTCGTGG
GTGAGGGACG GCACCTTCAC CGAGGGCCAG CTGCTGGACG TCATGGACAC CCACATCAGC
ACCACCGTGG GGCGCTACCG CGACGACATC GCCACCTGGG ACGTGGTCAA CGAGCCCATC
GGGGACGACG CCCGGTTCCG CGACTCGGTC TTCTACCGGA CGCTGGGGGA GGACTTCATC
GCGGAGGCCT TCCGGATGGC CGACCGGGCC GACCCCGACG CCCGCCTGTT CATCAACGAC
TACAACATCG ACGGGATCAA CGCCAAGAGC GACGCCTACT ACGACCTGGT CCGGGACCTG
CTGGCGCAGG GCGTGCCGAT CGACGGCATC GGCTTCCAGG GCCACCTCAT CGCGGGGCAG
GTCCCCTCCA GCGTCCAGCA GAACATCCAG CGCTTCGTGG ACCTGGGCCT GGAGGTGATG
ATCACCGAGC TGGACATCCG CATCCAGCTG CCCGTCACCC AGCAGAAACT GGAGCAGCAG
GCCAGGGACT ACGAGCAGGT GGTCAACGCC TGCTACGCGG TCGACGGGTG CTCGGGTGTG
ATCGTGTGGG GCGTCACCGA CGCCCACTCC TGGGTTCCCG GTACCTTCCC GGGCCAGGGC
GCGGCGCTGC CCATCGACGA GAACTACGAG CCCAAGCCCG CCTACTGGTC GATCCACGAG
GCGCTCGGCG GCGACCCCGG CCCCGGCCCG GATCCTGATC CGACCGACCC GCCGTCGGGT
GAGTGTGAGG CGGTGTACTC GGTGACGAAC CAGTGGCAGG GGGGTTTCCA GGCGCAGGTG
ACGGTGACGG CGGGTGCGGC TCTGTCGGGG TGGACCGTGG AGTGGACCTT CGGTAGCGGT
GAGAGTGTCT CGCACGCCTG GAACGCGTCG GTGTCGGGCA GCGGTGGCAG TGTGACCGCG
ACGAACGTGG GTTACAACGG CGTGCTGGCG GCGGGGCAGT CGACCTCGTT CGGGTTCGTC
GGCAACTCCA GCGGCGGTGT CTCGACCCCC GAGCTGACGT GCAGCGCGTC CTGA
 
Protein sequence
MLVTKGRRRL RGGAAAAVAL ALSLVTGGLT AAAHAEDTAL TAQEAARAQQ ATLRELADQR 
GLRMGTAVAP QHLNQAAYAQ TAATEFNSVT HENDLKWETV QPQPGQFNWT NADRIVDFAQ
QNDQLIHGHT LVWHSQLPSW VRDGTFTEGQ LLDVMDTHIS TTVGRYRDDI ATWDVVNEPI
GDDARFRDSV FYRTLGEDFI AEAFRMADRA DPDARLFIND YNIDGINAKS DAYYDLVRDL
LAQGVPIDGI GFQGHLIAGQ VPSSVQQNIQ RFVDLGLEVM ITELDIRIQL PVTQQKLEQQ
ARDYEQVVNA CYAVDGCSGV IVWGVTDAHS WVPGTFPGQG AALPIDENYE PKPAYWSIHE
ALGGDPGPGP DPDPTDPPSG ECEAVYSVTN QWQGGFQAQV TVTAGAALSG WTVEWTFGSG
ESVSHAWNAS VSGSGGSVTA TNVGYNGVLA AGQSTSFGFV GNSSGGVSTP ELTCSAS