Gene Ndas_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0923 
Symbol 
ID9244768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1129337 
End bp1131577 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content73% 
IMG OID 
Productcellulose-binding family II 
Protein accessionYP_003678873 
Protein GI297559899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0984309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCA ACCCCCCTGC CGCCCGTCAT CGGACACGGC GCGGCGCCTG CCTCGCGACC 
GCACTCGCCA CGGGCGCCGC GCTCGCCGCC GTGGCGACCG GCCCCGCCTC CGCCGCCGCC
GGCTGCGAGG TCGCCTACAC CGTCCAGAAC CAGTGGAACG ACGGGTTCAC CGCCGCCGTG
TCCGTCACCA ACCTCGGTGA TCCCGTCGAC GGCTGGACCC TGGAGTGGGA CTTCACCGCG
GGCGAGCGCG TCACAAGCGG CTGGAACGGC GAGTTCTCCG GCTCGGGGAA CCGTGTCAGC
GTGGCCGACA CCGGGTGGAA CGCCGCCATC GCCACCGGAG CCTCGGTCAG CTTCGGCTTC
AACGGCTCCC ACGGCGGCAC CCCGGAAGTC CCCGACTCCT TCACGCTCAA CGGCACCGTC
TGCACCGGCG GCACCGAGGA ACCGCCCGGT GAGGAGCCGC CCGACCCCGA GGACCCGCCC
ACCACCGGAG AGGGGCGCCA GGTCGAGGAG CTGGACCGCG GTCTGGTGAG CGTGCAGGGC
TCGCAGGGCA ACCTGGTGAG CTGGCGGCTG CTGGCCACCG ACCCCGAGGA CGTCGCCTTC
AACGTCTACC GGGGCTCCAC ACGCCTGAAC TCACAGCCCC TCACCGGAGC CACCTCCTAC
CTGGACGGCG GAGCCTCCGC CGACGCCCGC TACACGGTCC GCCCCGTGAT CGACGGCCAG
GAGGCGGAGG CCTCGGGGGC CTCACTGGCG CTGCGCGCCG GCTACCTGGA CGTGCCCCTG
GACGTCCCGT CCGGCGGCAG CGGCTACACC TACGACGCCA ACGACGCCAG CGTGGGCGAC
CTCGACGGCG ACGGCGAGTA CGAGATCGTC CTGAAGTGGG AGCCCACCAA CGCCAGGGAC
AACTCCCAGT CGGGCCGCAC CGGCCCCGTA CTGCTCGACG CCTACGAACT CAGCGGTGAG
CGGCTCTGGC GCATCGACCT GGGACGCAAC ATCCGCGCCG GTGCGCACTA CACCCAGTTC
CAGGTCTACG ACTTCGACGG AGACGGGCGG GCCGAGGTGG CCGTGAAGAC GGCGGACGGC
ACCGTGGACG GCACCGGCGG GGTCATCGGC GACGCCGGGG CCGACCACCG CAACGGCGAC
GGCTACGTCC TGAGCGGGCC GGAGTTCCTC ACCATGTTCG ACGGCCGCAG CGGGGCCGAA
CTGGACTCGG TGGACTACGT TCCCGGGCGC GGTGACGTCT GCGACTGGGG CGACTGCTAC
GGCAACCGGG TGGACCGGTT CCTGGCGGGC GTCGCCTACC TGGACGGCCA GCGGCCCAGC
CTGGTCATGG CGCGCGGCTA CTACACGCGC TCGGTGATCG CGGCCTGGGA CTGGCGCGAC
GGGCGCTTCA CCCGGCGCTG GACCTTCGAC AGCGACGAGG CCGGGTCGCG GTGGGCCGGT
CAGGGCAACC ACCAGTTGAG CATCGCCGAC GCCGACGGTG ACGGCCGCGA CGAGGTCATG
TACGGGTCGA TGGCCGTGGA CGACGACGGA AGCGGGATGT GGGCCACCGG CCACGGCCAC
GGGGACGCCA TGCACGTGGG CGACCTGCTG CCCGACCGCT CCGGCCTGGA GGTCTTCACC
ATCACCGAGC GCACGGGAGG CGCGGCGGGC GCCTACGTGG CCGACGCCGA CACCGGCGGG
GTGGTCTGGC GCAGGCCGAC CGCCAGCGGT GAGGAGGGGC CGGGCCGGGG CGTGGCCGCG
GACGTGTGGC CGGGCAACCC CGGCGCGGAG TTCTGGGTCA CGGGCGGCGG TATCAGCGGC
ATGTACGACG CCGGGGGGAA CCGCCTGGGC GTTCGCGCCC CCTCGTCGGC GAACTTCGTG
GCCTGGTGGG ACGGCGACCC GCTGCGGGAA CTGCTGGACG GCACCCGTAT CGACGAGTAC
GGGACCGGTG GCGACACCCG GCTGCTGACC GGCTCCGGGG TCGCCTCCAA CAACGGCACC
AAGTCCACAC CGGCGCTGAG CGGGGACATC CTCGGTGACT GGCGTGAGGA GGTCGTCTGG
CGCACCGCGG ACAACCGGGC GCTGCGGATC TACTCCACGC CGCTGCCCAC CGATCTGCGG
ATGCCGACGC TGATGCACGA CACCCAGTAC CGGGTGGCCG TCGCCTGGCA GAACACCGCC
TACAACCAGC CGCCGCACCC GAGCTTCTTC ATCGGGGACG GCATGGAGCC CGCTCCCCGG
CCGGAGGTGT ACACGCCCTG A
 
Protein sequence
MPPNPPAARH RTRRGACLAT ALATGAALAA VATGPASAAA GCEVAYTVQN QWNDGFTAAV 
SVTNLGDPVD GWTLEWDFTA GERVTSGWNG EFSGSGNRVS VADTGWNAAI ATGASVSFGF
NGSHGGTPEV PDSFTLNGTV CTGGTEEPPG EEPPDPEDPP TTGEGRQVEE LDRGLVSVQG
SQGNLVSWRL LATDPEDVAF NVYRGSTRLN SQPLTGATSY LDGGASADAR YTVRPVIDGQ
EAEASGASLA LRAGYLDVPL DVPSGGSGYT YDANDASVGD LDGDGEYEIV LKWEPTNARD
NSQSGRTGPV LLDAYELSGE RLWRIDLGRN IRAGAHYTQF QVYDFDGDGR AEVAVKTADG
TVDGTGGVIG DAGADHRNGD GYVLSGPEFL TMFDGRSGAE LDSVDYVPGR GDVCDWGDCY
GNRVDRFLAG VAYLDGQRPS LVMARGYYTR SVIAAWDWRD GRFTRRWTFD SDEAGSRWAG
QGNHQLSIAD ADGDGRDEVM YGSMAVDDDG SGMWATGHGH GDAMHVGDLL PDRSGLEVFT
ITERTGGAAG AYVADADTGG VVWRRPTASG EEGPGRGVAA DVWPGNPGAE FWVTGGGISG
MYDAGGNRLG VRAPSSANFV AWWDGDPLRE LLDGTRIDEY GTGGDTRLLT GSGVASNNGT
KSTPALSGDI LGDWREEVVW RTADNRALRI YSTPLPTDLR MPTLMHDTQY RVAVAWQNTA
YNQPPHPSFF IGDGMEPAPR PEVYTP