Gene Ndas_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1368 
Symbol 
ID9245218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1676889 
End bp1678169 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content73% 
IMG OID 
ProductCellulase 
Protein accessionYP_003679306 
Protein GI297560332 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.828638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCAAC CCCCCGCCCC CGCCCGACGG CGTCCCGCCC CCGTCCGCGC CGCCCTCGCC 
GGCGCGCTCC TGCTCGCACC CCTCGCGCTC GTCGCCGCGC AGGGCACGGC CGCGGCCGCC
CCCGCCTCCT CGGCGGCCGA GGCGCCGGTC CGGGCCGCCG CCAGCCCCGT CGAGACCCAC
GGCCAGCTGC GGGTCTGCGG CCTCAAACTC TGCGACGAGA GCGGAGAGAC TGTCCAGCTC
ACCGGGATGA GCTCCCACGG CCTGCAGTGG TACAGCGACT GCCTCACCGA CGGCTCCCTG
GACGCGCTCG CCCACGACTG GAACGCCGAC GTCCTGCGCG TGTCCATGTA CATCCAGGAG
GGCGGCTACG AGACCGACCC GCGCGGCTTC ACCGACCGGG TCCACGAACT GATCGAGGAG
GGCACCGCAC GCGGCATGTA CGTCATCGTC GACTGGCACA TGCTCACGCC CGGCGACCCG
AACCACAACA CCGACATGGC GCGCACCTTC TTCTCCGAGA TCGCCTCCGT CCACGCCGGC
AAGGACAACG TCCTGTACGA GATCGCCAAC GAGCCCCACG GGGTCTCCTG GGACGCGATC
CGCGGCTACG CGGAGGAGAT CATCCCGGTC GTGCGCGCCG AGGACCCCGA GGCCGTCGTG
CTCGTCGGCA CCCGCGGCTG GTCCTCCCTC GGCCTGTCCG AGGGCTCCGA CCACACCGAG
ATCACCGCCG ACCCGGTGGA CGCCGACAAC ATCATGTACG TCTTCCACTT CTACGCCGCC
ACGCACGGCG ACTTCCACCG GGAGGGCCTG CGGGCCGCGG CGCGGGACCT GCCGCTGTTC
GTCACCGAGT TCGGCACCCA GGAGGCCACC GGCGACGGGC CCAACGACTT CGCATCGGCC
CAGGCCTACC TCGACCTCCT GGCCGAGGAG CAGATCAGCT GGGTGAACTG GAACTTCTCC
GACGACTTCC GCTCCGGCGC GGTGTTCGAG ACGGGCACCT GCGCCGCCGA CGGCCCCTGG
ACCGGAACCG AGTCGCTCAA GCCCGCGGGG GAGTGGATCC GCGACCGCAT CCGCGAGAGC
GGGGTCGTCC CGACCGACCC CACCGATCCC ACCGACCCGG ACGCGTGCGA GACCCCGGCC
TGGTCCTCGG GAGAGGTGTA CACCGGCGGC GACCAGGTCA GCCACGGCGG GCGCCTGTAC
CGGGCCCAGT GGTGGACACG GGGGGAGGAG CCCGGGACCA CGGGTGAGTG GGGCGTCTGG
CGGGACCTCG GCGCCTGCTG A
 
Protein sequence
MAQPPAPARR RPAPVRAALA GALLLAPLAL VAAQGTAAAA PASSAAEAPV RAAASPVETH 
GQLRVCGLKL CDESGETVQL TGMSSHGLQW YSDCLTDGSL DALAHDWNAD VLRVSMYIQE
GGYETDPRGF TDRVHELIEE GTARGMYVIV DWHMLTPGDP NHNTDMARTF FSEIASVHAG
KDNVLYEIAN EPHGVSWDAI RGYAEEIIPV VRAEDPEAVV LVGTRGWSSL GLSEGSDHTE
ITADPVDADN IMYVFHFYAA THGDFHREGL RAAARDLPLF VTEFGTQEAT GDGPNDFASA
QAYLDLLAEE QISWVNWNFS DDFRSGAVFE TGTCAADGPW TGTESLKPAG EWIRDRIRES
GVVPTDPTDP TDPDACETPA WSSGEVYTGG DQVSHGGRLY RAQWWTRGEE PGTTGEWGVW
RDLGAC