Gene Ndas_0355 details


Gene Information

Locus tag: Ndas_0355
Symbol:
ID: 9244190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 429931
End bp: 431226
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative cellulose-binding protein
Protein accession: YP_003678309
Protein GI: 297559335
COG category:
COG ID:
TIGRFAM ID:
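
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming a whole number of codons and one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


print(gene_length(429931, 431226))  # 1296
print(protein_length(1296))         # 431
```

Both results match the Gene Length (1296 bp) and Protein Length (431 aa) fields.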


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.647615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTCAA ACAACATCGA CACCCAGCTC AACAACATCT TCGACGAGGA CAACACCCCG 
CACGAATTCG ACGTGGTGCT GCGTGGGTAT GACCGCACCC AGGTGGATGA CTACGCGGCC
AGCCTGCGCA ACGAGCTCAA GCAGTACTCC AAGCAGGTCG AGAAGTTCAA GGCCGAGCTC
AACGCCAAGA ACCGCCAGCT CCAGGAGCGC GAGCGCCCCT CCTACTCCGG TCTGGGCTCG
CGTATCGAGG AGCTGCTGCG CCTGGCCGAG GAGCAGGCCA ACGAGCTGGT CCAGAGCGCG
CAGATCGACG CCAACGACAT CCGCTCGGCC GCCAAGATCG AGGCCGCCGA CATGCGCGCG
GCCGCCGAGT CCGAGGCCAC CGAGGTGCGC GCCCTCGCCC AGCGCGAGGC CGACGAGACC
CGTCAGACCG CCGAGTCCGA GGCGGAGGAG ATCTCCACCA CCGCCCGCCG CGAGGCCGAC
GAGCTCACCT CCACCACCGA GCGCGAGGTG CAGAAGAAGC GCTCCGCGGT CGACCACGAG
ATCGCCGAGA AGCGGGCGAC CTTCGAGGGC GAGATCGCCA AGCTGCGCAC CACCACCGAG
CGCGAGTGCG CCCAGGCCCG CGCCGCGGCC AAGCGCGAGC GCGACGAGAC CATCCAGTCG
GCCAAGAGCC AGGCCGAGGA GCTGCGCAAG AACGCCGAGC GCGCCTACGC CGAGTCCGAG
GCCCGGCGCA CCGAGGCCGA GGACCAGTTC GAGCTCCAGC TGGCCGACCG CCGCGCCGAG
GCCGAGCGCC AGGACGCCGA GCGCCTCGCC GCCGCCCAGG CCGCCACGCA GAAGATGGTC
AACGAGGCCG AGGAGCGGGC CGCCAGCGCC GAGCAGCGCG CCACCAAGGC GAGCCAGCAG
GCCGAGCAGA CCCGCCGCGA CGCCGAGAAC CACGCCAAGC AGCTGGTCGG CAACGCCAAG
AAGAACGCCG CCCAGATCGA GGCCGAGGCC AAGTCCAAGG CCGAGCACCA GCTCGGGGAC
GCCAAGTCCG AGGCCAACCG GATCATGACG GCCGCCAAGA AGGAGGTCGA CGAGCTCAAC
CGCCAGCGCG ACAGCATCCA GTCGCACCTC CAGCAGCTGC GCCAGCTGCT CGGCGGTGGC
GGCCCGGCCG CCCCGGCCCC GGTTCCCGCC GCGCCGGTCG CCCCCGCTCC GGCCCCGGCC
GCGATCCCGC AGGAGCCCGC TCCGGCCGCC GAGGAGACCC GGCAGCCGGT GCACTCCGGC
AAGGGCGCCG ACGACGAGGA CTGGTGGCAG GAGTAG
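
The gene sequence is printed in blocks of 10 bases, so it needs to be cleaned before any computation. A minimal sketch of how the GC content field could be recomputed from this listing, assuming GC content means the fraction of G and C bases over the full gene length (the record does not define it); `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the first two blocks of the gene sequence only.
first_blocks = "ATGCCGTCAA ACAACATCGA"
print(clean_sequence(first_blocks))  # ATGCCGTCAAACAACATCGA
print(gc_content(first_blocks))      # 0.45
```

Passing the full gene sequence to `gc_content` and rounding to a percentage should be comparable to the GC content field, provided the assumed definition matches the one the page uses.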
 
Protein sequence
MPSNNIDTQL NNIFDEDNTP HEFDVVLRGY DRTQVDDYAA SLRNELKQYS KQVEKFKAEL 
NAKNRQLQER ERPSYSGLGS RIEELLRLAE EQANELVQSA QIDANDIRSA AKIEAADMRA
AAESEATEVR ALAQREADET RQTAESEAEE ISTTARREAD ELTSTTEREV QKKRSAVDHE
IAEKRATFEG EIAKLRTTTE RECAQARAAA KRERDETIQS AKSQAEELRK NAERAYAESE
ARRTEAEDQF ELQLADRRAE AERQDAERLA AAQAATQKMV NEAEERAASA EQRATKASQQ
AEQTRRDAEN HAKQLVGNAK KNAAQIEAEA KSKAEHQLGD AKSEANRIMT AAKKEVDELN
RQRDSIQSHL QQLRQLLGGG GPAAPAPVPA APVAPAPAPA AIPQEPAPAA EETRQPVHSG
KGADDEDWWQ E
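
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). `translate` is an illustrative helper, not a tool used by the source page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last five codons of the gene sequence.
print(translate("ATGCCGTCAA ACAACATCGA C"))  # MPSNNID
print(translate("TGGTGGCAGG AGTAG"))         # WWQE
```

The two fragments reproduce the start (MPSNNID) and end (WWQE) of the listed protein sequence.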