Gene Ndas_0360 details


Gene Information

Locus tag: Ndas_0360
Symbol:
ID: 9244195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 437530
End bp: 439488
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: glycoside hydrolase family 18
Protein accession: YP_003678314
Protein GI: 297559340
COG category:
COG ID:
TIGRFAM ID:
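
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(437530, 439488)
print(length_bp)                  # 1959
print(protein_length(length_bp))  # 652
```

Both values agree with the Gene Length and Protein Length fields of this record.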


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAGCAC GACTTCGCCA ACGAATCGCG GCACTGGCCG CCGCGGTCGT CCTACCCCTC 
GCACTGGCAC CCGTCCCGGC CGCGTCGGCC GACACCGCGG GCGTCACCGT CACCTACGTG
GAGACCAGTC GCTGGGAGAC CGGCTACGGC GGGCAGCTGA CCATCGCCAA CGGCTCGGGG
TCGGCGCTGA CCGACTGGAG CATCGGCTTC CGGCTCCCGT CCGGCACCGC CATCACCAGC
CTGTGGAACG CCACCCTCAG CCGCTCCGGC GACGCCTACA CCGTCACCCC GCCCTCGTGG
GGCGCCTCCG TCCCGGCGGG CGGCAGCTAC TCCATCGGGT TCAACGGCAC CCACGGCGGC
GGCGACACCG CTCCCGTGGA CTGCACGGTC AACGGCGGCG GCTGCTCGGG CGAGCCCGGC
GAGGAGGACA CCGAGCCTCC CACCGCGCCG ACCGGCCTGA CCGTCACCGG CACCACCTCG
ACCACCGTGG CCCTGCAGTG GGGCCCCGCG GACGACAACG CCGGGGTCGC GGGCTACGAG
GTCCTCTCCG GCGGCGAGGT CGTCCGCGCG GTCACCGGCA CCACCGCCAC CGTCACCGGG
CTGGCGCCCC AGACCGAGCA CACGTTCACC GTGCGCGCCT ACGACACCTC CAACAACAGG
GGTCCCGAGA GCGGCGCCGT CACCGCCACC ACCGACGCGG ACGGCGGCGG CCCCACCGAC
CCGCCCCAGG AGCGCCGGGT CGCCTACTTC ACCCAGTGGG GCATCTACGG CCGCGACTAC
CTGGTGAACG ACCTGGTCAC CTCGGGCACC GCCGAGAAGC TCACCCACAT CAACTACGCC
TTCGGCAACA TCAACGCGAA CGGCGAGTGC TTCATGGCCA ACCAGCTCGG CCAGGGCGAC
GCCTGGGCCG ACTACGGCCG CTCCTTCGGG GCCGCCGACA GCGTCGACGG GGTCGGCGAC
ACGTGGGACC AGGACCTGCG CGGCAACTTC AACCAGCTGC GCGAGCTCAA GGAGATGTAC
CCCGACCTCA AGGTCAACAT CTCCCTGGGC GGCTGGACCT GGTCCGAGCA CTTCTCCGAC
GCGGCGCTGA CCGCCGAGTC GCGTGAGCGC ATGGTCTCCT CCTGCATCGA CCAGTTCCTG
CGCGGCAACC TGCCCGTGTT CGACGGCGCG GGCGGCCCCG GCTCCGCCTA CGGCGTCTTC
GACGGCATCG ACCTGGACTG GGAGTGGCCG GGATCGGCGG GCCACGAGCA CAACACCGTC
CGCCCCGAGG ACAAGGAGAA CTTCACCGCC CTGGTGCAGG AGTTCCGCGA CCAGCTGGAC
GCCCTGGAGG CCGAGACGAG CCGCCAGTAC GAGCTGACCG CGTTCCTGCC CGCAGACCCG
GAGAAGGTCG AGCTCGGCTT CGAGATGCCG CAGCTCATGA CCGACTTCGA CTTCATCACG
GTGCAGGGCT ACGACTACCA CGGCGGTTGG GAGACCACCG CCAACCACCA GTCTAACCTG
CTCCTGGACC CGGCCGACCC CGGCCCGGAC CTGTACTCCA CCGAGACCAC GGTCCAGGCC
TACCTCGACC GCGGCGTCGA CCCCGCCGAC ATGGTGCTCG GCGTGCCGTT CTACGGCCGC
GGCTGGACCG GTGTGGAGCC CGGTCCGAAC GGCGACGGTC TCTTCCAGAG CGCTACCGGT
CCCGCCCCCG GTAGCTACGA GGCGGGGATC GACGACTGGA AGGTCCTGAA GGACCTGGTG
GGCACCGGCG GCTACGAGCT GTACCGCGAC GACGCGCTGG GCACCGCCTG GCTGTACAAC
GGCAGCACCT TCTGGACCTA CGACGACGAG ATCTCCATGG CCCAGAAGAC CGACTGGGCC
CAGGCCCAGG GCCTGGGCGG CGTCATGATC TGGTCCGTTG ACGGCGACGA CGCCAACGGC
AGCCTCATGA ACGCCATCGA CACGGCGCTG GCCGGGTAG
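
The gene sequence above is printed in space-separated blocks of 10 bases. The sketch below shows one way to join such a listing into a continuous string, compute its GC fraction, and check that it starts and ends like a coding sequence. The function names are hypothetical, and the start-codon set is an assumed subset of the starts permitted by translation table 11 (which include GTG, the first codon of this gene, read as an initiator methionine).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def has_start_and_stop(
    seq: str,
    starts=frozenset({"ATG", "GTG", "TTG"}),  # assumed subset of table 11 starts
    stops=frozenset({"TAA", "TAG", "TGA"}),
) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Usage on the first and last listed lines only (not the full gene):
first_line = clean_sequence(
    "GTGAGAGCAC GACTTCGCCA ACGAATCGCG GCACTGGCCG CCGCGGTCGT CCTACCCCTC"
)
last_line = clean_sequence("AGCCTCATGA ACGCCATCGA CACGGCGCTG GCCGGGTAG")
print(first_line[:3], last_line[-3:])  # GTG TAG
```

Applied to the full cleaned gene sequence, `gc_fraction` can be compared against the GC content field of the record.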
 
Protein sequence
MRARLRQRIA ALAAAVVLPL ALAPVPAASA DTAGVTVTYV ETSRWETGYG GQLTIANGSG 
SALTDWSIGF RLPSGTAITS LWNATLSRSG DAYTVTPPSW GASVPAGGSY SIGFNGTHGG
GDTAPVDCTV NGGGCSGEPG EEDTEPPTAP TGLTVTGTTS TTVALQWGPA DDNAGVAGYE
VLSGGEVVRA VTGTTATVTG LAPQTEHTFT VRAYDTSNNR GPESGAVTAT TDADGGGPTD
PPQERRVAYF TQWGIYGRDY LVNDLVTSGT AEKLTHINYA FGNINANGEC FMANQLGQGD
AWADYGRSFG AADSVDGVGD TWDQDLRGNF NQLRELKEMY PDLKVNISLG GWTWSEHFSD
AALTAESRER MVSSCIDQFL RGNLPVFDGA GGPGSAYGVF DGIDLDWEWP GSAGHEHNTV
RPEDKENFTA LVQEFRDQLD ALEAETSRQY ELTAFLPADP EKVELGFEMP QLMTDFDFIT
VQGYDYHGGW ETTANHQSNL LLDPADPGPD LYSTETTVQA YLDRGVDPAD MVLGVPFYGR
GWTGVEPGPN GDGLFQSATG PAPGSYEAGI DDWKVLKDLV GTGGYELYRD DALGTAWLYN
GSTFWTYDDE ISMAQKTDWA QAQGLGGVMI WSVDGDDANG SLMNAIDTAL AG