Gene Ndas_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1406 
Symbol 
ID9245256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1722970 
End bp1724775 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content71% 
IMG OID 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_003679344 
Protein GI297560370 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.334429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTGA GCCGCGTGCC AGGCTGGATC GAGGACTACG CAATGATCGG CGACATGCAG 
ACCGCCGCGC TGGTCGGGCG CGACGGGTCC ATCGACTGGG CGTGCCTTCC CGACTTCGAC
TCCTCGGCCT GTTTCGCCGC GCTGCTGGGC GACGAGCAGA ACGGCTGCTG GACCCTGCGC
CCCGCCGAGG GCGAACCGCG CGCCACCCGC CGCCGCTACC GGGGCGACAC GCTCATCCTG
GAGTCGGAGT GGGACACCCC CTCCGGGTCG GTCCGGGTCA TCGACTTCAT GCCGCCGCGC
GGCGGCGCCC CGCACATCGT GCGCATCGTC GAGGGGCTGA GCGGCTCCGT GCGCATGGAG
ACGACCATGC GCATCCGCTT CGACTACGGC CACGTCGTGC CGTGGGTGCA CCGGACCGGG
GCCGAACTGG TGGCCATCGC CGGACCCGAC GCCATCTGGC TCAGCACCCC CATCTCCCTC
CAGGGCCACA ACTTCACCCA CGACGCCACC TTCACCGTCA CCGCGGGCCA GCGGGTGCCC
TTCGTGATGA CCTGGCACCC CTCCCAGGTG GAGGAGTCCG ACCACCTGGA CGCGGAGAAG
GCGCTCTCGC GCACCGAGCG CTTCTGGGAG AAGTGGGTCA ACCAGTGCAC CTACGAGGGC
CCCTACCGCG AGGCGGTGAT CCGCTCCCTC ATCGTGCTCA AGGCCCTGAC CTACCGCCCC
ACCGGCGGGA TCGTCGCCGC CCCCACCACC TCCCTGCCCG AGGAGATCGG CGGGGTGCGC
AACTGGGACT ACCGCTACTG CTGGCTGCGC GACGCCACCA TCACGCTGGA GGCGATGATC
CGCTCCGGCT ACAAGGACGA GGCGCTGGCC TGGCGCGAGT GGCTGGTGCG GGCGATCGCG
GGCGAACCCC AGCTCATGCA GATCATGTAC GGCATCCGGG GCGAGCGCAG ACTCACCGAG
TGGGAGGCCG AGTGGCTGCC GGGCTACGAG GCCTCCCGTC CGGTCCGGAT CGGCAACGCC
GCCGTGGGCC AGTACCAGCT CGACGTCTAC GGCGAGGTCA TGGACGTGCT GCACCTGGCC
CGCCGCCACA ACATCCGCGG CGGCGACTAC CTGTGGGGCC TCCAGCGCTC GCTGGTCAAC
TACCTGGAGT GGTGCTGGGA CGAGCCGGAC GAGGGCCTGT GGGAGGTGCG CGGGCCCCGC
CAGCACTTCG TGCACTCCAA GGTGATGGCC TGGGTGGCGG CCGACCGCGC GGTGCGCAGC
ATCGAGGAGT TCGGCAAGGA GGGGCCCATC GAACGCTGGA GGGCCCTGCG CGACACCATC
CACGCCGAGG TGTGCGAGTA CGGCTACGAC CCCCAGCGCA ACACGTTCAC CCAGTACTAC
GGCAGCAAGG AGCTGGACGC GGCGCTCCTG CTGATCCCCG AGGTGGGTTT CCTGCCCTAC
GACGACCCGC GCGTGGTCGG CACCATCGAG GCGGTGCGCA AGGACCTGAT GGTGGACGGG
TTCGTGCTGC GCTACCGCAC CGACCTGGAC GACTCCGCCG ACCAGCTGCC CGGCAACGAG
GGCGCGTTCC TGGCGTGCAG CTTCTGGATG GCCAACGCGC TGCTGTCGAT CGGCCGCCAG
GACGAGGCCC GCGAGCTGTT CGAGCGGCTG CTGTCCCTGC GCAACGACGT GGGCCTGCTG
GCCGAGGAGT GGGACCCGCG CGAGAACCGC CAGGTCGGCA ACTTCCCCCA GGCGTTCAGC
CACGTGCCGC TGGTGACCAC CGCGCTCAAC CTGTCCACCC GCCAGGGGGG ATGGCGCGCC
GAGTAG
 
Protein sequence
MGVSRVPGWI EDYAMIGDMQ TAALVGRDGS IDWACLPDFD SSACFAALLG DEQNGCWTLR 
PAEGEPRATR RRYRGDTLIL ESEWDTPSGS VRVIDFMPPR GGAPHIVRIV EGLSGSVRME
TTMRIRFDYG HVVPWVHRTG AELVAIAGPD AIWLSTPISL QGHNFTHDAT FTVTAGQRVP
FVMTWHPSQV EESDHLDAEK ALSRTERFWE KWVNQCTYEG PYREAVIRSL IVLKALTYRP
TGGIVAAPTT SLPEEIGGVR NWDYRYCWLR DATITLEAMI RSGYKDEALA WREWLVRAIA
GEPQLMQIMY GIRGERRLTE WEAEWLPGYE ASRPVRIGNA AVGQYQLDVY GEVMDVLHLA
RRHNIRGGDY LWGLQRSLVN YLEWCWDEPD EGLWEVRGPR QHFVHSKVMA WVAADRAVRS
IEEFGKEGPI ERWRALRDTI HAEVCEYGYD PQRNTFTQYY GSKELDAALL LIPEVGFLPY
DDPRVVGTIE AVRKDLMVDG FVLRYRTDLD DSADQLPGNE GAFLACSFWM ANALLSIGRQ
DEARELFERL LSLRNDVGLL AEEWDPRENR QVGNFPQAFS HVPLVTTALN LSTRQGGWRA
E