Gene Ndas_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1013 
Symbol 
ID9244859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1238719 
End bp1240365 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content75% 
IMG OID 
ProductAmidohydrolase 3 
Protein accessionYP_003678962 
Protein GI297559988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.668342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.733638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACC TCAAACTGAC CAACGCCCGC ATCCGCACGG TCGACGACGA CCGGCCCTTC 
GCCACGGTGC TCGGCATCGC CGCCGGGCGC GTCCTCGGCC TGGACGAGGA GGTCGCCGAC
CTGCCCGCGC GCAGGACGGT CGACTGCGGC GGCGCCGTGG TCGTCCCCGG CTTCGGGGAC
GCGCACAACC ACATGGCGTG GTTCGGACAG TCGCTCGCCG AGCTGGAGCT GGAGACGGTG
TCCACCCTGG ACGCCCTCTA CGACGCGGTC GCCCGGGCGG CCGCGACGCT GCCCGAGGAC
ACGTGGATCG TCGGCTCCGG CTACGACGAC GCCCTGCTCG GCGCCCACCC CGACCGCCAC
GGCCTCGACC GGGCCGGGGG CGGCCGACCC GTCTGGCTCA AGCACCGCTC CGGCCACATG
TGCACCGTCA GCAGCGCGGT CCTGCGCCAG GCCGGGATCG ACACCGCCGT GTCCGACACC
GCCGCGGCCG ACCCGGACGG GGGAGTGATC GTCCGCGACG GCGCGGGCGC CCCCACCGGC
CTGCTCCAGG AGCGCGCGCA GGAACTGGTG ACCGCACTGG TCATGCCCTA CCCGGTGACC
GAGCTCGCCG ACGCGCTCGC GCGCGCCTCC CGGGTCTACG CCTCCGAGGG CCTCACCCAC
GTCGTGGAGG CGGGCATCGG GCGCGGTCTG ATCGGCCGCA CCCCCGTGGA GGCCGCCGCC
TACCAGCTCG CCCGCGACCG CGGCGAGCTG CTTCCCCGGG TCGAGCTCAT GGTCGCCGCA
GACAACATGC ACCCGCTGGG CGGGCACGCC GACGACGGGA TCGACACCGG CATCGACCTG
GGGCTGCGCA CCGGCTTCGG CGACGACCGG CTGCGCCTGG GGCCGATGAA GATCTGGCTC
GACGGATCCC TCATCGGCCG CACCGCCGCC GTCACCGAAC CCCTCTGCGG GCACGGCCAC
GGCGTGTACC AGAACTCCCC GGAGGAGATG CGCGCGCTGG TCGTGGCCGC CCACCGGGCC
GGATGGCGCG TGGCCGCGCA CGCCATCGGC GACGACGCCG TCGACGTCGC GCTGGAGGCG
TTCGCCGAGG CCCAGCGCGC GCTGCCGCGC CCCGACGTGC GCCACCGCAT CGAGCACGCG
GGCGTGGTGC GCCCGGACCA GCTGCCCCGC ATCGCCGAGG CGGGCCTGGT CCCCGTGCCC
CAGCCCCGGT TCCTGTACGC CCTGGGCGAC GGCATGGCCG CCGCCGTGGG CCCGGAGCGC
GTGCCCTGGC TCTACCGGCA CCGGTCCTTC CTCGACCACG GGCTGCGCGT CCCGGGCAGC
TCGGACCGGC CGGTCGCCCC GGGGGCGCCG CTCCTGGGCA TGGAGTCGAT GGTGGAGCGC
GTGACCGCCT CGGGAACCGT TCTGGCCGCC GACGAGCGTG TCAGCGCCGA ACAGGCACTC
CGGGCCTACA CGATGGACGC GGCCTGGGCC AGCCACGACG AGCACCGCAG GGGCAGCCTG
ACCCCGGGCA AGCTCGCCGA CCTGGTGGTC CTGGACCGCG ACCCCGTGGA CACCGCGGAG
GAGGGCATCG GCACGATCCG CGTCCTGGCG ACCCTCGTCG GCGGGGAGTG CGTGCACGGG
GCGGACGTCC TCGACGGTCT GACCTGA
 
Protein sequence
MLDLKLTNAR IRTVDDDRPF ATVLGIAAGR VLGLDEEVAD LPARRTVDCG GAVVVPGFGD 
AHNHMAWFGQ SLAELELETV STLDALYDAV ARAAATLPED TWIVGSGYDD ALLGAHPDRH
GLDRAGGGRP VWLKHRSGHM CTVSSAVLRQ AGIDTAVSDT AAADPDGGVI VRDGAGAPTG
LLQERAQELV TALVMPYPVT ELADALARAS RVYASEGLTH VVEAGIGRGL IGRTPVEAAA
YQLARDRGEL LPRVELMVAA DNMHPLGGHA DDGIDTGIDL GLRTGFGDDR LRLGPMKIWL
DGSLIGRTAA VTEPLCGHGH GVYQNSPEEM RALVVAAHRA GWRVAAHAIG DDAVDVALEA
FAEAQRALPR PDVRHRIEHA GVVRPDQLPR IAEAGLVPVP QPRFLYALGD GMAAAVGPER
VPWLYRHRSF LDHGLRVPGS SDRPVAPGAP LLGMESMVER VTASGTVLAA DERVSAEQAL
RAYTMDAAWA SHDEHRRGSL TPGKLADLVV LDRDPVDTAE EGIGTIRVLA TLVGGECVHG
ADVLDGLT