Gene Ndas_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1940 
Symbol 
ID9245790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2363726 
End bp2364736 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content70% 
IMG OID 
Productprotein of unknown function DUF21 
Protein accessionYP_003679873 
Protein GI297560899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.177932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.094391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA TGAACGTGTG GGTGGCGCTC GCGCTGACCG CGGTGATCAT CGCGCTGAGC 
GCCTTCTTCG TGGCGATCGA GTTCGCCCTG GTGGCGGCGC GCCGCTACCG GCTGGAGGAG
GCCGCCGAGT CCAGCTTCTC GGCACGGGCC GCGGTCAGGA GCGCCCGCGA CCTGTCGCTG
CTGCTGGCCG GTTCGCAGCT GGGGATCACC CTGTGCGCCC TGGCGCTGGG CGCGATCTCC
AAGCCCGCCG TCCACCACAT GCTGGAGCCG CTGTTCGGCG GCCTGCCGGC GGCGGTGGGC
TACGTGGTCT CGTTCGTGCT GTCGCTGATC GTGGTGACCT TCCTGCACCT GGTGGTGGGT
GAGATGGCGC CCAAGTCCTG GGCGATCTCG CACCCGGAGA AGTCGGCGAT CATGCTGGCC
GTGCCGATGC GGGCGTTCAT GTGGTTCACC CGTCCGCTGC TGCTGATGCT CAACGGCATG
GCCAACTGGT GCCTGCACCG GCTGGGCGTG GAGGCGGTGG ACGAGATGTC GTCCGGGCAC
GGTCCCGACG ACGTGCGCGA GCTGGTGGAG CACTCGGCCA AGGCCGGTGC GCTCGACCCC
GAGCGCCGCG CCCAGCTGGC CACGGCGCTG GAGGTCAACT CCCGTCCGCT GAGCGAGATC
GTGACACCGC GCGAGGAGAT CGCGTCGGTG TCCCCGAACT CGACGGTGGA CGACATCAAG
CAGGTGTCGC GGGAGTCCAC GCACCTGCGC CTGGTGGTGA TGGACGGCAC CGAACCCGTG
GGCGTGCTGC ACGTGCGCGA GGCGCTGACG GGCCCGGAGG GGACCACCGC GGCCGACCTG
ATGCGGCCGG TGCTCACCCT GGCCGCGGAG ACGCCGATGT ACGCGGCGAT GGGCATCATG
CGGGAGAGCC GCAGCCACCT GTCCCTGGTG GAGACGGACG GCGAGGTGAT CGGCCTGGTC
ACCCTCCAGG ACATCCTGGA CCGCCTGCTG CTGCTGGACA CGGCCGCCTG A
 
Protein sequence
MSDMNVWVAL ALTAVIIALS AFFVAIEFAL VAARRYRLEE AAESSFSARA AVRSARDLSL 
LLAGSQLGIT LCALALGAIS KPAVHHMLEP LFGGLPAAVG YVVSFVLSLI VVTFLHLVVG
EMAPKSWAIS HPEKSAIMLA VPMRAFMWFT RPLLLMLNGM ANWCLHRLGV EAVDEMSSGH
GPDDVRELVE HSAKAGALDP ERRAQLATAL EVNSRPLSEI VTPREEIASV SPNSTVDDIK
QVSRESTHLR LVVMDGTEPV GVLHVREALT GPEGTTAADL MRPVLTLAAE TPMYAAMGIM
RESRSHLSLV ETDGEVIGLV TLQDILDRLL LLDTAA