Gene Ndas_4348 details


Gene Information

Locus tag: Ndas_4348
Symbol:
ID: 9248223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5181057
End bp: 5182313
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: domain of unknown function DUF1727
Protein accession: YP_003682243
Protein GI: 297563269
COG category:
COG ID:
TIGRFAM ID:
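
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5181057, 5182313)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Under these assumptions, 1257 bp corresponds to 419 codons, of which the last is the stop codon, giving the 418 aa reported for the protein.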


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0229746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.964565
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGC TTCCCCTGCG CGCCCAACTG GCATCGGTTC TGGGAAGGAG CGCGGCCAGC 
CTGTCCCGTG CCACCGGACG CGGAGACGGC TCCGTCATCG GCGGCCGGGT GGCGCTCAAG
GTCGAACCCG ACCTGCTCGC CAAGCTCGCC CGGGGCCGCA GGCTCGCCCT GGTCAGCGCC
ACCAACGGCA AGACCACCAC CACCAGGCTC ATCTCGCACG CCCTGCGCGA GTTCGGCGAC
GTGGCCACCA ACGAGCACGG CGCGAACATG CCCACCGGGC ACATCACGGC TCTGTCGAAC
AACCAGTCCG CCGTCAACGG CGTGCTGGAG GTGGACGAGA AGTACCTCCC GCAGGTGCTG
CTCGCCACGC AGCCCGCGTT CGTGGTGCTG ATGAACCTCA GCCGCGACCA GATGGACCGC
GCCTCCGAGA TCAACCTGCT CGCCAAGAAG TGGCGCCTCG CGCTGGGCAA GAGCAACGCC
CACGTCATCG CCAACGCCGA CGACCCGCTC GTGGCCTGGG CGGGCCTGGG CGCGCCCAAC
GCCACCTGGG TGTCCGCGGG TCAGCGCTGG AAGGAGGACT CCTGGTGCTG CCCCGAGTGC
GGCGGCCACC TCAAGCGCGA CGTGGACCCG CACTGGGCCT GCCCCGAGTG CGGGCTGGCC
CGCCCCGCGA CCACCTGGGC GGTGGACAAC GCCTCCGACT CCCTCCTCAC CCCGGAGGGC
CAGAGCATCA AGCTGCGGCT GAACCTGCCC GGTGACGCCA ACCGCTCCAA CGCCGCGATC
GCCGCGGCCA CCGCCGCCGG GTACGGCATC CACCCCGAGC GCACCGTGGA GCGGCTGCGC
GAGATCACCT CCGTCGCGGG CCGCTACACC TCCGTGGTGA CCATGGGCGT CGAGGTGCGG
CTGCTGCTCT CCAAGAACCC CGCCGGATGG CTGGAGTCCT TCGCCGTCCT CGACCCGCCC
CACACCCCGG TGATCCTCTC GGTCAACGCG CAGGTCCCCG ACGGCAAGGA CACCTCCTGG
CTGTGGGACG TGGACTACAC CGTCCTGCGC GGACGCCGCG TGTTCGTCAT GGGCGAGCGC
CGCACCGATC TCGCGCTGCG CCTGGAGACC GACGGCGTGC GGTTCGAGGT GGCCGACCGG
GTCGACGAGG TCCTGGGCCG CATCAAGGCC GACCAGCCGG GCATCACCAA GGTCGACCTC
ATCGCCAACT ACACCGCCTT CCAGCAGATC CGCACGGCGT ACGGCCGCGT CCAGTAG
 
Protein sequence
MSELPLRAQL ASVLGRSAAS LSRATGRGDG SVIGGRVALK VEPDLLAKLA RGRRLALVSA 
TNGKTTTTRL ISHALREFGD VATNEHGANM PTGHITALSN NQSAVNGVLE VDEKYLPQVL
LATQPAFVVL MNLSRDQMDR ASEINLLAKK WRLALGKSNA HVIANADDPL VAWAGLGAPN
ATWVSAGQRW KEDSWCCPEC GGHLKRDVDP HWACPECGLA RPATTWAVDN ASDSLLTPEG
QSIKLRLNLP GDANRSNAAI AAATAAGYGI HPERTVERLR EITSVAGRYT SVVTMGVEVR
LLLSKNPAGW LESFAVLDPP HTPVILSVNA QVPDGKDTSW LWDVDYTVLR GRRVFVMGER
RTDLALRLET DGVRFEVADR VDEVLGRIKA DQPGITKVDL IANYTAFQQI RTAYGRVQ
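
The protein sequence can be re-derived from the gene sequence by codon-by-codon translation. The following is a minimal sketch rather than the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two differ in the permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the sequence is assumed to be given 5' to 3' in the coding orientation, as printed in this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAGCGAGC TTCCCCTGCG CGCCCAACTG"))  # MSELPLRAQL
```

Passing the full gene sequence to `translate` should reproduce the 418-residue protein, and `gc_fraction` on the same input can be compared with the reported GC content of 71%.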