Gene Ndas_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1819 
Symbol 
ID9245669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2225058 
End bp2226290 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content67% 
IMG OID 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003679753 
Protein GI297560779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.538618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACA AGAGCGTGAA GCGGGCGTTT CGGTACCGCT TCTATCCGAG TGATGCGCAG 
GCGGCTGAGC TGTCGCGCAC GTTCGGGTGC GTGCGCCTGG TCTACAACCG CGCCCTGGCC
GAACGCAGCA CCGCCTGGCA CCAGCGCCGG GAACGGGTGG GCTACTCCCA CACCTCGGCC
ATGCTGACCG CTTGGAAGAA GACGGACGAG CTGTCCTTCC TCACCGAGGT CTCCTGCGTC
CCCCTTCAGC AGACGCTGCG CCACCTGCAC ACCGCCTTTC GCAACTTCTT CGACCGGCGC
GCACAGTATC CGCGGTTCAA GTCCAAGAAG ACGTCGCGTG CCTCGGCGGA GTACACCGCG
AGCGCGTTCC GCTACCGCGA CGGCCACCTG ACCCTGGCCA AGATGACCGA ACCCCTGGAC
ATCGTGTGGT CGCGCCCCCT GCCTGAAGGA GCAAGGCCGT CCACGGTGAC GGTGTCCCGG
GATGCGGCCG GGCGCTGGTT CGTGTCCCTG CTGTGCGAGG ACACCGTCGA GTCGGCTCCG
GCCGCCAACG ACGCGGTGGG TGTGGACAAG GGTGTGACGT CGCTGGTGGT GTTGTCCACG
GGGGAGAAGG TGGCCAACCC CCGCCACGAA CAACGCGACC GCGCCAAGCT GGCCCGCGCC
CAGCGGGCGC TGGCCCGCAA GGCCAAGGGC AGCGCGAACC GGGACAAGGC CCGCCGCAAG
GTCGCGCGGG TGCACGCCCG CATCACCGAC CGCAGGCGCG ACTTCCTGCA CAAGCTCTCC
ACTCGACTCG TCCGCGAGAA CCAAGTGGTC GTGATCGAGG ACCTGACGGT GCGCAACATG
GTCAGAAACC GCAGACTCGC CCGAGCGATT TCGGATGCGG CCTGGCGTGA GCTGCGCACG
ATGCTGGAGT ACAAGTGCGC CTGGTACGGA CGCGATCTGG TCGTGGTGGA CCGGTTCTTT
CCCTCCTCCA AGTTGTGTTC GACGCCCGGG TGCGGGTACC TCAATGTGTC GTTGCCGTTG
CGTGTACGGG AGTGGACGTG TCCCGGTTGT GGGGTGGCTC ATGACCGTGA TGTGAACGCG
GCGCTCAATC TCGAAGCCGC CGGGCTGGCG GTGTTGGCCT GTGGAGCTGG TGTGAGACCT
CAACGGGAGT CCTCCCGGAC GGGGCGACCG GCGGTGAAGC AGGAAGGCCA CGGGGCGACC
CGTGACGAGG CGTTGGCCTC GAACCACCGG TAG
 
Protein sequence
MTNKSVKRAF RYRFYPSDAQ AAELSRTFGC VRLVYNRALA ERSTAWHQRR ERVGYSHTSA 
MLTAWKKTDE LSFLTEVSCV PLQQTLRHLH TAFRNFFDRR AQYPRFKSKK TSRASAEYTA
SAFRYRDGHL TLAKMTEPLD IVWSRPLPEG ARPSTVTVSR DAAGRWFVSL LCEDTVESAP
AANDAVGVDK GVTSLVVLST GEKVANPRHE QRDRAKLARA QRALARKAKG SANRDKARRK
VARVHARITD RRRDFLHKLS TRLVRENQVV VIEDLTVRNM VRNRRLARAI SDAAWRELRT
MLEYKCAWYG RDLVVVDRFF PSSKLCSTPG CGYLNVSLPL RVREWTCPGC GVAHDRDVNA
ALNLEAAGLA VLACGAGVRP QRESSRTGRP AVKQEGHGAT RDEALASNHR