Gene Ndas_3383 details


Gene Information

Locus tag: Ndas_3383
Symbol:
ID: 9247248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4043852
End bp: 4045789
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Radical SAM domain protein
Protein accession: YP_003681294
Protein GI: 297562320
COG category:
COG ID:
TIGRFAM ID:
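
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `check_record` is hypothetical and not part of any database API.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    # Inclusive span covered by the gene on the replicon.
    span_bp = end_bp - start_bp + 1
    # Number of whole codons, and any leftover bases.
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        # One codon is the stop codon and encodes no residue.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Ndas_3383.
result = check_record(4043852, 4045789, 1938, 645)
print(result)
```

For this record all three checks pass: 4045789 - 4043852 + 1 = 1938 bp, and 1938 / 3 = 646 codons, one of which is the stop codon, leaving 645 residues.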


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.660882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCG AAAGCCTCTT CCCGCGGCTG GAACCCCTGC TGACGCAGGT GGCCAAGCCC 
ATCCAGTACG TCGGTGGCGA GCTCAACTCC GTGGTCAAGG AATGGGACGA GACCCGGGTG
CGCTGGGCGC TGATGTACCC GGACGCCTAC GAGGTGGGAG TGCCCAACCA GGGCGTCCAG
ATCCTGTACG AGGTCCTCAA CGAGCGCGAG GGCGTGCTGG CCGAGCGCGC CTACGCGGTG
TGGCCGGACC TGGAGAAGCT GATGCGCGAG CACGGCGTGC CCCACTTCAC CGTGGACTCG
CACCGGCCGC TGGCCGCCTT CGACGTGCTC GGGCTGAGCT TCGCCAGCGA GATGGGCTAC
ACCAACATGC TCACCGCGCT GGACCTGGCG GGCATCCCGC TGCGCGCGGC CGACCGCACC
GCCGACCACC CGGTCGTGCT GGCCGGCGGA CACTCGGCGT TCAACCCCGA GCCGATCGCC
GACTTCCTGG ACGCGGTGGT CCTGGGCGAC GGCGAGGAGA TCACCCTCGC CATCACCGAG
ATCATCCGCG AGTTCAAGGA GGAGGGCGAG CCCGGCGGGC GCGACGGCCT GCTGCTGCGC
CTGGCCGCGA CCGGGGGCGT GTACGTGCCC CGCTTCTTCG ACGTGGCCTA CCACGAGGAC
GGGCGGATCG CCTCCTACAC GCCCAACCGG CCCGGGGTGC CCGGCACGGT GCAGAAGCAC
ACCGTGATGG ACCTGGACCA GTGGCCCTAC CCCAAGAAGC CGATCGTGCC GACCGCCGAG
TCCGTGCACG AGCGCTACAG CGTGGAGATC TTCCGCGGCT GCACGCGCGG CTGCCGGTTC
TGCCAGGCGG GGATGATCAC CCGGCCGGTG CGCGAGCGCA ACAAGGAGAC CGTCACGAAG
ATGGTCGAGG ACGGCGTCGA GGCCTCCGGC TTCCAGGAGG TGGGGCTGCT GTCGCTGTCC
AGCGCCGACC ACAGCGAGAT CGGGCAGATC GCCAAGGGGC TCGCCGACCG CTACGAGGGC
ACCAACACCG GCCTGTCCCT GCCCTCCACC CGCGTGGACG CGTTCAACAT CGACCTGGCC
AACGAGCTGA CCCGCAACGG GCGCCGCTCC GGCCTGACGT TCGCGCCGGA GGGCGGCAGC
GAGCGGATGC GCCGGGTGAT CAACAAGATG GTCACCGAGG AGGACCTCAT CCGCACCGTC
ACCGCCGCGT ACGCGGCCGG GTGGCGGCAG GTGAAGCTGT ACTTCATGTG CGGCCTGCCC
ACCGAGGAGG ACGAGGACGT CCTGGCCATC GCCGACCTGG CCACCGAGGT GATCCGGACC
GGCCGTGAGG TGACCGGCCG CAAGGACATC CGCTGCACGG TGTCCATCGG CGGGTTCGTG
CCCAAGCCGC AGACCCCGTT CCAGTGGGCG GCGCAGACCT CGCACGAGGC CGTCGACGCC
CGGCTGCGCA AGCTGCGCGA CAGGCTGCGG GGCGACCGCA AGTACGGCAA GTCGATCGGT
CTGCGCTACC ACGAGGGCCG CCCCTCCATC ATCGAGGGCC TGCTCTCCCG GGGGGACCGC
AGGGTCGGCC GGGTGGTCGA GGAGGTGTGG CGCTCGGGGG GCCGCTTCGA CGGCTGGAGC
GAGCACTTCT CCTACGACCG GTGGTCCGAG GCCGCCGAGG CCGCCCTGGC CGACCTCCCG
GTGGACGTGG ACTGGTTCAC CACCCGCGAG CGCGACGAGG ACGAGGTCCT GCCCTGGGAC
CACCTGGACG CGGGCCTGGA CCGGTCGTGG CTGTGGCAGG ACTGGCAGGA CTCGCTGTAC
GGCGAGGAGT CGCTGGAGGT GGACGACTGC CGCTGGAACC CCTGCTACGA CTGCGGGGTC
TGCCCGAGCA TGGGCACCGA GATCCAGATC AACGCCCCTG CGGAGGGCCG TCCGCTGCTG
CCGCTCAACG TGGTCTAG
 
Protein sequence
MSVESLFPRL EPLLTQVAKP IQYVGGELNS VVKEWDETRV RWALMYPDAY EVGVPNQGVQ 
ILYEVLNERE GVLAERAYAV WPDLEKLMRE HGVPHFTVDS HRPLAAFDVL GLSFASEMGY
TNMLTALDLA GIPLRAADRT ADHPVVLAGG HSAFNPEPIA DFLDAVVLGD GEEITLAITE
IIREFKEEGE PGGRDGLLLR LAATGGVYVP RFFDVAYHED GRIASYTPNR PGVPGTVQKH
TVMDLDQWPY PKKPIVPTAE SVHERYSVEI FRGCTRGCRF CQAGMITRPV RERNKETVTK
MVEDGVEASG FQEVGLLSLS SADHSEIGQI AKGLADRYEG TNTGLSLPST RVDAFNIDLA
NELTRNGRRS GLTFAPEGGS ERMRRVINKM VTEEDLIRTV TAAYAAGWRQ VKLYFMCGLP
TEEDEDVLAI ADLATEVIRT GREVTGRKDI RCTVSIGGFV PKPQTPFQWA AQTSHEAVDA
RLRKLRDRLR GDRKYGKSIG LRYHEGRPSI IEGLLSRGDR RVGRVVEEVW RSGGRFDGWS
EHFSYDRWSE AAEAALADLP VDVDWFTTRE RDEDEVLPWD HLDAGLDRSW LWQDWQDSLY
GEESLEVDDC RWNPCYDCGV CPSMGTEIQI NAPAEGRPLL PLNVV
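
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last blocks of the gene. This is a minimal sketch, not the tool used to produce the record: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases) and its final 18 bases.
first_block = "ATGTCCGTCG AAAGCCTCTT CCCGCGGCTG GAACCCCTGC TGACGCAGGT GGCCAAGCCC"
last_block = "CCGCTCAACG TGGTCTAG"

print(translate(first_block))  # MSVESLFPRLEPLLTQVAKP
print(translate(last_block))   # PLNVV
```

The two outputs match the first 20 and the last 5 residues of the protein sequence listed above, and the final codon TAG is the stop codon.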