Gene Ndas_3677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3677 
Symbol 
ID9247546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4413277 
End bp4414374 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content71% 
IMG OID 
Productradical SAM enzyme, Cfr family 
Protein accessionYP_003681581 
Protein GI297562607 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCG AGCTCACCTT TGTCGCACCC CGTCGTGCCA AACCTCCCCG GCACCTGGCC 
GACCTCAGCC CCGACGAGCG GGCCGAGGTC GTCCGCGAGC TGGGGGAGAA GCCCTTCCGG
GCCAAGCAGC TCGCCCAGCA CTACTTCGGC TCCCTGGTGT CGGACACCTC GGCCATGACC
GACCTGCCCG CCTCCTCCCG GGAGCGGCTC GGCGAGGCGC TGCTGCCCAC CCTGCTCACG
CCCGTGCGCC ACATCACCTG CGACAACGGC ATGACCCGCA AGACCCTGTG GAAGGCGTTC
GACGGGGTGC TGTTCGAGTC GGTGCTCATG CGCTACCCGG ACCGCGTCAC CCTGTGCATC
TCCTCCCAGG CCGGGTGCGG CATGAACTGC CCGTTCTGCG CCACCGGCCA GGCGGGCCTG
ACCCGCAACC TCTCCACCGG GGAGATCATC GACCAGGTGG TCGCCAGCGC CCGCGACCTG
GCCAACGGCG AGGTCGCCGG CGGCCCGGGC CGCATCAGCA ACATCGTGTT CATGGGCATG
GGCGAGCCCA TGGCCAACTA CAAGCGCGTG CTCCAGTCGG TGCGCCGCAT CACCGACCCC
GTCCCCAACG GCCTGGGCAT CTCCCAGCGC GGGGTCACCG TGTCCACGGT CGGCCTGGTG
CCCGCGATCA ACAAGCTCAT CGACGAGAGG ATGCAGGTCC GCCTCGCGAT CTCCCTGCAC
GCCCCCGACG ACGAGCTGCG CGACGAGCTG GTGCCCATCA ACACCCGCTG GAAGGTCGAC
GAGGTCCTGG ACGCCGCCTG GCGCTACGCG GGCACCACGG GCCGCCGGGT CTCCATCGAG
TACGCGCTGA TCAAGGACAT CAACGACCAG GCCTGGCGGG CCGACCTGCT GGGCAAGCTC
CTCAAGGGCC ACCTGGTGCA CGTCAACCTC ATCCCGCTCA ACCCCACCCC GGGGTCCAAG
TGGACGGCCT CGCGCCCCGA GGACGAGCGC GAGTTCGTGC GGCGGCTGGA GTCGCACGGG
GTGGCGGTGA CCGTCCGCGA CACCCGCGGA CAGGAGATCG ACGGCGCGTG CGGGCAGCTC
GCGGCGGCCG AGAACTGA
 
Protein sequence
MPAELTFVAP RRAKPPRHLA DLSPDERAEV VRELGEKPFR AKQLAQHYFG SLVSDTSAMT 
DLPASSRERL GEALLPTLLT PVRHITCDNG MTRKTLWKAF DGVLFESVLM RYPDRVTLCI
SSQAGCGMNC PFCATGQAGL TRNLSTGEII DQVVASARDL ANGEVAGGPG RISNIVFMGM
GEPMANYKRV LQSVRRITDP VPNGLGISQR GVTVSTVGLV PAINKLIDER MQVRLAISLH
APDDELRDEL VPINTRWKVD EVLDAAWRYA GTTGRRVSIE YALIKDINDQ AWRADLLGKL
LKGHLVHVNL IPLNPTPGSK WTASRPEDER EFVRRLESHG VAVTVRDTRG QEIDGACGQL
AAAEN