Gene Noca_2284 details

Gene Information

Locus tag: Noca_2284
Symbol:
ID: 4595832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2433741
End bp: 2434940
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 65%
IMG OID: 639776883
Product: DNA-cytosine methyltransferase
Protein accession: YP_923476
Protein GI: 119716511
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
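
The coordinate and length fields in this record are mutually consistent. A minimal sketch of the check, assuming 1-based inclusive coordinates and a single terminal stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 2433741
end_bp = 2434940

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # compare with the "Gene Length" field
print(protein_length, "aa")   # compare with the "Protein Length" field
```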


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCACG ACGCTCCGGA GGCAGCAGAC GACGCCGACT CTGGCTCTTG CATCGAACTC 
TTTTCCGGTG GTGGTGGCCT CGCGATGGCA CTGCACGAGG CTGGCTTTCG CCACCTTCTG
CTCAACGAGC TGAACAAGCG TGCTTGCGCG ACCCTCAGGG CCAACAACGC CGTGGACTAC
CTGCCCGACG AGACACCTCC CGCGACGCTC GCCGACCCGT GGCCCCTCAT CGAAGGAGGG
ATTGGGGAAG TCGACTTCAC GCCGTTCCTC GGAGACGTCG ATGTCGTCGC TGGCGGCGTA
CCTTGCCAGC CGTTCAGCCT TGGCGGGGCC CACAAGGGTC ACCTCGACGA ACGCAACCTG
TGGCCTGAGT TCAACCGTTG TGTCCGGGAA ACACGACCAC TGGTCATCCT CGCCGAGAAC
GTGCGGGGCC TCCTCCGGCC CTCCTTCGAG CCCTATTGGG ATTACATCCG CCGAGAACTG
GCTGCCCCAT TCGAGCAACG CGTCGACGGG GAACCATGGG CCGATCATGA CCGCCGTCTG
GTCAAGGCGT TGCGCGGTGG CGGCGGCGAT CCCACCGAGC GATACGACAT CGCGTTCAAG
CTGGTCAACG CCGCCGACTA CGGCGTTCCG CAGAATCGAT GGCGAGTTGT GCTGGTCGGT
TTCCGCAAGG ATCTAGGGAT CTCGTGGAGC TTCCCCGACC CCACGCACAG TGCCGGCGCT
CTTCTGCGCG CCCAACTGTC CGGAGAGTAC TCAGATCGGC ATCCCCACGC GCCGATCAAA
GAGCATCCCG GCGTCACGCC ACCGGAGGAC GGTCTACGGC CATGGAAGAC CCTGAGGGAC
GCCATCCACG ACCTGCCCGA ACCGGTAGAG CGACAGGACA CGCCGGGCTA CATCCACCAC
ATCGGCTGGC CAGGCGCTCG TGAGTACCCC GGGCACACCG CCAACGTGCT CGACAGGCCG
GCCAAGACCG TCAAGGCAGG CGTACACGGC GTTCCTGGCG GTGAGTCGGT TCTCCGGCGT
GATGACGGGA GCATCCGCTA CCTGACTGTT CGCGAAGTCG CGCGCATCAT GACGTTCCCT
GACGATTGGC GGCTCGAGGG TCCGCGGGGC GAGCAAATGC GCCAACTCGG GAATGCCGTC
CCCGTCCGTC TCGGTGCAGT GATGGGTCGC GAGATCGCGA AGGTTCTGCG GGAACGATGA
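
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the layout and compute the GC fraction reported in the "GC content" field. The function names `clean_sequence` and `gc_content` are hypothetical helpers, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTCCACG ACGCTCCGGA GGCAGCAGAC GACGCCGACT CTGGCTCTTG CATCGAACTC"
print(len(clean_sequence(first_line)))
print(round(100 * gc_content(first_line)), "% GC in the first line")
```

Passing the full gene sequence instead of `first_line` gives the value to compare against the record's GC content.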
 
Protein sequence
MVHDAPEAAD DADSGSCIEL FSGGGGLAMA LHEAGFRHLL LNELNKRACA TLRANNAVDY 
LPDETPPATL ADPWPLIEGG IGEVDFTPFL GDVDVVAGGV PCQPFSLGGA HKGHLDERNL
WPEFNRCVRE TRPLVILAEN VRGLLRPSFE PYWDYIRREL AAPFEQRVDG EPWADHDRRL
VKALRGGGGD PTERYDIAFK LVNAADYGVP QNRWRVVLVG FRKDLGISWS FPDPTHSAGA
LLRAQLSGEY SDRHPHAPIK EHPGVTPPED GLRPWKTLRD AIHDLPEPVE RQDTPGYIHH
IGWPGAREYP GHTANVLDRP AKTVKAGVHG VPGGESVLRR DDGSIRYLTV REVARIMTFP
DDWRLEGPRG EQMRQLGNAV PVRLGAVMGR EIAKVLRER
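
The protein sequence follows from the gene sequence through translation table 11, which assigns the same amino acids to codons as the standard code. A minimal sketch, assuming the conventional TCAG codon ordering; the `translate` helper is illustrative and ignores alternative start codons:

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (standard code, also table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGGTCCACG ACGCTCCGGA GGCAGCAGAC"))
```

The printed residues can be compared with the first block of the protein sequence above.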