Gene Csal_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0084 
Symbol 
ID4026006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp103254 
End bp104741 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content59% 
IMG OID637965235 
ProductN-6 DNA methylase 
Protein accessionYP_572147 
Protein GI92112219 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCTC AAACCCTGGC CAATAAAGTC TGGAACTTCT GCCATACCCT GCGCGACGAC 
GGGGTGGGCT ATGGCGATTA TCTCGAACAG CTCACCTATC TGATCTTCTT GAAGATGGCC
CACGAGTACA GCCAGCCCCC CTATCGCCGA GACACCGGGA TTCCCGCAGG CTACGGCTGG
CCCTCGCTCG TCAGGCGCAC CGGCGCCGAG CTCGAGGCCC ACTATCTCGA CCTGCTGCGT
ACCTTGGGCC AGCAATCGGG CACGCTGGGG CAGATCTTCA CCAAGGCACA GAACAAGATT
CAAGACCCCG CCAAGCTCGC TCGCGTGATT CACATGATCG ACGCCGAGAA GTGGGCCATG
CTGGATGCCG ACGTCAAGGG CGACATCTAC GAGAGCCTAC TGGAGAAGAA CGCCGAAGAT
ACCAAATCCG GCGCCGGCCA GTACTTCACG CCGCGCGCCT TGATTCAGGC CATGGTCGCA
TGCGTACAAC CACAACCCGG CAAGACCATC GCCGACCCTT CCGCCGGCAC CGGCGGTTTC
TTCCTAGCCG CTTACGACTG GATCACCGAG CATCATGGTG CCCGGATGGA TCGCGAGCAG
AAACAGTTCC TCAAGCATCA TGCCTTTCAC GGTAATGAGA TCGTCGCCAA TACTCGGCGG
CTGTGCCTGA TGAACCTTTT CCTTCACAAC ATCGGGGAGA TCGATGATCA ACCCAACATT
GCACCCACCG ATGCGTTGAT CGGCCCGGCC CCGGCGCGTT ATGACTACGT GCTGGCCAAC
CCGCCATTCG GTCGCAAGAG CTCGATGACC GTCACCAACG AAGAAGGCGA ACAGGAAAAG
GAAGACTTCG TTTACAACCG ACAGGACTTC TGGGCGACCA CTTCCAACAA GCAACTCAAT
TTCGTCCAGC ACATCCGCAC CATGCTGAAG GAAAACGGCC AGGCCGCCGT GGTGGTGCCG
GACAATGTGC TGTTCGAGGG CGGGGCGGGC GAGACCGTGC GCCGCAAGCT GCTGACCACC
ACCGAGCTGC ATACCATCCT TCGGCTCCCC ACAGGCATCT TCTACGCCAA CGGCGTCAAG
GCCAACGTGC TGTTCTTCGA TAACAAGCCA GGACGCGCCG AGCCGTGGAC GAAGGACATC
TGGATCTACG ACTACCGCAC CAACGTTCAT CACACGCTTA AGAGGAAGCC GCTTCGCCTC
GAGCATCTCC AGGAATTCAT CGACTGCTAC CAACCGGGCC AGCGCGACCG GCGTCAGGAA
ACCTGGAGCG AGGCGACCCC CGACGGCCGC TGGCGGAAAT ACTCGCTGGA TGAGGTGCTG
AAACGCGACA AGGTCAGCCT GGATATCTTC TGGCTGAAGG ATGAATCGCT TGGAGACATG
GATAACCTAC CTGAACCGGA CGTGCTGATA GGCGACATCA TCGAAAACCT AGAGGCTGGC
CTTGAGGCGT TTCGCAACGT GGCCCAAGGC CTTGAGAACT CAAGATAG
 
Protein sequence
MNAQTLANKV WNFCHTLRDD GVGYGDYLEQ LTYLIFLKMA HEYSQPPYRR DTGIPAGYGW 
PSLVRRTGAE LEAHYLDLLR TLGQQSGTLG QIFTKAQNKI QDPAKLARVI HMIDAEKWAM
LDADVKGDIY ESLLEKNAED TKSGAGQYFT PRALIQAMVA CVQPQPGKTI ADPSAGTGGF
FLAAYDWITE HHGARMDREQ KQFLKHHAFH GNEIVANTRR LCLMNLFLHN IGEIDDQPNI
APTDALIGPA PARYDYVLAN PPFGRKSSMT VTNEEGEQEK EDFVYNRQDF WATTSNKQLN
FVQHIRTMLK ENGQAAVVVP DNVLFEGGAG ETVRRKLLTT TELHTILRLP TGIFYANGVK
ANVLFFDNKP GRAEPWTKDI WIYDYRTNVH HTLKRKPLRL EHLQEFIDCY QPGQRDRRQE
TWSEATPDGR WRKYSLDEVL KRDKVSLDIF WLKDESLGDM DNLPEPDVLI GDIIENLEAG
LEAFRNVAQG LENSR