Gene Csal_0086 details

Gene Information

Locus tag: Csal_0086
Symbol:
ID: 4026008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 106660
End bp: 108276
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 50%
IMG OID: 637965237
Product: restriction modification system DNA specificity protein
Protein accession: YP_572149
Protein GI: 92112221
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
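
The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(106660, 108276)
print(length)                     # 1617
print(protein_length_aa(length))  # 538
```

Both values match the Gene Length and Protein Length fields of this record.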


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGTC CAATTGATGA GGCATTACCA GTGCACTCTA TGGAAAAAAA ATTAGCTAAT 
ATAAAAACAC CTCACTGGTT ATGGATAGAG CATAACCAGA TAGCGGAGAT CAACCCAAAG
AAGCCTAAAC TCGATGAAGA GCTTTCAGTT TCCTTCATCC CGATGGGCGC TGTAGCAGAA
GAGTCTGGCC GCTACACCAC CGACGACAGC AAGAAATTCG AAGACGTAAA AAAAGGATAT
ACTTACTTTT CAGATGGCGA CATCCTTTTT GCTAAAATCA CTCCTTGCAT GGAAAACGGA
AAAGTTGCAC TTTTGAGCAA CCTAACCAAT GGCGTCGGCT TTGGGTCCAC AGAATTCCAC
GTATCACGCC TAACCGAAGC CGTTGAAAAA AAATTTTATT TTTATTTTTT TGTTTCCAAG
AGTTTCAGAA AACAAGCTCA AGCCAACATG GCTGGCAGTG CCGGCCAGCT TCGTGTCACC
ACTGACTACT TTAGCAATGT CAGCGTCCCA CTTTGCCCTA CCAGAGAACA ACAGCGAATT
GTCACCAAGA TAGAGGAGCT TTTCTCCGAA ATCGATAGCG GTGTGGAAAG CCTGAAAACC
GCCCAGGCCA AGCTCAAGAC CGCCCGCCAG TCACTGCTCA AGGCCGCCTT CGAAGGCAAG
CTGACCGAGC AGTGGCGAAA AGACAATGCC GATCGACAGG AAAGCCCGGA AGCCTTGCTG
GAGCGGATTC AGGCCGAGCG CGAGGCGCAC TACCAACAGC AGCTGACCGA CTGGCAACAT
CAGCTCAAGG ACTGGGAAGC CGCCGGCAAG GAAGGCAAGA AACCCCGCAA GCCCAAGGTG
CCCAAGGCCC TGCCACCATT GACACAGCAA GAGCTGGCCG AGTTACCAGA ATTGCCGGAG
GGGTGGAAAT GGATAAACCT GGGTAACATT TCGGAGATAT CAGGCGGCAT CACCAAGAAC
CAAAAACGTC AATCATTGCC ACAAAAAAAC CCTTTCCTTC GGGTGGCCAA TGTATACGCG
AACAAGCTGG AACTGGATGA CATCCACTTC ATCGGGACTA CTCCTGATGA AGCAAAAAGA
GCAAAACTAA AAAAAGACGA CCTGCTTATC GTCGAGGGAA ATGGAAGCCC TGACCAAATA
GGAAGAGTCG CAAAATGGGA TGGATCGATA GAGCACTGCA CACACCAAAA TCACTTGATA
CGTTCAAGAT TGGCAAGCCC AATCAGCGCT GATTTTGTCC TGCATTTTCT TCTCTCGGCA
ACAGGAAGAA AAGCAATTAA AAAAGTGGCT AGCTCTACAT CTGGTCTTTA CACACTCAGC
CTTGCAAAAG TTGAAAAGCT TTGCATCCCT GTTTGCTCAA AAAACGAGCA GATGATGATT
GTCGATCAAC TTGAGTCACG CCTCTCCCAA CTCGACCAAT TGGAGCGGAC CCTGACCGCT
TCCATGAAAC AGGCCGAAGC GCTCAAGCAG TCCATCCTCA AGCGCGCCTT CGCCGGTCGA
CTGGTGCCTC AGGATCCCGA CGACGAGCCG GCCAGCGAGC TGTTGGCGCG CATCCGCGCC
GAGCGGGAAA GCCAGCCAAG GGCCCCTCGC AAGCTGCACA GGGAACCGAC GCCATGA
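
The gene sequence is displayed in blocks of 10 bases, 60 per row, so the spacing has to be removed before the sequence is used. The sketch below normalizes the displayed text and computes the fraction of G and C bases, which is what the GC content field summarizes. The helper names are illustrative, and only the first displayed row is used as sample input.

```python
def clean_sequence(text):
    """Remove display spaces and line breaks, and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGAATAGTC CAATTGATGA GGCATTACCA GTGCACTCTA TGGAAAAAAA ATTAGCTAAT"
row = clean_sequence(first_row)
print(len(row))          # 60
print(gc_fraction(row))  # GC fraction of this row only, not of the whole gene
```

Running `gc_fraction` on the full cleaned sequence gives the value to compare against the GC content field.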
 
Protein sequence
MNSPIDEALP VHSMEKKLAN IKTPHWLWIE HNQIAEINPK KPKLDEELSV SFIPMGAVAE 
ESGRYTTDDS KKFEDVKKGY TYFSDGDILF AKITPCMENG KVALLSNLTN GVGFGSTEFH
VSRLTEAVEK KFYFYFFVSK SFRKQAQANM AGSAGQLRVT TDYFSNVSVP LCPTREQQRI
VTKIEELFSE IDSGVESLKT AQAKLKTARQ SLLKAAFEGK LTEQWRKDNA DRQESPEALL
ERIQAEREAH YQQQLTDWQH QLKDWEAAGK EGKKPRKPKV PKALPPLTQQ ELAELPELPE
GWKWINLGNI SEISGGITKN QKRQSLPQKN PFLRVANVYA NKLELDDIHF IGTTPDEAKR
AKLKKDDLLI VEGNGSPDQI GRVAKWDGSI EHCTHQNHLI RSRLASPISA DFVLHFLLSA
TGRKAIKKVA SSTSGLYTLS LAKVEKLCIP VCSKNEQMMI VDQLESRLSQ LDQLERTLTA
SMKQAEALKQ SILKRAFAGR LVPQDPDDEP ASELLARIRA ERESQPRAPR KLHREPTP
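
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact NCBI layout (bases in TCAG order). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Because this gene begins with ATG, no special handling of alternative starts is included here, which is a simplification. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a DNA string (display spacing allowed) up to the first stop codon."""
    dna = "".join(dna_text.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGAATAGTC CAATTGATGA GGCATTACCA GTGCACTCTA TGGAAAAAAA ATTAGCTAAT"
last_row = "GAGCGGGAAA GCCAGCCAAG GGCCCCTCGC AAGCTGCACA GGGAACCGAC GCCATGA"
print(translate(first_row))  # MNSPIDEALPVHSMEKKLAN
print(translate(last_row))   # ERESQPRAPRKLHREPTP
```

The two outputs correspond to the first 20 and the last 18 residues of the protein sequence listed above. The rows can be translated separately only because each full row holds 60 bases, a multiple of 3, so every row starts in frame.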