Gene Csal_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1585 
Symbol 
ID4027604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1803788 
End bp1806064 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content71% 
IMG OID637966774 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_573637 
Protein GI92113709 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.882312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATCG GTTGGGCGAT GCCGCTGGCG TTGTCGGCCC TGGTCGGTAG CGTCTGGGGC 
GGGTGGGGCG TGTCGAGCGC CGCGGGGCCG TGGGGACTGC TGTGCCTGCT GGCGGTGGCT
CAGGCCCGCT GGCGCGGGCT GGCGATGCTG CTGGTGGCGG GATGGTGTGC GCTGAATGTC
GCCGTACAAG CGGCGGGCGA ACTCCCCGAC GGGCTGGCGC GTCAGGATGT CGCGCTCGAG
GGGCGGGTGA CACAGGTGAG CCGCGATCGC GGGCTGGCCA AGCTGCGCCT CGCGGTCGAG
CGCTGCGTGC CCCGGGCAGC GGGATTGCCG CCTTGCACGC GTCTGTCACG GGTACGCCTC
TCCTGGTACG ACGCGCCCGC GCTGCGTGTC GGCGATCAGT GGCGCCTGAC GGCGCGCCTG
CGCCCGCCGC AGGGCTTCGC CAACGGCTAT GGTTTCGACT ATGCCGCGTG GCTGTGGCGT
GAAGGCATCC AGGCGACGGG CTATGTGCGT GATGCCGCGT CCGCGCAGCG CCTCCAGGCG
GCCCCCGCCG GCCTGCGCGA GACGGGGCTG GCATTTCTGG AGGCACGTGC GTTGTCGCCG
CTCGGCAAGC GCTGGCTGGC GGCATTGACC CTCGGCGCCG GCGAGCGGTT GACCCAGGAC
GACTGGGATC TGCTCAACGC CACCGGTACC ACGCATCTCA TGGTGATCTC GGGGTTGCAT
GTCGGCCTGG TGACGAGCGT CGCGCTCTGG CTGCTGCGTT TCCTGGCGCG CGTCGTGACG
CCGGGACGCT GGCGCCTCGC GGCCTGGCCG TGGTGGCTGG CCGGGGCGGC GGCACTGGGG
TATGCCGGCC TGGCGGGTTT CGAGCCGCCG GCCCTGCGCG CCACGGTGAT GGCGCTGCTG
GGGTTGTGGG TGGCCAGCGG TCGCCATGCG CCCGGCCCCT GGCAGGCCTG GTGGCTGGCA
TTGCTGGTGG TCGTCACGGG CGATCCGTTG ACGGTGTGGC AACCCGGGCT CTGGCTGTCG
TTCCTGGCGG TAGGGGTGTT GATCGCCGCC TGGCAGGGCA GGCCGGTGCC GCGCGGCGCG
AGGGGATGGC TGCTCGCCCT GGTACGCTCG CAATGCCTGC TGGCGCCCTT CATGGCGGCG
GCCGTGCTGG TGGGGTTCGA GCGCTTGTCG CCGGCCGCGC CCTTGATCAA CCTGGTTGCC
GTGCCGCTGG TGGGAAGCCT GATGGTACCG CTGGGGCTGG CGGGGTGGGC CCTGGCCTGG
TCGCCGGGGC TGGCCATGTT GCCCTGGCGG GCCTTCGATG CCCTGGCCCA GCTCGTCTGG
CAAGGGCTCG CGTGGATGGG CGAAATCGTG CCATCGTGGT TGCCACCGGG CGAGGAAATT
CTGCCCCTGG CGTTGCTGCT GGCCGCCTGG GGGGGAGCAT GGCTGATGCC GGGGCTTGCG
CGGTCCGTTC GTGTGTGGGC CTCGCTGAGC CTGATCGCTC TGGCACTGAC CTGGCAGCCT
TCGACGATTC CGCCGGGGCG GGTCGTGGTC ATCGTCCACG ATGTGGGGCA GGGGCAACTG
GTCGAACTGC GCAGTGCCAC GCAACGCATG CTGTTCGATA CGGGGCCGCG TTACGGCTCG
GGGTTCGCCC CCGCCGCCAC GTTGTGGCCG CCGGGGCGTC GCTTCGACGA TGTCATCGTC
AGTCACAGCG ATCGCGACCA TGCCGGCGGC GTGGCCACGT TACGCGACAT GCATCATGTC
GGGCGCTGGT GGGGGCCGCC GAACATGGCG GTGGGCGTTG CCACGCACGC CTGCCGTCAA
GGCGTGGCCT GGCGGCGCGA CGAGGTGAGT TACCGGTTCT TGTCGCCGCG TTCCGGTGAC
TCGGCATTGA GCGACAATGA TCGCTCCTGC GTGCTGAGCG TGACGGCCGG CGGGCAGCGC
TTGCTGATCA TGGGAGATGC CGGCACCACC ATCGAGCGTC GTCTGCTGCG AGACATCGAG
CGCCCGCTGA CGGTGCTGAT CGCCGGCCAC CACGGCAGCC GTACGAGTTC ATCGCCGGCC
TTCGTCGCGC GGGTACATCC TCGACATGTG ATCTTCAGCG CCGGCCGCGA CAATGCCCAT
GGCCATCCTC ATCCCGAGGT GGTGCGCCGT TTCCGGCGCG CCGGGAGCTG TCTGTGGAAT
ACCGCCGTCG ACGGCGCGCT GCGTTTCACC CTGGGCGAGT CTCCGTTGCG CATGCGGCCC
GCGCGCCCGC CCGGCGGTGT CGAAGGGCCG TGCATTGGGG TAGAATCCGG CGATTGA
 
Protein sequence
MRIGWAMPLA LSALVGSVWG GWGVSSAAGP WGLLCLLAVA QARWRGLAML LVAGWCALNV 
AVQAAGELPD GLARQDVALE GRVTQVSRDR GLAKLRLAVE RCVPRAAGLP PCTRLSRVRL
SWYDAPALRV GDQWRLTARL RPPQGFANGY GFDYAAWLWR EGIQATGYVR DAASAQRLQA
APAGLRETGL AFLEARALSP LGKRWLAALT LGAGERLTQD DWDLLNATGT THLMVISGLH
VGLVTSVALW LLRFLARVVT PGRWRLAAWP WWLAGAAALG YAGLAGFEPP ALRATVMALL
GLWVASGRHA PGPWQAWWLA LLVVVTGDPL TVWQPGLWLS FLAVGVLIAA WQGRPVPRGA
RGWLLALVRS QCLLAPFMAA AVLVGFERLS PAAPLINLVA VPLVGSLMVP LGLAGWALAW
SPGLAMLPWR AFDALAQLVW QGLAWMGEIV PSWLPPGEEI LPLALLLAAW GGAWLMPGLA
RSVRVWASLS LIALALTWQP STIPPGRVVV IVHDVGQGQL VELRSATQRM LFDTGPRYGS
GFAPAATLWP PGRRFDDVIV SHSDRDHAGG VATLRDMHHV GRWWGPPNMA VGVATHACRQ
GVAWRRDEVS YRFLSPRSGD SALSDNDRSC VLSVTAGGQR LLIMGDAGTT IERRLLRDIE
RPLTVLIAGH HGSRTSSSPA FVARVHPRHV IFSAGRDNAH GHPHPEVVRR FRRAGSCLWN
TAVDGALRFT LGESPLRMRP ARPPGGVEGP CIGVESGD