Gene Sare_3847 details


Gene Information

Locus tag: Sare_3847
Symbol:
ID: 5707925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4378924
End bp: 4381344
Gene Length: 2421 bp
Protein Length: 806 aa
Translation table: 11
GC content: 74%
IMG OID: 641273269
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_001538631
Protein GI: 159039378
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
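
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence in this record ends in TAG). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4378924
end_bp = 4381344

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2421 806
```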


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00472645
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCGACG GGCAGCCGAT GCGTTCCGGT CGCCGGGCAG CCCCGCCGCG CAGGGGTGGC 
GCACCTGACC CACCGGATTT GCGGCTGGCC GGCTTGGCCG TTGCCGCGTG GCTCGCCGCG
TTGGCCGGGC TGCATTTGCC CGCCAGTTCC TCTTTGCTTG TTGCCGCGAT CGCCGCCGGG
CTGGCTGGAC TGGGCGGGCT GTACCTGCTT GGACTGCTGG GTCGTCCACT CGCATCTGTC
CGTCCGTACG GCTGGACGGC CATCGCCATC TTGCTTGGCG TGGTCTGCGG GGCGAGCGTC
ACCGCAGCTC GGGTGACCGT GCGGGATGCC ACACCGGTGC GCGCCCTGGT GGAGGCGCGG
GCCACCGTCG CCGCCGACCT GGTTGTCCGA GACGACCCCC GGTTGGTGCG CACCGCTTCT
GGGAGACCCG CCATGTTCCT GGTGGCAACG GAGTCGACCC GAGTCACCGG GCCCGGCGGG
CGTCGGGTCG AGGCGCGGGC CCGGATGCTG GTCCTCGCCA CCGACCCAGC CTGGCGGTAC
CTGCTGCCGG GGCAGCGACT GACCGCCGAG GGGCGGCTCG CCGCTCCGCG GGGCGGCGAC
CTCACCGCCG CGGTCCTCTG GTCGACCCGG GCCCCGGTAC CCCACGGGCC GCCACCGGGC
TTTCAGCACG CCGCCGGCAC GCTCCGCGCC GGGCTTCAGG AAGCCTGCGA ACCACTACCG
GACGAGCAGG GCGGCCTGCT ACCCGGTCTG GTGGTGGGCG ATACGAGTCG GTTGCCCGAT
GCGGTGCGGG AGGATTTCCT CGCCACGGGC ATGACCCACC TGACGGCGGT CTCCGGATCC
AACGTCGCGA TCATCGTGGG CGCCGTGCTG CTTCTCGCCC GCTGGGGGCG GGCCGGTCCC
TGGCTCGCCG CCGGGCTCAG TGTGGTCGCA CTGGCAGGAT TCGTGATCTT GGTTCGTCCG
TCGCCGAGCG TCGTGCGGGC GGCCACCATG GGAGCGATCG GGCTCGCCGC GCTCGCCGTC
GGACGGCCGC GTGCGGCGTT GCCGGCCCTG GCCGCGGCGG TCACCGCCCT CGTGCTGTTC
GATCCCGAGC TTGCCGGGGA CGTCGGCTTC GCCCTTTCCG TCCTCGCCAC CGGCGGGTTG
CTGCTGCTCG CCCCGCGCTG GCGGGACGCG TTGCGGCGCC GCCGGGTGCC TGCGGGGGTC
GCCGAGGCAC TTGCCGTGCC CGCCGCCGCG CAACTCGCCT GCGCGCCGGT CGTCGCGGGG
ATCTCGGGCA CGGTCAGCCT GGTCGCGGTC CCGGCGAACC TGTTGGCGGT GCCAGCGATC
GCGCCCGCAA CGGTGCTCGG CGTCGTGGCG GCGGCGCTTT CGCCCCTCTG GCCGGCGGGC
GCTGGATTTC TGGCCTGGCT GGCCAGTTGG CCGGCATGGT GGCTGGTCGC CGTGGCGCAT
CACGGGGCAC GGGTGCCGGC GGGCGCACTA CCCTGGCCGG ACGGCGTCGC TGGCGCGCTG
TTGCTGACCG GGTTGACTCT GGCCCTGCTG GTGGCTGCCC GCCGCCGAGT GGTGCGCCGA
CTTGTGGCGG TGACCGCCGT GGCGGCCGTG CTCGGCGCGT TGCCGGTGCG GCTGGTGGCC
TCCGGCTGGC CACCGGTGGG TTGGGTGGCC GTGGCATGCG CGGTCGGTCA GGGCGATGCG
ATTGTCCTGT CCGCCGGTCC GGGGCGGGCC GTGGTGGTGG ACGCCGGGCC GGAGCCGGGG
GTTGTGGACC GCTGCCTGCG TCGAATCGGT GTCCGGGAGG TGCCGCTGCT GATAGTCAGC
CACTTCCATC ACGACCACGT TGGTGGGGTG GCGGGCGTGT TCCGGGGGCG GCGGGTCACG
ACCGTGCTCG CTCCGCCGTG GCCGGAGCCG GAGCACGGTC GTGATCTGGT CCGTGTCACG
GCCGCGGCGG GCTCCGCCGA TGTGATCTCC GCCCCGGCCG GCTGGGGCTA CCGAACCGGT
GGAGTGGAGC TGACCGTCAT CGGCCCACCA ACTCCGCTGC GGGGTACCCG CTCCGACCCG
AACAACAACT CGCTCGTCCT GCTGGCCACG GTCAGCGGGG TGCGGATCCT GCTCACCGGT
GACGCCGAGG CCGAGGAACA GCGCGCCCTG CTCGACCGCC CACCGGCCGG CGGGCTCCGC
GTGCACGTGC TGAAGGTCGC CCACCACGGC TCGGCATACC AGGACTCCGC CTTCCTCGAC
GTGGTCCGCC CGCTGGTCGC GGTCGTCCCG GTTGGCCGAG ACAACACCTA CGGGCACCCG
GCTGCGTCTG TGCTCGGTCG CCTTGCCCGC GGTGGGGCTC GCGTTCTGCG AACCGACGTC
GATGGGGACG TGGCTGTGGT GACCCGGCCG TCCGGTCTGG CCGTCGTCAC GCGGGGGCCT
GAGAGCCCGA GCGATCGTTA G
 
Protein sequence
MSDGQPMRSG RRAAPPRRGG APDPPDLRLA GLAVAAWLAA LAGLHLPASS SLLVAAIAAG 
LAGLGGLYLL GLLGRPLASV RPYGWTAIAI LLGVVCGASV TAARVTVRDA TPVRALVEAR
ATVAADLVVR DDPRLVRTAS GRPAMFLVAT ESTRVTGPGG RRVEARARML VLATDPAWRY
LLPGQRLTAE GRLAAPRGGD LTAAVLWSTR APVPHGPPPG FQHAAGTLRA GLQEACEPLP
DEQGGLLPGL VVGDTSRLPD AVREDFLATG MTHLTAVSGS NVAIIVGAVL LLARWGRAGP
WLAAGLSVVA LAGFVILVRP SPSVVRAATM GAIGLAALAV GRPRAALPAL AAAVTALVLF
DPELAGDVGF ALSVLATGGL LLLAPRWRDA LRRRRVPAGV AEALAVPAAA QLACAPVVAG
ISGTVSLVAV PANLLAVPAI APATVLGVVA AALSPLWPAG AGFLAWLASW PAWWLVAVAH
HGARVPAGAL PWPDGVAGAL LLTGLTLALL VAARRRVVRR LVAVTAVAAV LGALPVRLVA
SGWPPVGWVA VACAVGQGDA IVLSAGPGRA VVVDAGPEPG VVDRCLRRIG VREVPLLIVS
HFHHDHVGGV AGVFRGRRVT TVLAPPWPEP EHGRDLVRVT AAAGSADVIS APAGWGYRTG
GVELTVIGPP TPLRGTRSDP NNNSLVLLAT VSGVRILLTG DAEAEEQRAL LDRPPAGGLR
VHVLKVAHHG SAYQDSAFLD VVRPLVAVVP VGRDNTYGHP AASVLGRLAR GGARVLRTDV
DGDVAVVTRP SGLAVVTRGP ESPSDR