Gene Sare_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0004 
SymbolrecF 
ID5707577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4766 
End bp5896 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID641269527 
Productrecombination protein F 
Protein accessionYP_001534931 
Protein GI159035678 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.366352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000175389 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGTACGTAC GCCGGCTCGA ACTCGTCGAT TTCCGCTCGT ACGAGCGGGT CGGCGTGGAC 
CTCGAACCGG GGGCGAACGT CCTGGTCGGC CACAACGGGG TCGGCAAGAC CAACCTGATC
GAGGCGCTCG GCTACGTGGC GACCCTGGAC TCCCACCGGG TCGCCACCGA CGCCCCGCTG
GTCCGGATGG GCGCCGGTGC GGCGGTCATC CGCTGCGCGG TGGTGCACGA GGGCCGCGAG
TTGCTGATCG AGTTGGAGAT TGTCCCGGGG CGGGCCAACC GGGCCCGGCT CGGTCGGTCC
CCGGCCCGGC GAGCCCGGGA CGTGCTCGGT GCCCTGCGGC TGGTGCTCTT CGCCCCGGAG
GACCTGGAAC TGGTCCGGGG CGACCCGGCC GAGCGCCGCC GCTACCTTGA CGACCTGCTG
GTGCTCCGAC AGCCTCGCTA CGCCGGTGTG CGGGCCGACT ACGAACGGGT GGTCCGGCAG
CGCAACGCCC TGCTGCGCAC CGCGTACCTG GCCAGGAAGA CCGGCGGCAC CCGCGGTGGG
GACCTGTCCA CGCTCGCGGT GTGGGACGAC CACCTCGCGC GGCACGGCGC GGAACTGCTC
GCCGGTCGAC TCGACCTCGT TGCCGCGCTC GCCCCTCATG TGACCAGGGC ATACGACGCG
GTGGCCGCCG GCACGGGCGC CGCCGGAATC GCGTATCGAC CCTCGGTTGA GCTGCCCACC
CCGACCACCG ACCGGGCTGA CCTGACCGCG GCGTTGAGCG CCGCGCTCGC CGCCGGCCGG
TCCGCTGAGA TCGAGCGGGG AACCACCCTG GTCGGCCCGC ACCGGGACGA CCTCACCCTG
ACGCTGGGGC CACTGCCCGC GAAGGGGTAC GCCAGCCACG GCGAGTCCTG GTCCTTGGCG
CTGGCACTCC GGCTGGCCGG ATACGACCTG CTGCGGGTCG ACGGAATCGA ACCGGTGCTG
GTGCTGGATG ACGTCTTCGC CGAGTTGGAC ACGGGCCGTC GGGATCGGCT CGCGCAACTG
GTCGGCGACG CGAGTCAACT CCTGGTGACG TGCGCGGTGG AGGAGGATGT TCCCGCGCGT
CTGCGGGGTG CGCGGTTCGT TGTCCGCGGT GGGGAGGTGC ACCGTGCCTG A
 
Protein sequence
MYVRRLELVD FRSYERVGVD LEPGANVLVG HNGVGKTNLI EALGYVATLD SHRVATDAPL 
VRMGAGAAVI RCAVVHEGRE LLIELEIVPG RANRARLGRS PARRARDVLG ALRLVLFAPE
DLELVRGDPA ERRRYLDDLL VLRQPRYAGV RADYERVVRQ RNALLRTAYL ARKTGGTRGG
DLSTLAVWDD HLARHGAELL AGRLDLVAAL APHVTRAYDA VAAGTGAAGI AYRPSVELPT
PTTDRADLTA ALSAALAAGR SAEIERGTTL VGPHRDDLTL TLGPLPAKGY ASHGESWSLA
LALRLAGYDL LRVDGIEPVL VLDDVFAELD TGRRDRLAQL VGDASQLLVT CAVEEDVPAR
LRGARFVVRG GEVHRA