Gene Acel_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0003 
SymbolrecF 
ID4484712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp3097 
End bp4212 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content65% 
IMG OID639728760 
Productrecombination protein F 
Protein accessionYP_871765 
Protein GI117927214 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCTGA GCCGTCTCGA ATTGACCGAC TTTCGGTCAT ATCGACGCGC CGCGCTTGAG 
CTCGACCCGG GAGTGAACGT CTTCGTCGGA TCCAACGGGC AGGGCAAGAC GAATCTCGTC
GAGGCCGTCT GCTACCTGGC CCTGTTGCGC AGCCACCGGA CGGCGACCGA CGCGCCGCTT
GTTCGGCAGG GCAGCGAACG CGCCGTCCTG CATGGGGAAG TCCTGACCTC CGGCCGCCGC
ATTGACCTGG ATGTGGAAAT CGTCCCCGGT CGGGCGAACA GACTCCGTGT CAATGGACAC
GCAACCCGCC GCGCCCGGGA CCTGGTCGGA ATTCTGCGGG TCGTCATCTT TGCCCCGGAG
GATCTCGCCT TAGTGAAAGG CGATCCCGCG GCGCGACGGG ACTATCTCGA TGATGTTCTC
GTGGAGTTGC GTCCGCGGCT GTTCGCCGTC CGTGCCGAGT ACGAGAAGGC CTTGCGGCAG
CGCAATGCAT TTCTTCGTGC CGTGGCGCAG GACGGCCAGC AGGTCGATCG CAACAGCCTC
GACGTGTGGA ACCTGCACTT TGCGCGGGCG GCCGCGGCAC TTCTCGACGC ACGACGACGG
CTGGTTCACG AACTGGCGCC TTTCGTCGAG AAGGCGTACG CCGCAATCTC AGGCGGCAGT
GGAGCTGTTC GCCTGGAATA CCGGAGCACT GTCCCGGAAG AAGTGTTGCA GGACGCCGAC
GAGGAGACAC GGATCGCCGG TATTCTCGCC GCACTCCGCA AGGTCCAGGA CGCCGAACTG
GCCCGCGGGC TCACCCTCGT CGGGCCGCAT CGCGATGATC TCAACTTGGA GCTGGATAGC
CGCCCGGCCC GGGGTTATGC CAGCCACGGT GAGTCGTGGT CGTACGCTCT CGCGCTGCGC
CTAGGGGCGT ACGAGCTGCT CCGCTCGGAT GGGGAGACAC CGGTGATGAT TCTGGACGAC
GTGTACGCCG AGCTGGATCA GCAGCGCAGG CGCAGGCTGA CCGGATGTGT CAGCGGAGCC
GAACAATTGC TCATCACGTC AGCGGTTGAC GAACCCGACC TTCCGGTCGG GCGCCGCTAC
GTCGTCCATG AGAGCCAGGT GCACGTTGCC GACTGA
 
Protein sequence
MYLSRLELTD FRSYRRAALE LDPGVNVFVG SNGQGKTNLV EAVCYLALLR SHRTATDAPL 
VRQGSERAVL HGEVLTSGRR IDLDVEIVPG RANRLRVNGH ATRRARDLVG ILRVVIFAPE
DLALVKGDPA ARRDYLDDVL VELRPRLFAV RAEYEKALRQ RNAFLRAVAQ DGQQVDRNSL
DVWNLHFARA AAALLDARRR LVHELAPFVE KAYAAISGGS GAVRLEYRST VPEEVLQDAD
EETRIAGILA ALRKVQDAEL ARGLTLVGPH RDDLNLELDS RPARGYASHG ESWSYALALR
LGAYELLRSD GETPVMILDD VYAELDQQRR RRLTGCVSGA EQLLITSAVD EPDLPVGRRY
VVHESQVHVA D