Gene Francci3_0004 details


Gene Information

Locus tag: Francci3_0004
Symbol: recF
ID: 3902950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3698
End bp: 4897
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 637877333
Product: recombination protein F
Protein accession: YP_479127
Protein GI: 86738727
COG category: [L] Replication, recombination and repair
COG ID: [COG1195] Recombinational DNA repair ATPase (RecF pathway)
TIGRFAM ID: [TIGR00611] recF protein
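
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon.

```python
# Hypothetical helpers; names are illustrative, not part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(3698, 4897))  # 1200
print(protein_length(1200))     # 399
```

Under these assumptions, the start and end positions give the listed 1200 bp, and 400 codons minus the stop codon give the listed 399 aa.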


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000897119
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACCTCA CGCACCTGTC CCTCGTCGAC TTCCGTTCCT ACCCGGCCCT TGACCTGACT 
CTGGGTCCGG GAGTGGCCAC CTTCGTGGGT GGTAACGGCC AGGGGAAGAC CAACGTGATC
GAGGCGATCA GCTATGTCGC CACGTTGGCC AGCCACCGGG TCGCCGGGGA CGCCCCGCTG
GTCCGTGACG GCGCGTCTCG GGCGGTCATC CGCGCCAGGA TCGTCCGGGG TGACCGGGCC
GCGCTGGTGG AGATCGAGAT CGTTCCGGGA AAGGCGAACC GGGCCCGGTT GAACCGGGCT
CCCGTCGCCC GGCCGCGTGA CATCGTCGGT CTGCTGTGCA CCGTGCTGTT CGCCCCGGAG
GACCTGGCCC TGGTGAAGGG GGATCCGGCG CAGCGTCGTC AGTTCCTCGA CGAGCTGCTG
ATCGCCCGGA CCCCAAGGAT GGCCGCGGTC CTCGCCGACT ACGACCGGGT CCTCAAACAA
CGATCGACCC TGCTACGCAC CGCCGGGACG GCCCGGCGAG CTGGGGGGCA GGGAGATCTG
CGCACCCTCG ATGTCTGGGA CGGGTATCTG GCCGCGCATG GCGCCGAGGT GCTGGCCGCG
CGGTTGGCAC TCGTCGACGC GTTGCGGCCG GCCGTGGCCG CGGCGTACGA GGCCGTCGCC
GGCGCCGAGT CCGCTACCGC GCTGGACTAC CGTTCCAGCG TCACCCTCCC GGACATCTTG
CATGCGTCCG GTCCACCCGG TCCACCGGGC CAGCCAGAGC AGCCGGGCGC GGGTCGGCCG
GATCCGGCGG CACCGGATCG GACCATGCTG GCGGAGGCGA TCCGCGCCGA TCTGGAGGCC
GCCCGGCCAC GGGAGGTCGA ACGGGGGATG ACGCTGGTCG GTCCGCACCG CGACGATCTG
CTCTTGTCGA TCAACGGGCT CCCGGCCCGT GGCTACGCGA GTCACGGCGA GTCCTGGTCC
CTCGCCCTCG CGCTCAAGCT GGCCTCGTTC GACCTGCTGC GTGCCGATGA CCGCGAGCCG
GTCCTGCTCC TGGACGACGT CTTCGCCGAA TTGGACACGC GCCGCCGCGG TCGGCTCGCG
GAACTCGTCG CCTCCGCGGA GCAGGTGCTG GTCACAGCCG CGGTCGAAAC CGACGTTCCC
ACAGAGCTGA CCGGGGTGCG GTACGCCGTC GCCGGAGGAG AGGTCCAGCA TGCCCACTGA
 
Protein sequence
MHLTHLSLVD FRSYPALDLT LGPGVATFVG GNGQGKTNVI EAISYVATLA SHRVAGDAPL 
VRDGASRAVI RARIVRGDRA ALVEIEIVPG KANRARLNRA PVARPRDIVG LLCTVLFAPE
DLALVKGDPA QRRQFLDELL IARTPRMAAV LADYDRVLKQ RSTLLRTAGT ARRAGGQGDL
RTLDVWDGYL AAHGAEVLAA RLALVDALRP AVAAAYEAVA GAESATALDY RSSVTLPDIL
HASGPPGPPG QPEQPGAGRP DPAAPDRTML AEAIRADLEA ARPREVERGM TLVGPHRDDL
LLSINGLPAR GYASHGESWS LALALKLASF DLLRADDREP VLLLDDVFAE LDTRRRGRLA
ELVASAEQVL VTAAVETDVP TELTGVRYAV AGGEVQHAH
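
The protein sequence follows from the gene sequence under translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that translation, not the tool that produced this record. The function name `translate_cds` is hypothetical, and the codon table and start-codon set are written out here as assumptions based on the NCBI bacterial genetic code.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses these assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for table 11; each is read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence listed above.
print(translate_cds("GTGCACCTCA CGCACCTGTC C"))  # MHLTHLS
```

The fragment reproduces the first seven residues of the listed protein (MHLTHLS). The sketch raises a `KeyError` on ambiguous bases such as N, so it suits only fully resolved sequences like this one.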