Gene Franean1_0004 details

Gene Information

Locus tag: Franean1_0004
Symbol:
ID: 5668431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5118
End bp: 6491
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 73%
IMG OID: 641238932
Product: DNA replication and repair protein RecF
Protein accession: YP_001504379
Protein GI: 158311871
COG category: [L] Replication, recombination and repair
COG ID: [COG1195] Recombinational DNA repair ATPase (RecF pathway)
TIGRFAM ID: [TIGR00611] recF protein
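
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed in this record, but neither is stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5118
end_bp = 6491

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1374, matches "Gene Length"
print(protein_length)  # 457, matches "Protein Length"
```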


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.470302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATCTCA CCCACCTGTC CCTGACCGAC TTCCGCTCGT ACGCCCGGCT GGACCTCGTC 
CTCGAGCCAG GTGTCACGAC GTTCGTGGGT TCGAACGGTC AGGGCAAGAC GAACCTGATC
GAAGCGATCG GATTCGTCGC CACGTTGGGT AGCCATCGTG TCGCGAACGA TGCTCCGCTC
GTCCGCGAGG GGTGCGGGCA GGCTGTTGTT CGGGCCCGGA TCGTGCGCGG TGACCGGGCC
GCGCTGGTCG AGATGCAGAT CGTGCCCGGG AAGGCGAACA GGGTCCGCCT GAACCGTGCC
CCAGTGGCCC GGGCCCGCGA CGTGGCGGGA CTGCTCGCCA CGGTCCTCTT CGCCCCGGAG
GACCTCGCGC TGGTGAAGGG CGACCCGGCC GAGCGGCGGC GGTTCCTCGA CGACCTGCTG
GTCGCGCGGG CGCCGCGGAT GGCGGCGGTG CAGTCCGACT ACGACCGGGT GCTCAAGCAG
CGCTCGGCGC TCCTGCGGTC GGCGGGAGCC GCCCGCCGTG CCGGTGGCCG GGGCGACCTG
CGCACGCTCG ACGTCTGGGA CAGGCATCTC GCCGACCACG GCGCGGAGCT GCTCGCGGCC
CGCCTGGCGC TGGTCGAGGA GTTGCGGCCG CGGGTCGAGA GCGCCTACGC CGCGGTCGCG
GGCCAGGACG CGCCCACCGG CATCGAGTAC CGCTCGACCG TGACGCTCGA TTCGTCACCC
GATCGGGCCT TGTCGCCAGG CCGGGCTGGC CTGGGCGAGC CGGACGGGGA CGCCGGCCGG
AACGGAAACG GCGCCAGCCC GAATCACCCG AGTGGCTCGG GTGACCCCGC GAGCGGGAAC
GGCGGTTCGG GCGACACCGA GGTTGGCGAC ACCGAGGCTG GCCGCACGGG TGATGGCGCG
GGTGACGGTC GCACGGAAGC CGACCGCACA GAGGTTGGCC ATGTGGGAGG TGCCGTCGGC
GGCACGGCGA CTCGGGCGGC GCTCGAAGAG GCGATCCTCG CCGGTCTCGC GGCGGTGCGC
ACCCAGGAGA TCGAGCGCGG TGTGACGCTG GTCGGTCCAC ATCGTGACGA TCTGCTGTTG
TCAGTGAACG GGCGGCCGGC GCGTGGCTAT GCCAGCCACG GCGAGTCCTG GTCGCTGGCG
CTCGCGCTGC GGCTGGCCTC GTTCGAACTG CTCCGCGCGG ACGACCGGGA ACCGGTGCTG
CTGCTCGACG ACGTGTTCGC GGAGCTGGAC ACCCGCCGGC GCGCGCGGCT CGCCGCCCTG
GTCGCTGACG CCGAGCAGGT ACTCGTGACC GCGGCGGTCG ACGCGGATGT GCCGGCGGAG
CTTGCCGGTG TGCGCTTCGA GGTGGTCTCC GGGGAGGTGT GCCGTGCCGG CTGA
 
Protein sequence
MHLTHLSLTD FRSYARLDLV LEPGVTTFVG SNGQGKTNLI EAIGFVATLG SHRVANDAPL 
VREGCGQAVV RARIVRGDRA ALVEMQIVPG KANRVRLNRA PVARARDVAG LLATVLFAPE
DLALVKGDPA ERRRFLDDLL VARAPRMAAV QSDYDRVLKQ RSALLRSAGA ARRAGGRGDL
RTLDVWDRHL ADHGAELLAA RLALVEELRP RVESAYAAVA GQDAPTGIEY RSTVTLDSSP
DRALSPGRAG LGEPDGDAGR NGNGASPNHP SGSGDPASGN GGSGDTEVGD TEAGRTGDGA
GDGRTEADRT EVGHVGGAVG GTATRAALEE AILAGLAAVR TQEIERGVTL VGPHRDDLLL
SVNGRPARGY ASHGESWSLA LALRLASFEL LRADDREPVL LLDDVFAELD TRRRARLAAL
VADAEQVLVT AAVDADVPAE LAGVRFEVVS GEVCRAG
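
The gene sequence begins with GTG, while the protein begins with M. Under translation table 11 (the bacterial code), GTG can serve as an initiation codon and is then read as methionine, although it encodes valine inside a reading frame. The following minimal sketch illustrates this on the first and last lines of the gene sequence above. The function name `translate`, the `is_cds_start` flag, and the `START_CODONS` set (a subset of the initiation codons listed for table 11) are choices made for this example and are not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only a subset of the table 11 initiation codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=False):
    """Translate a DNA string codon by codon; spaces are ignored.

    If is_cds_start is True and the first codon is a recognised
    initiation codon, it is translated as M.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_60 = "GTGCATCTCA CCCACCTGTC CCTGACCGAC TTCCGCTCGT ACGCCCGGCT GGACCTCGTC"
last_54 = "CTTGCCGGTG TGCGCTTCGA GGTGGTCTCC GGGGAGGTGT GCCGTGCCGG CTGA"

print(translate(first_60, is_cds_start=True))  # MHLTHLSLTDFRSYARLDLV
print(translate(last_54))                      # LAGVRFEVVSGEVCRAG*
```

The first output matches the first 20 residues of the protein sequence, and the second matches its last 17 residues followed by the stop codon (shown as `*`). The last line can be translated on its own because the 1320 bases before it are a whole number of codons.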