Gene Franean1_1753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1753 
Symbol 
ID5670155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2102199 
End bp2103950 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content77% 
IMG OID641240674 
ProductDNA repair protein RecN 
Protein accessionYP_001506097 
Protein GI158313589 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.391848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000930146 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTCGAGG AGATTCGCAT CCGTGGCCTC GGGGTCATCG ATGACGCCGT CCTCGACCTG 
GCGCCGGGGC TGACGGTCGT CAGCGGTGAG ACCGGCGCAG GCAAGACCAT GATTGTGCAG
GGGCTCGGCC TGCTGACCGG GGGCCGCGCC GACTACGCGC TGGTCTCCCC GCAGGCCGGG
CGCGCCTTCG TCGAGGCCCG GCTCGCCGTG CCGGCCGACT CCGGGCTCGC CAAGCGTGTC
CGCGAGCTCG ATGGCGACGT CGACGAGGAC GTCATCATCA TGGGCCGCAC GCTGAGCGCC
GAGGGCCGCT CGCGGGCCCA GCTCGCGGGG CGTTCGGTGC CCGCGAGTGT CCTCGCGGAG
ATCACCGAGA ACGTGATCGC TGTGCACGGG CAGTCCGAGG CGCAGCGGCT GCGCCGTCCC
GCGACCCAGC GCGACGCGCT CGACCGGTTC GCCGGCGCCT CCGTCGCCGA GCCGCTCGTC
CGGTACGGGC AGGTGTACGC GCGGCTGGCG AAGGTCCGCG CCCGGCTGGC CGACATCACC
GGGCGGGCGC GGGAGCGCGC GCAGGAGGCG GAGCTGCTCC GGCTCGGCCT GGCCGACGTC
GAACGGGTCG CGCCGCTGCC CGGCGAGGAT GTCACGCTCG ACGCGGAGCT GCGCCGGCTG
GAGCACTCCG AGACCCTGCT GCGGGTGGCG CGCTCGGCGC ACGGCGCGCT GGTCAGCGAC
CCCGCGACCG GCGAGGACGG CCCGGCCGCC ACCGACCTGG TCGGCGCCGC CCGCCGGATC
GTCGCCGGCG ACGCCTCGCT CGACCCGCAG CTCGCCGAGC TGGCGAACCG GTTGACGGAG
CTGTCGAGCC TGCTGACCGA CGTGGCCGCC GACCTGGCCT CCTACGCCGC CGGGGTGGAG
TCCGACCCGG AGCGGCTCGC TCAGGCGCAG GAACGCAAGG CGGCGCTGAC CGCACTGGCC
CGCGCGCACG GCACGGACGT CGACGGCGTG CTGCGGTGGG CCGACACCGG CGGGCGTCGC
CTGCTCGAGC TCGACGCCGA CGGCGACCAG ACGGGCGCGC TGGCCGCCGA ACGCGACGAG
CTGACCGCGG AGCTCGCCGG CCTGGCCGAG CGGATCAGCG CGGCGCGCTC CGCCGCCGCC
GAGCGGTTCG GCGCGGCGGT CGCCGCCGAA CTGTCCGGCC TCGCCATGCC GCGGGCCCGG
GTGGAGGCGG CGGTCGGGCA CCGCGACGAC CCGAACGGCC TGCCTGTCGG CGGCCGGACG
CTGGCCTACG GGCCGAGCGG GATCGACGAC GTCGAGCTTC GCCTGGTTCC GCACCCGGGC
GCGCCGGCCC GGCCGGTCGA GAAGGGCGCC TCGGGCGGCG AGCTCTCCCG GGTGATGCTG
GCGATCGAGG TGGTGCTCGC GGCGGCCGAC GCCGGGGCGA CGATGGTCTT CGACGAGGTT
GACGCCGGGG TGGGCGGGCG AGCCGCGGTC GAGATCGGCC GCCGCCTCGC CCGGCTCGCC
CGCACCCACC AGGTCATCTG CATCACCCAT CTGCCGCAGG TCGCCGCGTT CGCCGACCGG
CACCTGGTCG TGCGCAAGGC CGACGACGGT TCGGTGACCC GCAGTGGCGT GGTGGCGCTC
GACGGCCCCG GGCGGGTCCG GGAGCTGTCG CGGATGCTGG CCGGCCAGGA GGAGAGCTCG
TTGGCCCGCG GCCACGCCGA GGAGCTGCTC GCCGCCGCCG CCGCCGACAA GGCCGCCCTC
GCCCAGCCGT GA
 
Protein sequence
MLEEIRIRGL GVIDDAVLDL APGLTVVSGE TGAGKTMIVQ GLGLLTGGRA DYALVSPQAG 
RAFVEARLAV PADSGLAKRV RELDGDVDED VIIMGRTLSA EGRSRAQLAG RSVPASVLAE
ITENVIAVHG QSEAQRLRRP ATQRDALDRF AGASVAEPLV RYGQVYARLA KVRARLADIT
GRARERAQEA ELLRLGLADV ERVAPLPGED VTLDAELRRL EHSETLLRVA RSAHGALVSD
PATGEDGPAA TDLVGAARRI VAGDASLDPQ LAELANRLTE LSSLLTDVAA DLASYAAGVE
SDPERLAQAQ ERKAALTALA RAHGTDVDGV LRWADTGGRR LLELDADGDQ TGALAAERDE
LTAELAGLAE RISAARSAAA ERFGAAVAAE LSGLAMPRAR VEAAVGHRDD PNGLPVGGRT
LAYGPSGIDD VELRLVPHPG APARPVEKGA SGGELSRVML AIEVVLAAAD AGATMVFDEV
DAGVGGRAAV EIGRRLARLA RTHQVICITH LPQVAAFADR HLVVRKADDG SVTRSGVVAL
DGPGRVRELS RMLAGQEESS LARGHAEELL AAAAADKAAL AQP