Gene Francci3_3158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3158 
Symbol 
ID3903955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3736999 
End bp3738753 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content75% 
IMG OID637880479 
ProductDNA repair protein RecN 
Protein accessionYP_482244 
Protein GI86741844 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0998846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.788962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGAGG AGATTCGCAT CCGCGGCCTC GGGGTCATCG ACGATGCCGC CCTCGACCTC 
GCTCCCGGCC TCACCGTGGT GAGTGGCGAG ACCGGAGCCG GCAAGACCAT GATTGTCCAG
GGGCTGGGAC TGCTCACCGG CGGGCGGGCG GATTACGGTC TGGTCCGGCC CGGGGTCGAT
CGCGCCTTTG TCGAGGGTCG GCTCGTCATC GGTGCCGAGT CGGCTGTCGC GGCCCGGGTC
CGCGAAGTCG GCGGCGACCT CGACGAGGAT CCGGGGGGCG CCGTCCTCGT CGTCGGTCGG
ACCCTGACCG CGGAGGGCCG GTCCCGGGCG CAGGTCGCGG GCCGCTCGGT TCCGGCGAGT
GTCCTCGCCG AGATCGCCGA GGAGCTGATC GCCGTGCACG GCCAGTCCGA GGCGCAGCGC
CTGCTCAAGC CGTCCACCCA GCGTGACGCG TTGGACCGGT TCGCCGGTTC CGCCGTCGCC
GGGCCGTTGG CCCGCTACGG CGGTGTGTAC CGGGAGCTGA CCCGGGTCAG TCGGCAGCTC
GCCGAGATCA CCGACCGGGT GCGTGAACGC GAGCAGGAGG CGGAGCTGCT GCGCATCGGC
CTGGACGAGG TGGAACGGAT CGCCCCCAGC CCCGGTGAGG ACGTCACCCT CGACGCCGAG
CTGACCAAGC TGGAGCACGC GGAGACCCTG GTGCGGGCGG CGCGGACCGC GCACGCCGCG
CTGATGAGCG ATCCGGCCAC CGGGTCGGAC GAGCCCGGTG CGGTCGATCT GGTGGCCGCG
GCCCAGCGGG TCATCGCCGG CGAGTCCGCC CTTGACGCCG AGCTCGCTGC CCTCGGCACG
CGGCTCACGG AGGTGGGGAT GCTGCTCACC GACGTCGCCG CGGACCTCGC CTCCTACGCG
GAGGGCATCG ATGCCGACCC GGTGCGCCTC GCCGACGCCC AGGCGCGCAA GGCCGCCCTC
ACCAGCCTGA CCCGGGCGCA CGGCACCGGC ATTGACGGCG TGCTGGCCTG GGCGGACCAG
GCCGGGCGCC GGTTGCTGGA ACTCGACGGC GCCGGGGACA GCGTCGAGGC GCTCACCGCC
CGGCGGGACA GCCTGACCGC CGAGCTGGCC CGGCTCGCCG AGGAGGTCAG CGAGGCCCGG
ACCAAGGCGG CGGCCCGGTT CGGTGCGGCG GTCGCCGCCG AGCTCGCCGG GCTCGCGATG
CCCCGGGCCC GGGTGGAGGC CGCCGTGTCG CAGCGCGACG ATCCAGCCGG TCTGCCGGTC
GGCCTGCGGG TCGTCGCGTT CGGACCGTTC GGCGTCGACG ACGTGGAGCT GCGGCTGATA
CCGCATCCCG GGGCGCCGCC GCGGCCGGTC CAGAAAGGCG CGTCGGGCGG TGAGCTGTCC
CGGGTCATGC TCGCGATCGA GGTCGTCCTC GCCGCCGCCG ACACCGGTTC GACCATGGTC
TTCGACGAAG TGGACGCCGG GGTCGGCGGC CGGGCCGCGG TGGAGATCGG CCGGCGCCTC
GCCCGCCTCG CCCGCACCCA TCAGGTGATC TGTATCACTC ATCTGCCGCA GGTCGCCGCC
TTCGCGGACC GTCATCTGGT GGTCCACAAG GCCGACGACG GATCGGTCAC CCGTAGCGGG
ATCGTCACGC TCGACGACGC CGGCCGGGTG CGGGAGCTCT CCCGGATGCT CGCCGGCCAG
GAGGAGAGCC CGCTGGCCCG TGGGCACGCC GAGGAGCTGC TCGCCGCCGC GGAGGCCGAC
AAGGCGCTGC CGTAG
 
Protein sequence
MLEEIRIRGL GVIDDAALDL APGLTVVSGE TGAGKTMIVQ GLGLLTGGRA DYGLVRPGVD 
RAFVEGRLVI GAESAVAARV REVGGDLDED PGGAVLVVGR TLTAEGRSRA QVAGRSVPAS
VLAEIAEELI AVHGQSEAQR LLKPSTQRDA LDRFAGSAVA GPLARYGGVY RELTRVSRQL
AEITDRVRER EQEAELLRIG LDEVERIAPS PGEDVTLDAE LTKLEHAETL VRAARTAHAA
LMSDPATGSD EPGAVDLVAA AQRVIAGESA LDAELAALGT RLTEVGMLLT DVAADLASYA
EGIDADPVRL ADAQARKAAL TSLTRAHGTG IDGVLAWADQ AGRRLLELDG AGDSVEALTA
RRDSLTAELA RLAEEVSEAR TKAAARFGAA VAAELAGLAM PRARVEAAVS QRDDPAGLPV
GLRVVAFGPF GVDDVELRLI PHPGAPPRPV QKGASGGELS RVMLAIEVVL AAADTGSTMV
FDEVDAGVGG RAAVEIGRRL ARLARTHQVI CITHLPQVAA FADRHLVVHK ADDGSVTRSG
IVTLDDAGRV RELSRMLAGQ EESPLARGHA EELLAAAEAD KALP