Gene Francci3_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3800 
Symbol 
ID3905548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4555907 
End bp4559218 
Gene Length3312 bp 
Protein Length1103 aa 
Translation table11 
GC content77% 
IMG OID637881126 
ProductUvrD/REP helicase 
Protein accessionYP_482879 
Protein GI86742479 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0835078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGTCC TTCCTGCCCC CGCCCCGCGT CCCGGCTACC GGCTGATCGC AGCCCCGGTC 
GCGCCGCCGC GTCCGCTCGT CCTGGATGCG GCGCAGCGCG CGGTGGTCGA GCACGGCGGC
GGGCCGCTGC TGGTGCTCGC CGGGCCCGGC ACCGGGAAGA CGGCCACGCT CGTCGAGGCG
GTCGCCGCGC GGATCGAAGC CGGGGCGGAT CCGCGGTCGA TCCTGGTCCT CACCTTCAGC
CGGCGAGCCG CCGGAGAGCT GCGGGAGCGG ATCACCGCAC GGCTCGGTGC CGAGGGCGGG
GCCGGCGGAG GTCCCGGGGC GTGGACGTTC CACGCCTGGT GCCTGGCGCT GCTGCGCGCG
CACGAGCGGC CGGCCCCGCC GGGCGGGCTC CGGCTGCTGT CCGGCCCGGA GCAGGACAGT
CGGCTGCGCG ATCTCATCGA GGGTTCCCGC GAGGACGGCC GGCCGGTCTG GCCCGAGCCG
CTCGTGGGTT GCCTGCGCAC CCGCGGTTTC ACCGAGGAGG TGCGGGCGCT GCTGGCCCGC
GCCCGCGAGG TCGGGCTCGA ACCGGTCGCG CTGGCGCAGC TCGCTCGGCG TACCGGCCGC
CCGGACTGGG CCGCCCTCGC CGAGTTCTAC GAGATGTACC TCGACGTGTT CGGATTCGAG
GGGGCGGTCG ACTACACCGA TCTCGTCCAC CGGGCCGTGG TCGTCGCCGA GAGTGCCGAG
GGGGGCGCCT GGCTGCGTGG GCGCTACCGG CACGTCTTCG TCGACGAGTA CCAGGACACC
GATCCGGCTC AGGAACGCCT GCTGGAGGCG GTGGCCGGTG GTGGTGGCAA CCTCGTCGTG
CTGGGCGACC CGGACCAGTC GATCTATGCC TTCCGGGGGG CCGAGGTCGC CGGCCTGCTG
GGCTTCCCGG CCCGGTTCCC CCGCCTCGAC GGCGAGCCGG CGCCGATCGT CGCGCTGCGG
CGCTGTCGGC GGATGGCCCC GGCCCCGCTG GCGGCGAGCC GGCACGTGGC CCGGCGGATC
CCGGCCGCCG GGCTTCCGGT GGCCGCGATC CGGGCACACC GCGACCTCGT CGGGCGCGCC
GACGCCGGGG CGGGCCAGGT GCAGGCCCGC ACGTTTCCGG GCACCGGCGC GGAGGCCGAG
TCGGTGGCCG ACCTGCTCCG TCGCGAACAT CTGGAGAACG GCGTCGCCTG GGACGCGATG
GCGGTCCTGG TCCGCACGGC CGAGCGGATC GGCCGGTTGC GCCGCGTGCT GGCCGCCGCC
GGCGTTCCGG TCAGCGCCGA CGGTGACGAC CTGCCGGTCG CCCAGGAGCC GGCCGCCGCC
CTGCTCCTGC TGGCGCTGCG CTGCGCGGAG GATCCGGCCG GGGCGCTCAC CGTCGACGCG
GCGCGCACCC TGCTCACCTC GCCGCTCGGT GGAGCGGACC CGGCCGGGCT GCGGGCGCTG
GGCCGGGCTC TGCGCACCCT CGAGCGCGAC GCGGGGTCCG AGCATCCGGC GCCGTCGGCG
GAGCTGTTGC GGGCGGCGGT CGCCGAGCCC GACCACTGGC TGGCGACGAT CCCCGACGAC
CTGGCCGGTC CGGTCCGGCG GGTCGGCGGT CTGCTGCGGA CGGCCGGTCA GGCGTTGCGG
GACTCCTCGG GCGCCCCGCA GGACGCTCTG TGGGCGCTGT GGTCGGCGAG CGAGTGGCCG
GCGCGGCTGC GACATGCCTC GGCGGCCGGG GGCGCCGCCG GGCGCGCGGC CGACCGTGAC
CTCGACGCCG TCGTGGCGCT CTTCGATGCC GTCACCCGGC TGGGCCAGCG GCGTGGGCCG
GGCGGTGGCG TCGCGTCCCT GGTCGCGGAG CTCACCCGCC AGCAGATCGC CGGGGACGTC
CTCAAGCCGG CCGGGGAGTC CGTGCGCCGC CGCGGGGTGC GGCTGCTGAC CGCCCATCGC
GCCAAGGGGC TCGAATGGGA GGTCGTCGTC GTCTGCGGAG TGCAGGACGG CACCTGGCCC
GACCTGCGGG AGCGGCATTC GCTGCTCGGT GCGGAGCAGC TCGACGCGCC GTCCCGGGGC
GGGTTGCGCC CGCCGCTGAC CCGCCAGGAC CTGCTCGCCG ATGAACGTCG GCTGTTCTAC
GTCGCCCTGA CCAGGGCGCG GCGCCGGCTG GTCGTCACCG CGGTGAACAG CCCCGAGGAC
GACGGCAGCC TGCCGAGCCG CTTCCTTGAG GAACTGGGTG TAGCGGTGGA GCACGTGCCC
GGTCGCCCAG CCCGGCCGCT CACCCTGGTC GGCCTGGTCG CGACCCTGCG CCGGCTGGCG
ACGGAACCGG ACTCAAGCCC GGTGATGCGC TCCGCCGCGC AGGCCCGCCT CGCCGCGCTC
GCCGCCGCAC GGGACCAGGC GGGCCGGCCC CTGGTGCCGG CCGCGCACCC GGACACCTGG
TGGGGGCTGC TCGACCCCAC CACGTCGGAT GTTCCTGTTG TCCCGGTGGC CGGGCCGATC
CGGCTGTCCG GCTCGTCGCT GTCGAGCATC GGCGCCTGTT CCCTGCGGTG GTTCCTAGAG
CACGAGGCGT ACGCGGTCAC CCCGGCCTCG ACCGCGCAGG GATTCGGCAA GGTGGTGCAC
GCCCTGGCGG ACGAGGTCAC GACCGGGCGG ACCCCGGCCG ACCTCGCCGC GCTCGACGCC
CGGCTGGACA CGGTGTGGCG GCAGTTGGAC TTCGACGCCC GCTGGCGCTC CGATCAGGAG
CGCGCGGCCG CCCGCGAGGC GCTCGCCCGC TTCCTGGACT GGCACGCCGC CGAGCGCGGT
CGGCGCGTGA TCGACGCGGA GGTCAGGTTC TCCTGCGACC TACGGGTCGC CGGGCGGGAC
GTCCAGCTGC GCGGATTCAT CGACCGGCTC GAACTGGACG AGGCCGGCCG GGTCCACGTC
ATAGACTTCA AGACCGGTCG GACCGCGGTC GCTCCCGCCG AGCTCGCCAC CCACCCCCAG
TTGGGCAGCT ATCAGCTCGC CGTGCGGGCC GGCGCGCTCG ACGATGTGCT CGCGGCCGCG
GCGCCGGATC AGCCCGGGGC GGATCAGCCC CGGGCGGTGC CCGGCGGGGC GGAGCTCGTG
CAGCTCCGGC GGGATGCCGG CGCTGCGGCG GCCGGTCCGC GGCTGCCGGG GGAGCGGCCT
GGCCCGCCGG AGGTCCAGGC CCAGAGCGCG CTGCCTCCCC ACGGCGCGAC CTGGATGGAC
GAGGTGCTCG ACGCGGCGGT GCGCACCATC GACGCCGAGG CGTTCCGGCC GACGCCCGGC
GACCACTGCA CGCTGTGCAC CTTCCAGACG AGCTGCCCCG CGCGGCCCGA GGGACGTCAG
GTGGTCGAGT GA
 
Protein sequence
MVVLPAPAPR PGYRLIAAPV APPRPLVLDA AQRAVVEHGG GPLLVLAGPG TGKTATLVEA 
VAARIEAGAD PRSILVLTFS RRAAGELRER ITARLGAEGG AGGGPGAWTF HAWCLALLRA
HERPAPPGGL RLLSGPEQDS RLRDLIEGSR EDGRPVWPEP LVGCLRTRGF TEEVRALLAR
AREVGLEPVA LAQLARRTGR PDWAALAEFY EMYLDVFGFE GAVDYTDLVH RAVVVAESAE
GGAWLRGRYR HVFVDEYQDT DPAQERLLEA VAGGGGNLVV LGDPDQSIYA FRGAEVAGLL
GFPARFPRLD GEPAPIVALR RCRRMAPAPL AASRHVARRI PAAGLPVAAI RAHRDLVGRA
DAGAGQVQAR TFPGTGAEAE SVADLLRREH LENGVAWDAM AVLVRTAERI GRLRRVLAAA
GVPVSADGDD LPVAQEPAAA LLLLALRCAE DPAGALTVDA ARTLLTSPLG GADPAGLRAL
GRALRTLERD AGSEHPAPSA ELLRAAVAEP DHWLATIPDD LAGPVRRVGG LLRTAGQALR
DSSGAPQDAL WALWSASEWP ARLRHASAAG GAAGRAADRD LDAVVALFDA VTRLGQRRGP
GGGVASLVAE LTRQQIAGDV LKPAGESVRR RGVRLLTAHR AKGLEWEVVV VCGVQDGTWP
DLRERHSLLG AEQLDAPSRG GLRPPLTRQD LLADERRLFY VALTRARRRL VVTAVNSPED
DGSLPSRFLE ELGVAVEHVP GRPARPLTLV GLVATLRRLA TEPDSSPVMR SAAQARLAAL
AAARDQAGRP LVPAAHPDTW WGLLDPTTSD VPVVPVAGPI RLSGSSLSSI GACSLRWFLE
HEAYAVTPAS TAQGFGKVVH ALADEVTTGR TPADLAALDA RLDTVWRQLD FDARWRSDQE
RAAAREALAR FLDWHAAERG RRVIDAEVRF SCDLRVAGRD VQLRGFIDRL ELDEAGRVHV
IDFKTGRTAV APAELATHPQ LGSYQLAVRA GALDDVLAAA APDQPGADQP RAVPGGAELV
QLRRDAGAAA AGPRLPGERP GPPEVQAQSA LPPHGATWMD EVLDAAVRTI DAEAFRPTPG
DHCTLCTFQT SCPARPEGRQ VVE