Gene Francci3_3799 details

Gene Information

Locus tag: Francci3_3799
Symbol:
ID: 3905547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4552518
End bp: 4555910
Gene Length: 3393 bp
Protein Length: 1130 aa
Translation table: 11
GC content: 75%
IMG OID: 637881125
Product: UvrD/REP helicase
Protein accession: YP_482878
Protein GI: 86742478
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases
TIGRFAM ID:
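
The coordinate and length fields in this record agree with each other, and a little arithmetic confirms it. The Python sketch below is not part of the original record. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4552518, 4555910)
print(length)                  # 3393
print(protein_length(length))  # 1130
```

Both results match the Gene Length and Protein Length values reported for this gene.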


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0121759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCCGC GGGTGGAGCA GCCGGGCGCG GGCGTCCACG TGACGATTCC CCGCCAGCCG 
TTCCTGCCCG GTTTCGACGG CGACGTCGAC GGTGACCCGG CGGGCGGCGT CGCCGGTCCC
CGGCTCGACC CGGACGACCT GCGCGACCTG CTGGAGGTCC CGTATACCGA CGAGCAGATC
GCCGCGGCCA CCGCTCCGCT CGAACCGGGG GTCATCATCG CGGGTGCGGG ATCCGGCAAG
ACCTCGGTGA TGGCCGCGCG GGTGGTCTGG CTCGTCGCCA CCGGGCAGGT GCGGCCCGAC
CAGGTGCTCG GACTGACCTT TACGACGAAG GCGGCGGCGG AGCTGTCCGG CCGGGTGCGG
CTGGCGTTGC GCAAGGCGTC GGCTGGGGCG GGCCCGGGTG GACCGGCCGC GGACGGCGAG
GTGGACGGGG AGCCGACCGT CGCCACCTAC CATGCCTTCG CCGGGCGGCT GGTCGTCGAC
AACGCCCTGC GGCTGGGGCT GGAGCCGGAC CTCCAGCTGA TCTCCGGCGC GGCCCGCTAC
CAGCTCGCCG CCCGGGTCGC CCGCTCCCAC GGCGGGAAGG TGGAGGCGCT GACACGTTCG
CTGGGCGCGC TCGTCGGCGA GCTCGTCGCG CTGGACGCCG AGATGAGCGA GCACCTGGTC
GATCCGGCGG ACCTCGTCGC GTTCGACGGC GCGCTGCTCG CCGAGATCGA CGCGGCGCTG
CGGCGTGCCG AGCAGCGGCG GGGCACCCGG GGCGTCCGCC GGGAGCTACG CCGGTGCGCG
GCCGCCGCCC GGGGGCGCCG GGAGCTGGCC GCCCTGGTGG CGGAGTACCG GGCGGCCCGG
CGGGAACGCG ACGTCCTGGA CTTTGGTGAC CAGGTCACCT GTGCCGCGAG GCTCGCCGAG
ACCATGCCGG AGGTCGGTCG GGCCGAGCGG GCGCGGGCCG CTGTCGTCCT GCTCGACGAG
TACCAGGACA CCTCGGTGGC CCAGCGGCGG ATGCTCACCG GCCTGTTCGG CGGCGGGCAT
CCGGTCACCG CCGTCGGCGA CCCCTGCCAG GCGATCTACG GCTGGCGCGG GGCGTCGGTG
GCGAACCTCG ACCACTTCCC CACGCACTTC CCCGGGGCCG ACGGCACGCC CGCGGCCGTC
TACGAACTGT CGGTGAACCA GCGCAGCGGC GGCCGGCTGC TCCGCCTCGC CAACACCGTC
GCCGCACAGC TGCGGGCCCG TCACCGCGTC GTCGAGCTCC GCCCGCGACC CGACGTGGCT
GACCAGGGCG AGGCGGTGGT CGCGTTGCAC GTCAGCTGGG CCGACGAGGT GGCCTGGATC
GCCGGCCAGC TCCGCCGGGT GGTCGACGCC GGCACCGCTC CCGGCGACAT CGCCGTGCTG
GTCCGGGCCC GTGGCGACAT CCCCGCGCTG TTTACCGCGA TGCAGGCGGC CGGGCTGCCC
GTCGAGGTCG TCGGGCTGAC CGGGCTGCTC ATCGTCCCGG AGGTCGCCGA GATCGTGGCC
ATGCTGGAGG TGCTCGACTC GCCGACGGCG AACGCCGCGC TCGTCCGGCT GCTCACGGGG
CCGCGGGTGC GGCTCGGCCC CCGGGATCTC GCCGCGCTCG GCCGCCACGC GCGTGAGGCG
GCGCGGGTCC CGGACCCGGT CGGCGATCCG GACCTGCCGG GACGGTCCGA TCCACTTGCC
GAGGCGGTCG CCGACGTCGA CCCCGCCGAC GTCGTCGCCC TGTCGGACGT CCTCGACGAC
CCCGGGCCGC AGATGTCCGC GCCGGGGCGC GCCCGGGTGC GTCGGCTTGC CGCCGAGATC
GCCGCGCTGC GTGGGCACGT CGGCGAGGGC CTGCTCGACC TGCTGCACCG GGTGGTCGCG
ACGATCGGGC TGGACGTGGA GCTCACGGCG ACCGAGGTGG CGGTGCGGGC CCGGCGCCAG
GAGAACGTCG CGGCCTTCCT GGATGTGGCC GCGGGCTTCA CCGATCCCGA CGGCACGAAC
TCCCTGCCCG CGTTCCTCGG ATTCCTCCGG GCGGCGCGGG AGCACGAGCG GGGTCTCGAC
GTCGCCGGCC CGTCCGGGGC CGACGCGATC GCCCTGATGA CGATGCATCG CTCCAAGGGC
CTGGAATGGG AGGTCGTCGC CGTCCCGAAC CTGACCAGCA AGGTGTTCCC TGACCTGACC
GTGCGCGACC AGTGGACCAC CTCGCCGGGC GTGCTGCCGA TCCCGCTGCG CGGCGATGCC
GACGATCTGC CGGCGTTCAC GGTCTGCGCC GAGAAGGCCG CCCTGGACGC CTTCCGGGCC
GATGCCCGTC AGTACGCCGA GCGGGAGGAA CGCCGGCTCG CCTACGTTGC GGTCACCCGG
GCGAAGTCGC TGCTGTTCGC GAGCGGGCAC TGGTGGGGGC CCACCCAGAA GACCCCACGC
GGTGCCTCGG TGTTCCTCGA CGAGCTCGCC GAGCACGCCC GCACCGGCGG TGGGCTGGTC
GACGTCTGGG CGCCGGAGCC GGCGGAGCGT ACCAACCCGG CGCTGGCGAC GCCGGAACGG
TTCGCCTGGC CGATCCCCTA CGAGCCGGAG CCGTACGCCC GGCGCCTCGC CGCGGCCGAG
GGGGTCATGG CGCGGCTGGC CGCGTTGAGC GTACCGGGGG CGTCGGCGGA ACCGGCCGGG
GAACCGTTCG TCGACGGTCC GGCCGGGATG ACGGCGGCCG AGCGCGTCCT GCTCGCCGAG
CTCGACCGGG AGGCCCGGCT GCTGCTCGCC GAGGAACGCG CGGCACGCCT CGCGCGGACC
GATGTCGCCC TGCCGGCGAG CCTCACCGCG TCCCAGATCG TCCGGCTGCG CGCGGACCCG
GAGGCGTTCG CCCGTGAGCT GGTCCGGCCG CTGCCGCGCC GTCCAGTGGC CGCGGCCCGT
CGCGGCACCC GGTTCCACGC CTGGGTTGAG GAGATTTTCG ACTACCGTGC ACTGATCGAC
ACCGAGGATC TGCCCGGCGC GGCCGACGCG GAGCTCACCG ACGACGATCT GCGCTCGCTG
CAGCAGGCGT TTCTGCGTAC CGCGTACGGT GCCCGGCGGC CATTCGCGAT CGAGGCACCC
TTCGAGCTCC GCCTCGCCGG GCGGATCGTG CGCGGTCGCA TCGACGCTGT CTACGACCTC
GGCGGCGGTC GATGGGAGGT GGTCGATTGG AAGACCGGCC GGTCCGATGC CGACGATCTT
CAGCTCGGGA TCTACCGGCT GGCCTGGGCC CGCCTGCGGG GCGTCGACCC GAGTGCGGTC
GACGCCGCCT TCCTGTACGT GCGTACCGGC GCCGTTGTCC GGCCCCCGAC GTTGTCCGAG
GAGGAGCTCG CCGACCTGCT CGCGAGTCCG AGCACCGGGC CCGCCGCGGG TCAGGCCCGC
GGGTCGGCGA GGTCGACGGA CAGCGCCAGG TGA
 
Protein sequence
MNPRVEQPGA GVHVTIPRQP FLPGFDGDVD GDPAGGVAGP RLDPDDLRDL LEVPYTDEQI 
AAATAPLEPG VIIAGAGSGK TSVMAARVVW LVATGQVRPD QVLGLTFTTK AAAELSGRVR
LALRKASAGA GPGGPAADGE VDGEPTVATY HAFAGRLVVD NALRLGLEPD LQLISGAARY
QLAARVARSH GGKVEALTRS LGALVGELVA LDAEMSEHLV DPADLVAFDG ALLAEIDAAL
RRAEQRRGTR GVRRELRRCA AAARGRRELA ALVAEYRAAR RERDVLDFGD QVTCAARLAE
TMPEVGRAER ARAAVVLLDE YQDTSVAQRR MLTGLFGGGH PVTAVGDPCQ AIYGWRGASV
ANLDHFPTHF PGADGTPAAV YELSVNQRSG GRLLRLANTV AAQLRARHRV VELRPRPDVA
DQGEAVVALH VSWADEVAWI AGQLRRVVDA GTAPGDIAVL VRARGDIPAL FTAMQAAGLP
VEVVGLTGLL IVPEVAEIVA MLEVLDSPTA NAALVRLLTG PRVRLGPRDL AALGRHAREA
ARVPDPVGDP DLPGRSDPLA EAVADVDPAD VVALSDVLDD PGPQMSAPGR ARVRRLAAEI
AALRGHVGEG LLDLLHRVVA TIGLDVELTA TEVAVRARRQ ENVAAFLDVA AGFTDPDGTN
SLPAFLGFLR AAREHERGLD VAGPSGADAI ALMTMHRSKG LEWEVVAVPN LTSKVFPDLT
VRDQWTTSPG VLPIPLRGDA DDLPAFTVCA EKAALDAFRA DARQYAEREE RRLAYVAVTR
AKSLLFASGH WWGPTQKTPR GASVFLDELA EHARTGGGLV DVWAPEPAER TNPALATPER
FAWPIPYEPE PYARRLAAAE GVMARLAALS VPGASAEPAG EPFVDGPAGM TAAERVLLAE
LDREARLLLA EERAARLART DVALPASLTA SQIVRLRADP EAFARELVRP LPRRPVAAAR
RGTRFHAWVE EIFDYRALID TEDLPGAADA ELTDDDLRSL QQAFLRTAYG ARRPFAIEAP
FELRLAGRIV RGRIDAVYDL GGGRWEVVDW KTGRSDADDL QLGIYRLAWA RLRGVDPSAV
DAAFLYVRTG AVVRPPTLSE EELADLLASP STGPAAGQAR GSARSTDSAR
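
The protein sequence follows from the gene sequence under translation table 11, in which an initiator GTG is read as methionine. The Python sketch below is not part of the original record. It translates the first and last lines of the gene sequence as listed above. The helper names (`clean`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino acid string, which tables 1 and 11 share.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same for NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 accepts as initiators; as a start they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate in frame 0; '*' marks a stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


first_line = "GTGAACCCGC GGGTGGAGCA GCCGGGCGCG GGCGTCCACG TGACGATTCC CCGCCAGCCG"
last_line = "GGGTCGGCGA GGTCGACGGA CAGCGCCAGG TGA"

print(translate(first_line))                      # MNPRVEQPGAGVHVTIPRQP
print(translate(last_line, is_cds_start=False))   # GSARSTDSAR*
```

The last line can be translated on its own because the 56 preceding lines hold 60 bases each (3360 bases, a multiple of 3), so it starts in frame. The two outputs match the first 20 and last 10 residues of the protein sequence, followed by the TGA stop.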