Gene Francci3_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3744 
Symbol 
ID3906028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4481721 
End bp4485095 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content72% 
IMG OID637881070 
ProductUvrD/REP helicase 
Protein accessionYP_482824 
Protein GI86742424 
COG category[L] Replication, recombination and repair
[S] Function unknown 
COG ID[COG0210] Superfamily I DNA and RNA helicases
[COG1379] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00375] conserved hypothetical protein TIGR00375 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGTCAC GGTTCTACGC TGATGTGCAC ATTCACTCCC GGTACTCGCG TGCCTGCAGT 
CGTGACTGCG ATCTGGAGCA CCTGGCCTGG TGGGCGGCAC GCAAGGGCAT CGCCGTCGTC
GGCACCGGCG ACTTCACCCA TCCGGCCTGG TCTCAGGAGA TCGCGACGAA GCTGGTCCCG
GCCGAGCCCG GCCTGTTCCG GCTGCGGTCC GATCTTGAGC ATGAGGTGCT GCGCACGTTA
CCGGCCTCCT GCCGGACGGC GACCCGCTTC ATGATCTCTA GTGAGATCTC GACGATCTAC
AAGAGGGGCG ACCGGACCCG TAAGGTCCAT CATCTTCTCT ACGCTCCCGA CCGGGAGGCC
GCCGGACGGA TCACGGCGGC GCTGGCTCGG ATCGGCAACC TCGCCGCGGA CGGGCGTCCC
ATCCTGGGCC TGGACTCCCG CGATCTGCTG GAGATCACCC TTGGTGGCGG GGCGGGCTGC
TATCTGGTCC CGGCGCACGT GTGGACGCCC TGGTTCGCGG TTCTCGGTTC GAAGTCCGGG
TTCGACGCCG TCGAGGACTG CTACGGCGAC CTCGCCGACG AGGTCTTCGC CCTGGAGACG
GGGCTTTCGG CGGACCCGGA GATGTTCTGG CGGATCTCCG GTCTCGACCG CTACCGGCTG
GTGAGCAACT CTGACGCCCA CTCCCCGCCG ATGCTCGGCC GGGAGGCGAC CGCGTTCACC
TGCGACCTCG ACTACTTCTC CATCGAGGCC GCGCTGCGCG GCGGGGACGG CTTCGCGGGG
ACCGTCGAGT TCTTCCCGGA AGAGGGCAAG TACCACCTCG ACGGGCATCG CAAGTGCGGG
GTGGTCCTCA CGCCGGACCA GACCCGGGAG GTCGGCGGAC GCTGCCCCAC CTGCGGCGGT
GGCCTCACCG TCGGGGTCCT CAACCGGGTC GAGGCGCTGG CCGACCGGCG GCCCGGCCAC
CGCCCCGTCA CGGCGCCCGA CGTCACCTCG CTGGTTCCGT TGCCGGAGGT CGTCGGGGAG
ATTCTCGGCG TGGGCCCGAA GAGCAAGGCG GTCGCGGGCC AGGTGACGTC GTTGGTCTCC
CGTCTGGGCC CGGAGCTCGA CATCCTCGGC GACGTACCGC TCACGGACAT CGCTGGGGTC
GGATCGCCGG AGCTGGTCGA GGCAATCAGT CGCCTGCGCC GCGGGGAGGT GATCCGGCAG
GCCGGGTTCG ACGGCGAATA CGGCGTCATC CGGTTGTTCG AACCCCGGGA GCTCGCCCGG
GACGGTGGCA CCCTGTTCGA CCTGGGGTCC GGCGCGGCTG GTGGGCAGGA CCGGATCGAT
GCGGGACCCT CCCTCGACGA GGCGCTCGCG GCGCGGGCGC GTCCGGCACC CGATGCCGCG
GTGGTGCCGG GGGATGCCGC GGTGGCCGAC AGCCAGCTGT TCACGCCGGT TTCCGGCGAT
ACCCCGTCGG TGCTTGACGG GCTGGATCCC GAGCAGCGGC TGGCAGCCTC GCACCTGTCC
GGTCCGCTGC TCGTCCTCGC CGGACCGGGA ACGGGCAAGA CCCGTACGCT GGTCCACGCG
ATCGCACACC GGGTCGCTGA GCACGGGGTG CCGGCCGGGG AGTGCCTGGC GGTCACCTTC
ACCCGGCGTG CGGCCGGGGA GCTCGCCGAA CGGCTAGCCG GCCTGCTGGG TGACGCCGCG
GGACGGGTGC TCGCGACGAC CTTCCACGGC CTCGGGCTTA CGATCATCCG TGAGCAGCAC
GCGAAACTCG CCCGGGGCCC GCAGGTCCAG GTCGCGGACG ACGCGGTGCG CGTGGAGCTA
ATCGCCGCGG CACTGCACGG TGAGGGCGAC GCCCGAACGC GCCGGCGGGT GGCGGCCGGT
GTCGCCGAGC TCAAGCGGCA CCGAGCGCTC GGCCAGGCGA TCCGCGACCA CGACCTGGTC
GGGGCGCTCG CCCGTTACGA CGCCGCGCTG CGCGATCGGG ACATGGTCGA TCTCGACGAT
CTGATCACCC TGCCGCTGAC CCTGCTGCGG TCGTCCCCGG ATCTCGCCGA GCACTACCAG
CGGCGGTGGC GCCACGTCTG GGTCGACGAG TACCAGGACA CCGACGAACT CCAGTACCGC
CTGCTGGGGC TGTTGTGTCC CCCCACGGCC AACCTGTGCG TGATCGGCGA CCCGGACCAG
GCTATCTACA GCTTCCGGGG TGCGGACGTC CGGTTCTTCC TGCGGTTCGA GCAGGACTAC
CCGAGTGCCC GCCCGGTCGC CCTGACCCGC GGCTACCGAT CCACCCGCAC CATCGTGCGG
ACGGCGCTCG ATGTGATCGC GCCCACGAGT CTGGTCCCGG ACCGGACCCT GACCGCCGTG
CGGGGCGCCG AGGGCGACGG CCCGGTGCTG CTGCGGCGCT ACCGCAGTGA GGCCGAGGAG
GCGATCGCCG TCGTCGACAC GATCGACGCG GCGCTCGGCG GTACCTCGTT CCACGCGTTG
GACTCGGGGG TGGACGGATC GGTCGACGCG GGGTTCTCCT TCGCGGACAT CGCCGTGCTC
TACCGGACCG CCCGCCAGGC CGAGCCGATC ATGGAAGCGC TGGCCACCCG CGGTTTCCCG
TTCCAGCGGC GCTCTCATCT CCCGCTCGCG GACGCTCCCG CGGTCGCCGA CCTGCTCGCC
CTGCTGCAGG ACCTGACGAC CACGGATCCC TCGGGTCCCG GCGTCCCGCG TCCGGTGTCG
GGTCTGCTGC GCGACGCGGC GGCACGGGCC ACCGATCTCG CCGAGGCGAG GAGAAGCGAG
CTCGGCGCCG TGCCGTCCGA CGGGTTTCCC GGCTTCTCCG GCGGACGGGT GCCGACCGAG
GCCGAGCTGC GCCTGGCGGT GGAGCTGCTC GCACCGGCCG CCGCAGCGGC AGGCAACGAC
CTGGCGGGTT TCCTCACGTC GGTCACGCTC GCCGCCGAGG TCGACGGGCT GGACCCGAGG
GCGGACCGGA TCTCGCTGCT GACCCTGCAC GCGAGCAAGG GTCTGGAATA CGGCCTGGTC
ATCATCGTCG GTTGTGAGGA CGGGCTGCTG CCGATGCGCT TCGGTCCGGC CGGTGAGGCG
CCGGGCGGTG GGATCCCGGG CGGTGGCGTC AACGGAACCG CCGACGGAAC CGCCGACGCC
GGCACGAAGG ACGCCGAGGC GGAGGAGCGC CGACTCTTCT TCGTCGGCGT CACCCGGGCC
AGGCATCGGT TGGTCCTCAC CTCGGCGGCC AGCCGGCGGC GCGCCGGGTC GGTCGTGACG
ACCCGTCCCT CACCGTTTCT TGCGGACATC AGGCCGGCTT TGCTGTCCTC TGTCCCGGCC
GAGGGCGTCC CGGCCGAGGG CGGGCGGCGC CGGTCGCGCC GGCCGGCACC GGGCAAGCAG
TTGCGACTCC TCTGA
 
Protein sequence
MRSRFYADVH IHSRYSRACS RDCDLEHLAW WAARKGIAVV GTGDFTHPAW SQEIATKLVP 
AEPGLFRLRS DLEHEVLRTL PASCRTATRF MISSEISTIY KRGDRTRKVH HLLYAPDREA
AGRITAALAR IGNLAADGRP ILGLDSRDLL EITLGGGAGC YLVPAHVWTP WFAVLGSKSG
FDAVEDCYGD LADEVFALET GLSADPEMFW RISGLDRYRL VSNSDAHSPP MLGREATAFT
CDLDYFSIEA ALRGGDGFAG TVEFFPEEGK YHLDGHRKCG VVLTPDQTRE VGGRCPTCGG
GLTVGVLNRV EALADRRPGH RPVTAPDVTS LVPLPEVVGE ILGVGPKSKA VAGQVTSLVS
RLGPELDILG DVPLTDIAGV GSPELVEAIS RLRRGEVIRQ AGFDGEYGVI RLFEPRELAR
DGGTLFDLGS GAAGGQDRID AGPSLDEALA ARARPAPDAA VVPGDAAVAD SQLFTPVSGD
TPSVLDGLDP EQRLAASHLS GPLLVLAGPG TGKTRTLVHA IAHRVAEHGV PAGECLAVTF
TRRAAGELAE RLAGLLGDAA GRVLATTFHG LGLTIIREQH AKLARGPQVQ VADDAVRVEL
IAAALHGEGD ARTRRRVAAG VAELKRHRAL GQAIRDHDLV GALARYDAAL RDRDMVDLDD
LITLPLTLLR SSPDLAEHYQ RRWRHVWVDE YQDTDELQYR LLGLLCPPTA NLCVIGDPDQ
AIYSFRGADV RFFLRFEQDY PSARPVALTR GYRSTRTIVR TALDVIAPTS LVPDRTLTAV
RGAEGDGPVL LRRYRSEAEE AIAVVDTIDA ALGGTSFHAL DSGVDGSVDA GFSFADIAVL
YRTARQAEPI MEALATRGFP FQRRSHLPLA DAPAVADLLA LLQDLTTTDP SGPGVPRPVS
GLLRDAAARA TDLAEARRSE LGAVPSDGFP GFSGGRVPTE AELRLAVELL APAAAAAGND
LAGFLTSVTL AAEVDGLDPR ADRISLLTLH ASKGLEYGLV IIVGCEDGLL PMRFGPAGEA
PGGGIPGGGV NGTADGTADA GTKDAEAEER RLFFVGVTRA RHRLVLTSAA SRRRAGSVVT
TRPSPFLADI RPALLSSVPA EGVPAEGGRR RSRRPAPGKQ LRLL