Gene Francci3_2605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2605 
Symbol 
ID3906511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3074409 
End bp3077489 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content74% 
IMG OID637879930 
ProductDSH-like 
Protein accessionYP_481696 
Protein GI86741296 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.022552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCTCCA CGCCCGAGAT TCTTGTGGAG TTCGCCGCTG GCCTCCCCTT CGGACTCGAC 
CCGTTCCAGT TCGAGGCGGT CGCGGCGCTG GCTGCCGGTG AAGGGGTGCT CGTCGCGGCC
CCGACGGGAG CGGGTAAGAC GGTTGTCGGC GAGTTCGCCG CCCATCTCGC CCTGCGCACC
GGGACGCGTT GCTTCTACAC GACGCCGATC AAGGCGCTGT CGAACCAGAA GTACGCCGAT
CTCGTCTCCC GCTACGGCGC CGTCTCGGTC GGCCTGCTGA CCGGGGACAC CTCCCGCAAC
GGCGACGCCC CGATCGTGGT GATGACCACC GAGGTCCTGC GCAACATGCT CTACGCCGGC
CCGGTCGACA ACGGCCGGCT GGACGACCTC GGCTATGTCG TCATGGACGA GGTGCACTAC
CTCGCCGACC GCCAGCGCGG CGCCGTGTGG GAGGAGGTCA TCATCCATCT GCCGGCGCAG
GTCCGGCTGG TCTCCCTGTC CGCGACGGTG AGCAACGCCG AGGAGTTCGC CGAGTGGCTG
GTCACGGTGC GCGGCCATAC CCGGGTCATC GTCAGCGATC ACCGTCCCGT TCCGCTGTGG
CAGCATGTGC TCGCTGACCG CACCCTGTAC GACCTGTTCC TCGAGGACAC TGCCGGCACC
CCGCCGCCGG GCGGACCCGA GGCACTTCGT GCCACCCTGG ACGGTGATCT CGTCCGGTCG
AGGTGGAACG TCGGGGTCGA AGCCGGATTC GAGGTAGCCG GCACCCGGCG CGCCACCATC
GGCGACCGGG ACCGCGGCGG CGACCGGGAC CGCGGCGGCG ACCGGGACCG CGGCGGCGAC
CGGGACCGCG GACGGGGGCG GGGCGGCAAC CGCAACGGCG AACGGGGCCG GGCCACCCAG
GGTGGTCGCG GTGGCGACCG TGGCCGCAAT GGGGGTGCCG GCCAGGCGCC CGTGGCAGCC
ATCCCCGCGG CCCGTGGTGG TCCGACGGCG GTCGGTGGAC GAGTGGTGAA CCCGGACCTG
CTGCGGCTGG CCCGCGAGGA GAGCCGGGCG CTGTCGGGGG GCTCTCCCGC GGTCGGCCGC
GGTCGGCCGG CGCCCGGCGC GCGCCGGCGG ACGTGGGTGC CGGGCCGGCC GGAGGTCGTC
GAACGGCTCG ACCGCGACGG CCTGCTCCCG GCGATCGTCT TCGTGTTCAG CCGGGCCGGC
TGCGATGCGG CGGTCACCTC CTGCGTGCGT GCGGGTCTGC GGCTCGTCGG CCCCGCGGAG
CAGCAGCGGA TCCGGACCCT GGTCCGGGAA CGCACCGCCG GGATCCCCGA GACCGACCTC
GCCGTGCTCG GCTACTGGGC GTGGCTGGAG GGGCTGGAAC GCGGCATCGC ATCCCACCAC
GCCGGCATGC TGCCAACCTT CAAGGAGATC GTCGAGGAGC TGTTCGTCCA GGGGCTGGTC
CGGGTGGTCT TCGCCACGGA GACCCTGGCG CTCGGCATCA ACATGCCCGC CCGTACCGTC
GTGCTGGAGC GGCTGACGAA GTTCAACGGC GAGAGCCGCG TGGACATCAC CCCGGGGGAG
TACACCCAGC TGACGGGCCG CGCCGGGCGG CGCGGCATCG ACGTGGAGGG CCACGCGGTC
GTCCTGTGGC AGCCGGGGCT GGATCCGCTC GCGCTCGCGG GGCTGGCCTC CACGCGGACC
TATCCGCTGC GCTCGTCGTT CCGGCCGTCG TACAACATGG CCGTCAACCT GGTCGGCAGG
CTCGGGCGGG AGCGGGCACG CACCGTGCTG GAGTCGTCGT TCGCGCAGTT CCAGGCCGAC
CGGGCCGTGG TGGGCCTGGC CCGGGCGGTG CAGCGCAACA CCGAGGCGAT CGACGCGAAG
CGTGAGGCGT TGAGCTGCGA CAAGGGTGAC ATCGGCGAGT ACGACCGGTT GCGCCGGGAG
ATCGCTGAGC GGGAGGCCGC GCTGTCGCGG GAGGGTTCGG CGCGCCGGCG GGCCGAATCC
GCGGCGGCGC TCGCCCGGCT GCGGACCGGG GACATCGTGC GGGTGCCCGC CGGGCGGCGC
AGTGGGCTCG CTGTCGTGCT CGACGCCGAC GCCGCCGCGA ATGCCGCCGA CGGCCCGCGG
CCGGTGGTGC TCACCGCGGA CAGGCAGGTT CGCAGGCTGT CCCTGACCGA CTTCCCGATC
GCCGTCGAAC CGCTCGGCCG GGTCCGGGTG CCGCGGTCGT TCAACCCGCG CTCGCCGCAG
TCACGCCGGG ATCTGGCCTC CTCCCTGCGC GCCGCCGACG TCAACCCCGA TGCCCCTCCG
GGCCGGCGGG CCAGGGTCCG TTCGGCCGCG GCGGACGACG CCGAGCTCGC CCGGCTGCGC
CGGGCCCTGC GCGCCCATCC GGTGCACGAC TGTCCGCGCC GCGAGGAGCA CCTGCGCTCC
GCCGAGCAGG TCAACCGGCT GGTCAAGGAG ACCGCCGCGA TCTCACGGAA GGTGGAGGGC
CGGACGAACA CGGTCGCGAA GACGTTCGAC CGGGTCCGCG CCGCGCTCGA GGACCTCGGC
TACCTGGACG GGGACCGCGT CACCGCCGCC GGGCGGGTGC TCGCGCGCAT CTACTCGGAG
CAGGACCTGC TCGTCGCCGA ATGCCTGCGC GCCGGCATCT GGGACGATCT CACCCCGCCG
GCGCTGGCCG CCGCCGTGTC CACGCTGGTG TTCGAGCCGC GCGGTGACGA CGCCGGCGTC
CCGGCGCTGC CGGGCGGCGC GGCGCTACGC GACTGCCTCG CCGAGATGGT CCGGCTGTCG
GAGCGGCTCG CCGAGGCCGA GCAGGCCCAC CGGCTCGCGT TCCTGCGCCC GCCCGAGCTC
GGATTCGTCG CCGTCGCGCA CGACTGGGCC GCGGGCCGCA CGTTGGAACG GGTGCTGACC
GACAGTTCCG TGGAACTGAC CGCCGGCGAT TTCGTCCGGT GGATGCGCCA GCTCATCGAC
ATCCTTGACC AGATCGCGCA GGTCGCTCCG ATGGTGCAGG CGGACCCGGG AACGCCGGAC
GGCGCGCGGG TGCGCCGGAC CGCCCGGGCG GCGATGGACG CCGTCCGTCG CGGGGTCGTC
GCGTACGCGA TGAGCGTCTG A
 
Protein sequence
MSSTPEILVE FAAGLPFGLD PFQFEAVAAL AAGEGVLVAA PTGAGKTVVG EFAAHLALRT 
GTRCFYTTPI KALSNQKYAD LVSRYGAVSV GLLTGDTSRN GDAPIVVMTT EVLRNMLYAG
PVDNGRLDDL GYVVMDEVHY LADRQRGAVW EEVIIHLPAQ VRLVSLSATV SNAEEFAEWL
VTVRGHTRVI VSDHRPVPLW QHVLADRTLY DLFLEDTAGT PPPGGPEALR ATLDGDLVRS
RWNVGVEAGF EVAGTRRATI GDRDRGGDRD RGGDRDRGGD RDRGRGRGGN RNGERGRATQ
GGRGGDRGRN GGAGQAPVAA IPAARGGPTA VGGRVVNPDL LRLAREESRA LSGGSPAVGR
GRPAPGARRR TWVPGRPEVV ERLDRDGLLP AIVFVFSRAG CDAAVTSCVR AGLRLVGPAE
QQRIRTLVRE RTAGIPETDL AVLGYWAWLE GLERGIASHH AGMLPTFKEI VEELFVQGLV
RVVFATETLA LGINMPARTV VLERLTKFNG ESRVDITPGE YTQLTGRAGR RGIDVEGHAV
VLWQPGLDPL ALAGLASTRT YPLRSSFRPS YNMAVNLVGR LGRERARTVL ESSFAQFQAD
RAVVGLARAV QRNTEAIDAK REALSCDKGD IGEYDRLRRE IAEREAALSR EGSARRRAES
AAALARLRTG DIVRVPAGRR SGLAVVLDAD AAANAADGPR PVVLTADRQV RRLSLTDFPI
AVEPLGRVRV PRSFNPRSPQ SRRDLASSLR AADVNPDAPP GRRARVRSAA ADDAELARLR
RALRAHPVHD CPRREEHLRS AEQVNRLVKE TAAISRKVEG RTNTVAKTFD RVRAALEDLG
YLDGDRVTAA GRVLARIYSE QDLLVAECLR AGIWDDLTPP ALAAAVSTLV FEPRGDDAGV
PALPGGAALR DCLAEMVRLS ERLAEAEQAH RLAFLRPPEL GFVAVAHDWA AGRTLERVLT
DSSVELTAGD FVRWMRQLID ILDQIAQVAP MVQADPGTPD GARVRRTARA AMDAVRRGVV
AYAMSV