Gene Information |
Locus tag | Francci3_2605 |
Symbol | |
ID | 3906511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3074409 |
End bp | 3077489 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879930 |
Product | DSH-like |
Protein accession | YP_481696 |
Protein GI | 86741296 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
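The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3074409
end_bp = 3077489

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3081 1026
```

Both results agree with the listed Gene Length (3081 bp) and Protein Length (1026 aa).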
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.022552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.122557 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCTCCA CGCCCGAGAT TCTTGTGGAG TTCGCCGCTG GCCTCCCCTT CGGACTCGAC CCGTTCCAGT TCGAGGCGGT CGCGGCGCTG GCTGCCGGTG AAGGGGTGCT CGTCGCGGCC CCGACGGGAG CGGGTAAGAC GGTTGTCGGC GAGTTCGCCG CCCATCTCGC CCTGCGCACC GGGACGCGTT GCTTCTACAC GACGCCGATC AAGGCGCTGT CGAACCAGAA GTACGCCGAT CTCGTCTCCC GCTACGGCGC CGTCTCGGTC GGCCTGCTGA CCGGGGACAC CTCCCGCAAC GGCGACGCCC CGATCGTGGT GATGACCACC GAGGTCCTGC GCAACATGCT CTACGCCGGC CCGGTCGACA ACGGCCGGCT GGACGACCTC GGCTATGTCG TCATGGACGA GGTGCACTAC CTCGCCGACC GCCAGCGCGG CGCCGTGTGG GAGGAGGTCA TCATCCATCT GCCGGCGCAG GTCCGGCTGG TCTCCCTGTC CGCGACGGTG AGCAACGCCG AGGAGTTCGC CGAGTGGCTG GTCACGGTGC GCGGCCATAC CCGGGTCATC GTCAGCGATC ACCGTCCCGT TCCGCTGTGG CAGCATGTGC TCGCTGACCG CACCCTGTAC GACCTGTTCC TCGAGGACAC TGCCGGCACC CCGCCGCCGG GCGGACCCGA GGCACTTCGT GCCACCCTGG ACGGTGATCT CGTCCGGTCG AGGTGGAACG TCGGGGTCGA AGCCGGATTC GAGGTAGCCG GCACCCGGCG CGCCACCATC GGCGACCGGG ACCGCGGCGG CGACCGGGAC CGCGGCGGCG ACCGGGACCG CGGCGGCGAC CGGGACCGCG GACGGGGGCG GGGCGGCAAC CGCAACGGCG AACGGGGCCG GGCCACCCAG GGTGGTCGCG GTGGCGACCG TGGCCGCAAT GGGGGTGCCG GCCAGGCGCC CGTGGCAGCC ATCCCCGCGG CCCGTGGTGG TCCGACGGCG GTCGGTGGAC GAGTGGTGAA CCCGGACCTG CTGCGGCTGG CCCGCGAGGA GAGCCGGGCG CTGTCGGGGG GCTCTCCCGC GGTCGGCCGC GGTCGGCCGG CGCCCGGCGC GCGCCGGCGG ACGTGGGTGC CGGGCCGGCC GGAGGTCGTC GAACGGCTCG ACCGCGACGG CCTGCTCCCG GCGATCGTCT TCGTGTTCAG CCGGGCCGGC TGCGATGCGG CGGTCACCTC CTGCGTGCGT GCGGGTCTGC GGCTCGTCGG CCCCGCGGAG CAGCAGCGGA TCCGGACCCT GGTCCGGGAA CGCACCGCCG GGATCCCCGA GACCGACCTC GCCGTGCTCG GCTACTGGGC GTGGCTGGAG GGGCTGGAAC GCGGCATCGC ATCCCACCAC GCCGGCATGC TGCCAACCTT CAAGGAGATC GTCGAGGAGC TGTTCGTCCA GGGGCTGGTC CGGGTGGTCT TCGCCACGGA GACCCTGGCG CTCGGCATCA ACATGCCCGC CCGTACCGTC GTGCTGGAGC GGCTGACGAA GTTCAACGGC GAGAGCCGCG TGGACATCAC CCCGGGGGAG TACACCCAGC TGACGGGCCG CGCCGGGCGG CGCGGCATCG ACGTGGAGGG CCACGCGGTC GTCCTGTGGC AGCCGGGGCT GGATCCGCTC GCGCTCGCGG GGCTGGCCTC CACGCGGACC TATCCGCTGC GCTCGTCGTT CCGGCCGTCG TACAACATGG CCGTCAACCT GGTCGGCAGG CTCGGGCGGG AGCGGGCACG CACCGTGCTG GAGTCGTCGT TCGCGCAGTT CCAGGCCGAC 
CGGGCCGTGG TGGGCCTGGC CCGGGCGGTG CAGCGCAACA CCGAGGCGAT CGACGCGAAG CGTGAGGCGT TGAGCTGCGA CAAGGGTGAC ATCGGCGAGT ACGACCGGTT GCGCCGGGAG ATCGCTGAGC GGGAGGCCGC GCTGTCGCGG GAGGGTTCGG CGCGCCGGCG GGCCGAATCC GCGGCGGCGC TCGCCCGGCT GCGGACCGGG GACATCGTGC GGGTGCCCGC CGGGCGGCGC AGTGGGCTCG CTGTCGTGCT CGACGCCGAC GCCGCCGCGA ATGCCGCCGA CGGCCCGCGG CCGGTGGTGC TCACCGCGGA CAGGCAGGTT CGCAGGCTGT CCCTGACCGA CTTCCCGATC GCCGTCGAAC CGCTCGGCCG GGTCCGGGTG CCGCGGTCGT TCAACCCGCG CTCGCCGCAG TCACGCCGGG ATCTGGCCTC CTCCCTGCGC GCCGCCGACG TCAACCCCGA TGCCCCTCCG GGCCGGCGGG CCAGGGTCCG TTCGGCCGCG GCGGACGACG CCGAGCTCGC CCGGCTGCGC CGGGCCCTGC GCGCCCATCC GGTGCACGAC TGTCCGCGCC GCGAGGAGCA CCTGCGCTCC GCCGAGCAGG TCAACCGGCT GGTCAAGGAG ACCGCCGCGA TCTCACGGAA GGTGGAGGGC CGGACGAACA CGGTCGCGAA GACGTTCGAC CGGGTCCGCG CCGCGCTCGA GGACCTCGGC TACCTGGACG GGGACCGCGT CACCGCCGCC GGGCGGGTGC TCGCGCGCAT CTACTCGGAG CAGGACCTGC TCGTCGCCGA ATGCCTGCGC GCCGGCATCT GGGACGATCT CACCCCGCCG GCGCTGGCCG CCGCCGTGTC CACGCTGGTG TTCGAGCCGC GCGGTGACGA CGCCGGCGTC CCGGCGCTGC CGGGCGGCGC GGCGCTACGC GACTGCCTCG CCGAGATGGT CCGGCTGTCG GAGCGGCTCG CCGAGGCCGA GCAGGCCCAC CGGCTCGCGT TCCTGCGCCC GCCCGAGCTC GGATTCGTCG CCGTCGCGCA CGACTGGGCC GCGGGCCGCA CGTTGGAACG GGTGCTGACC GACAGTTCCG TGGAACTGAC CGCCGGCGAT TTCGTCCGGT GGATGCGCCA GCTCATCGAC ATCCTTGACC AGATCGCGCA GGTCGCTCCG ATGGTGCAGG CGGACCCGGG AACGCCGGAC GGCGCGCGGG TGCGCCGGAC CGCCCGGGCG GCGATGGACG CCGTCCGTCG CGGGGTCGTC GCGTACGCGA TGAGCGTCTG A
|
Protein sequence | MSSTPEILVE FAAGLPFGLD PFQFEAVAAL AAGEGVLVAA PTGAGKTVVG EFAAHLALRT GTRCFYTTPI KALSNQKYAD LVSRYGAVSV GLLTGDTSRN GDAPIVVMTT EVLRNMLYAG PVDNGRLDDL GYVVMDEVHY LADRQRGAVW EEVIIHLPAQ VRLVSLSATV SNAEEFAEWL VTVRGHTRVI VSDHRPVPLW QHVLADRTLY DLFLEDTAGT PPPGGPEALR ATLDGDLVRS RWNVGVEAGF EVAGTRRATI GDRDRGGDRD RGGDRDRGGD RDRGRGRGGN RNGERGRATQ GGRGGDRGRN GGAGQAPVAA IPAARGGPTA VGGRVVNPDL LRLAREESRA LSGGSPAVGR GRPAPGARRR TWVPGRPEVV ERLDRDGLLP AIVFVFSRAG CDAAVTSCVR AGLRLVGPAE QQRIRTLVRE RTAGIPETDL AVLGYWAWLE GLERGIASHH AGMLPTFKEI VEELFVQGLV RVVFATETLA LGINMPARTV VLERLTKFNG ESRVDITPGE YTQLTGRAGR RGIDVEGHAV VLWQPGLDPL ALAGLASTRT YPLRSSFRPS YNMAVNLVGR LGRERARTVL ESSFAQFQAD RAVVGLARAV QRNTEAIDAK REALSCDKGD IGEYDRLRRE IAEREAALSR EGSARRRAES AAALARLRTG DIVRVPAGRR SGLAVVLDAD AAANAADGPR PVVLTADRQV RRLSLTDFPI AVEPLGRVRV PRSFNPRSPQ SRRDLASSLR AADVNPDAPP GRRARVRSAA ADDAELARLR RALRAHPVHD CPRREEHLRS AEQVNRLVKE TAAISRKVEG RTNTVAKTFD RVRAALEDLG YLDGDRVTAA GRVLARIYSE QDLLVAECLR AGIWDDLTPP ALAAAVSTLV FEPRGDDAGV PALPGGAALR DCLAEMVRLS ERLAEAEQAH RLAFLRPPEL GFVAVAHDWA AGRTLERVLT DSSVELTAGD FVRWMRQLID ILDQIAQVAP MVQADPGTPD GARVRRTARA AMDAVRRGVV AYAMSV
|
| |
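The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It is a minimal standard-library illustration, not the pipeline used to produce this record; `translate_cds` is a hypothetical helper name, and the sequence is assumed to be given in coding orientation.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, reading an alternative start codon as M (hypothetical helper)."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
first_30_nt = "GTGTCCTCCA CGCCCGAGAT TCTTGTGGAG"
print(translate_cds(first_30_nt))  # MSSTPEILVE
```

The output matches the first ten residues of the protein sequence listed above.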