Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0992 |
Symbol | |
ID | 5669406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1167299 |
End bp | 1170769 |
Gene Length | 3471 bp |
Protein Length | 1156 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239920 |
Product | UvrD/REP helicase |
Protein accession | YP_001505354 |
Protein GI | 158312846 |
COG category | [L] Replication, recombination and repair [S] Function unknown |
COG ID | [COG0210] Superfamily I DNA and RNA helicases [COG1379] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00375] conserved hypothetical protein TIGR00375 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.175331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGTCAC GGTTCTTTGC CGACACCCAC GTGCACTCCA GGTACTCGCG GGCCTGTAGC CGCGACTGCG ACCTGGAGCA TCTGGCGTGG TGGGCGGCCC GAAAAGGCAT CCGCGTCGTC GGGACCGGCG ACTTCACCCA TCCGGCCTGG TCGCAGGAGC TCGCGACCAA GCTGGTGCCG GCGGAGCCGG GACTGTTCCG GCTGCGTCCC GACCTGGAGC GTGACGTGCT CGCGACGCTG CCCGCGTCGT GCCGCACCGC GACCCGATTC ATGATCTCTA CGGAGATCTC CACCATCTAC AAGCGAGGCG ACCGTACAAG GAAGGTTCAT CACCTCCTCT ACGCGCCCGA TCGCGAATCG GCCGGTCGGA TCACGTCCGC ACTGGCCCGA ATCGGCAATC TCTCCGCAGA CGGACGGCCG ATCCTCGGCC TGGACTCCCG TGACCTGCTC GAGATCACGC TCGGTGCCGG CGAGGGCTGT TACCTGATCC CCGCGCACGT GTGGACGCCC TGGTTCGCCG TGTTGGGCTC AAAATCCGGG TTCGACGCCG TCGAGGACTG CTACGCCGAC CTCGCCGAGC ACGTCTTCGC GCTGGAGACA GGCCTGTCGG CGGACCCGGA GATGTTCTGG CGGATCTCCG GGCTGGACCG CTACCGGCTG GTCAGCAACT CCGACGCGCA CTCCCCGCCG ATGCTCGGGC GGGAGGCCAC CGCCTTCACC TGCGAGCCCG ACTACTTCTC GATCCTCAGT GCGCTGCGCA CCGGCGACGG GTTCGGTGGC ACCGTCGAGT TCTTCCCCGA GGAGGGGAAG TACCACCACG ACGGCCACCG CAAGTGCGGG GTGGCGCTGA CGCCGGAGCA GACCCGCGCG GTCGGCGGCA TCTGCCCCGA GTGCGGGCGT CCGCTCACGG TAGGCGTACT CAACCGGGTG GACACGCTCG CCGACCGCAC GGAAGTGCAC CGACCCGACA CGGCAGGTGA CGTCACGTCG CTGGTGGCCC TGCCCGAGAT CGTCGGCGAG ATCCTCGGGG TCGGGCCGAA GAGCAAGTCG GTGGCGACGC AGGTCAGCAC CCTCGTCTCC CGGCTCGGCC CCGAGCTCGG CATCCTCTCG GACGTCCCGC TGGACGCGAT CGCGGAGGTG GGCTCGCCGA CGCTGGTCGA GGGCATCGGA CGGCTGCGCC GGGGCGAGGT GATCCGCCAG GCCGGCTTCG ACGGCGAGTA CGGCGTCATC CGGCTCTTCC AGCCCGACGA GCTCACCGCC GACGCGGGCA CGCTGTTCGA CCTCGGTACC CCCGGCCTCG CGACGGCGCC GCAGGCCGGC ATCCGCCGGG GCCGCGGCGC CGGCCGGGGC GGCACCGCGC TCGCACCCGG CGCCGCCGAA GCAGCCGAGG TCCCCGAGGT CCCCGAGGTC CCCGAGGTCG CCGGAGCTGC CGGAGCTACC GACGCCGCCA AGATCACCGA AGCCGGTGAC CTGTTCCCGG GCGATGACGC GGTGGAGCGC AGCGGCCCGC CGTCGGTGCT CGACGGACTT GATCCCGACC AGCGGCGCGC GGCCGAGCAC GTCGGCGGGC CGCTGCTGGT CCTCGCCGGC CCCGGCACCG GGAAGACCCG CACGCTCGTG CACGCCATCG CGCACCGGGT CCGTGAGCTC GGCGTGCCCG CCGAGCAGTG CCTGGCGGTC ACGTTCACCC GCCGCGCTGC CGGCGAGCTG CGAGAACGGC TCGACGTTCT GCTTGCGCCA GGCCCGGCCG GGACGCGCGC TGAGAGCACA CCCGCGGACG GCACGCCCAC AGATGCGGCG TCCACGAGCG GTGCGCCCGT GGACGAGGCA CCAGGAGACA CGGGGAGGCC CGGTGCCGGC GCGCTGGCGA CGACGTTCCA CGGCCTCGGC CTGCTGATCA TCCGGGAGCA GCATGTGGCG CTCGCGCGGG CGCCACGGGT GCAGGTGGCC GACGACGCCA TCCGCGGGGA GCTGATCGAC GAGGCGATGC ACGGGACCGA CGGTCCCGAG GCGCGGCGCC GTGCCGCGGC GGAGCTGGCC GAGCTCAAGC GGCAGCGCGC CCTGGGGCGC ACCTACCGGG ACCACGCGCT CGCCGGGGTG CTCGCCCGCT ACGACGCCGC ACTGCGTGAT CGCAACATGG TCGACCTCGA CGACCTGCTC ACGCTGCCGT TGGGGCTGCT GCGGTCGAAC CCGGACCTGG CGGCCGAGTA CCAGCGGCGC TGGCGGCACG TCTGGGTCGA CGAGTTCCAG GACATCGACG AGGTGCAGTA CGGCCTGCTG CGCGAGCTCT GCCCGCCCGC GGCCGACCTG TGCGCGATCG GCGACCCCGA CCAGGCGATC TACAGCTTCC GCGGCGCCGA GGTCGGCTTC TTCCTGCGCT TCGAGCAGGA CCGTCCCGGC GCCCGGCGGG TCTCGCTCAC CCGCAACTAC CGCTCGACGC CCACCATCGT CGGCGCCGCG CTGGACGCCA TCGCGCCCAC CACCTTCGTG CCCGGCCGCG AGCTGCGCGC GGTGACCGGC CTGGATGAGG ACGGCCCCGT CGTCGTCCGG CAGTGCGCGA GCGAGGCCGA GGAGGCCATC GCCGTGGCCG ACCTGATCGA GGAGGCCCTC GGCGGCACCT CGTTCCACTC CCTCGACTCG GGCGTGGACG GCTCGGAGCG GGGCCGGCTG TCGTTCGCCG ACATCGCGGT GCTCTACCGG ACGGCGCGCC AGGCCGAGCC GGTGATGGAC GCGCTGGTGC GCCGCGGGTT CCCGTTCCGC CAGCATGCCC ACACCCCGCT CGCGGACGTC CCGGCCGTCG CCGCGGTGCT CGCCCTGCTG CGCGATCGCC GTGACCCCGC GCCCCGTACG GTCACCGCGC TGCTGCGGGA CGCGGCCGCG CGGCTCACCG ACCTCGCCGT GGCCGCCGGC ACCGAGCTCG GCACCGCTGC CGCCCTCGCG CCCGGTGTCG CTCAGCCCGG CGGGGTCCCG GCGGGACGGC TGCCGAGCGA GGCGGAGATC CGGCGCGCCG CGGAGCTGCT CGCCCCGGCG GCCGCGGTCG CCGGGGACGA TCTAGGCGCG TTCCTCACCT CGGTGACCCT GGCGACGGAG GTGGACGGGC TCGACCCGCG GGCGGACCGC ATCTCCCTGC TCACGCTGCA CGCGTCCAAG GGGCTGGAGT TCGGCCTCGT CATCATCGTC GGCTGCGAGG ACGGCCTGCT GCCGCTGCGC TGGGGTGGGT CGAGCCCGTC GGCCGACGCC GAGGCGGAGG AACGGCGGCT GCTCTTCGTC GGGATGACCC GGGCCCGCCG CCAGCTCGTC CTCACCCATG CCGCCCGGCG CCGGCGCGGC GGCGACCTGG CCGACACCGG CCCGTCGGCG TTCCTCTCCG CCATCGACGG ACGTCATCTC GACAAGCGCA CGCGCGCCGC GCAGGGCACA GCCGCCGCCG CACGCCGTCG GTCAGCCGGC CGCCAGCTCC GCCTGATTTG A
|
Protein sequence | MRSRFFADTH VHSRYSRACS RDCDLEHLAW WAARKGIRVV GTGDFTHPAW SQELATKLVP AEPGLFRLRP DLERDVLATL PASCRTATRF MISTEISTIY KRGDRTRKVH HLLYAPDRES AGRITSALAR IGNLSADGRP ILGLDSRDLL EITLGAGEGC YLIPAHVWTP WFAVLGSKSG FDAVEDCYAD LAEHVFALET GLSADPEMFW RISGLDRYRL VSNSDAHSPP MLGREATAFT CEPDYFSILS ALRTGDGFGG TVEFFPEEGK YHHDGHRKCG VALTPEQTRA VGGICPECGR PLTVGVLNRV DTLADRTEVH RPDTAGDVTS LVALPEIVGE ILGVGPKSKS VATQVSTLVS RLGPELGILS DVPLDAIAEV GSPTLVEGIG RLRRGEVIRQ AGFDGEYGVI RLFQPDELTA DAGTLFDLGT PGLATAPQAG IRRGRGAGRG GTALAPGAAE AAEVPEVPEV PEVAGAAGAT DAAKITEAGD LFPGDDAVER SGPPSVLDGL DPDQRRAAEH VGGPLLVLAG PGTGKTRTLV HAIAHRVREL GVPAEQCLAV TFTRRAAGEL RERLDVLLAP GPAGTRAEST PADGTPTDAA STSGAPVDEA PGDTGRPGAG ALATTFHGLG LLIIREQHVA LARAPRVQVA DDAIRGELID EAMHGTDGPE ARRRAAAELA ELKRQRALGR TYRDHALAGV LARYDAALRD RNMVDLDDLL TLPLGLLRSN PDLAAEYQRR WRHVWVDEFQ DIDEVQYGLL RELCPPAADL CAIGDPDQAI YSFRGAEVGF FLRFEQDRPG ARRVSLTRNY RSTPTIVGAA LDAIAPTTFV PGRELRAVTG LDEDGPVVVR QCASEAEEAI AVADLIEEAL GGTSFHSLDS GVDGSERGRL SFADIAVLYR TARQAEPVMD ALVRRGFPFR QHAHTPLADV PAVAAVLALL RDRRDPAPRT VTALLRDAAA RLTDLAVAAG TELGTAAALA PGVAQPGGVP AGRLPSEAEI RRAAELLAPA AAVAGDDLGA FLTSVTLATE VDGLDPRADR ISLLTLHASK GLEFGLVIIV GCEDGLLPLR WGGSSPSADA EAEERRLLFV GMTRARRQLV LTHAARRRRG GDLADTGPSA FLSAIDGRHL DKRTRAAQGT AAAARRRSAG RQLRLI
|
| |