Gene Information |
Locus tag | Franean1_5420 |
Symbol | |
ID | 5673751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6545309 |
End bp | 6548449 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244275 |
Product | helicase domain-containing protein |
Protein accession | YP_001509681 |
Protein GI | 158317173 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
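
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 6545309
end_bp = 6548449

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which does not
# contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields in the record.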
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.795016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTACGT TCGCGCCGGG TTCGATCGTG GTGGTCCGGG ACGAAGAATG GCTGGTCACG GGGGCCGAGC AGGGCACGGA TGGCTGGCGG TTGGACGTGG TCGGCCTCGG TGAGCTCGTG CGGGAGACCA CGGCGACCTT CTTCAGCGGG CTCGACCACA TCGAGTTGCT GGACCCCCGC GAGGCAGAAC TGCTCCCGGA CGGCTCCCCG CGGCACCGGC GCACCCGGCT GTGGCTGGAG GCGACACTAC GCAGGACGCC GATGCCGGCC GGCGAGACCT CGCTGACCGT CTCCGACGGG ATGCTGGCGA CGCACCTCGA CTACCAGCGG CGGGCGGTCG CGCACGCGCT GTCCCCGACG AACCTGCGGC CCCGTGTCCT CATCGCGGAT GCCGTCGGCC TCGGCAAGAC CCTGGAGATC GGGATGCTGC TCGCCGAGCT GACCCGGCGC GGGCGGGCTG ACCGGGTGCT GGTGGTGACC CCGCGGCACG TCCTCGAACA GATGCAGCAC GAACTGTGGT GCCGGTTCGG ACTGCCGCTC GTCCGGCTCG ACAGTGATGG TCTGCAGCGG GTGCGGCAGA ACCTGCCGGC GAGCAGGAAT CCGTTCACGT TCTACCGCCG CATCATCGTC TCGATCGACA CCTTGAAATC GCCGCGCTAC CGGTCCTTTC TGGAGCGGCA CCGCTGGGAT GTCGTCGTCA TCGACGAATC ACACAACCTG ACGAACACCG GCACGCTGAA CAACGAACTC GCGCGGGTTC TCGCACCGAA CACGGAAGCG CTGGTCCTGG CATCGGCGAC GCCGCACAAC GGGAAGAAGG AGTCGTTCGC GGAACTCCTG CGGCTGCTCG ACCCGACGGC TGTAGGCCCG GACGGCGAGT ACGACGTCGC TGACGTGGAG CGGCTGTTCA TCCGCCGGCA CCGCCACTCC CCCGAGGTCG CCGCGGAGGT CGGCGCCGAC TGGGCGCTGC GTCCCGAGCC CGTGGTGATC CCGGTGGCCG CGTCGCCGGC GGAGGACGCG ATCGCCACGG AGATCTCCCG GACCTGGCTG TACCCACAGG GCCCGTCCCC CGTCGCGGGC CGCGGGTCGG CGCTGTTCCC CTGGACGCTG GCGAAGGCGT CCCTCTCCTC CCCGGCCGCG CTGCTGGAGA CGACCGAGGC CCGCCTCAAG CGGCTCGCGT CCCGAAGCGG CGGCGGGGCA GGGGAGAGGA ACGGCGGCGG CGATCACGAG CTGGAGCGGC GGGCCCTCGA GCGCCTGCGT GACCTCACCG AGCTGGCCCT GGCCGGGGAG AGCGCCAAGC TCACCGCGCT CGCCGAGTAC CTGCGCACCA TCGGTGTCGG TGCCCGCTCG GCGACCCGCG CGGTGCTGTT CGCCGAGCGG GTCGCCACCC TGCGCTGGCT CGCCAGCGAG CTCCCCGAGC GCCTGGGCCT CGCCAAGGAT CAGATCGCCG TCATGCACGG CGGCCTGCCG GATGTCGAGC AGGAGCGGAT CGTCGACGAC TTCAAGACCA CGGCGAGCCC GGTCCGGCTG CTGATCACCG GGGATGTCGC CAGTGAGGGC GTCAACCTGC ACGCGCAGTG CCACCACCTG GTCCATGTCG ACATCCCGTG GTCGCTGATC AGGATCGAGC AGCGCAACGG GCGCATCGAC CGCTACGGCC AGAAGCATCC GCCGCAGATC GCGGCACTGG CGCTGGTGCC GTCCGACGAC CGGTTCAGCG GTGACGTGCG GGTGCTGCAG CGCCTGCTGG CCAAGGAGCA TCTCGCCCAC ACGACACTGG GCGACGCGGC GACGCTGATG 
CACCTGCACT CGGCGAGCGG CGAGGAGGAC GCGATCCGGG ACGCGCTCGC GCGCGGCCAG AACCTGGACG AGGTCGTCCC CGACCCGGGT TCGGGGCAGG GCTCCGAGGA GTTCTTCGGG TTCTTCGACG AGGAGTTCGC CGCCGCCGGC GACGACCTGC CGCCGGCTCC CCCGGACCGG CCCCGCGAGT CGCTGTACCC CACCGACGCC GACTTCCTCG CCGACGCGGT CGCCGAGGTG TACGACGATC CCGCCCGCGC GCCGGACGAC AAGGATCCCG CCCGGGGCGG CGTCGGCTGG AAGGTCTTCC GGGACAAAAG TCTGATCGCG CTCCGACCGC CACGTGACCT GCGGGTGCGC CTCGACGCAC TGCCGGCGTC GTATGTCGCC GAACGCGGGA TCCGCGAGCA GCTGCTGCTC GCCGTGACAC CGTCGGTCGC CTTGGACGGG CTGCGCGCGG CACGCGAGGG GCAGACCGGC GGCGGCCCGG GACGGCCGGC GCTGGTGGCC GCCGCGACGA CCGCGGCTGC GACGACTGCG GCTGGGACGA CTGCGGCTGG GACGGCCGCG AGCGGGCCGG GGGGCGTCAC ACCGGCGGGT CGCAGGGGAC GTCCGGCGAG GGCCCGCGAG GTCACCGCGC CCACCCCGTC CACGTGGCCG GAGGCGCATT TCCTCTCCCC GCTGCACCCG GTGCTCGACT GGGCGGCGGA CAAGGTGCTC GCGGCCGGCG GCCGCAACGA GGTGCCGCTG GTGCGCGGGC CGGTCGACGT CCCGCGGGTG CTGGTGATCG CGACGCTGAT GAACCGGCGC GGCCAGGTCG TGACCCGCCA GATGGTCGTC GTCGAGTTCC CGACCGGACG GGCCGATCTG CCGATCGCGC AGGTCGTCGA GGGTCTGGAG CTGTTCGCGG GCACGGGGCT GATCCCCGGG CCCGGCGAGC GGGAGCCCGC GGTGAACCCG GGCGCGGCCG TGGCCACCGA CGAGCTGCGC GCGCTGGTGC CGGCCGCGAT CGACGCGGCG GCCCGCGACC TCGACATGGC CGAGGACATC CAGCACTCGG ACCTGGAGCG GCGGCTGGCG GACTGGTCGA CCCGGCGGAC CCGCTGGCGG GAGCAGGCGG CGCAGCTCGA ACTCGAGATG ACGGGGCCGG GGCTGGCGAA GGTCCGCCGG CTGTCGAAGC AGGTCAGCCT TGAGGAGGAG ATCGCGCGGT CGCTGCGCGC CAGCCAGCGG CTGCTGCGCC CGCTCGCCGT CGTGATCCCG GCGACCGCCG GCGGCAGGAG CGACGGCGAT GCGGCCGGCA CCGAGATCGG CACCGGGACG ACCGACGGGG GGAACGTCTG A
|
Protein sequence | MTTFAPGSIV VVRDEEWLVT GAEQGTDGWR LDVVGLGELV RETTATFFSG LDHIELLDPR EAELLPDGSP RHRRTRLWLE ATLRRTPMPA GETSLTVSDG MLATHLDYQR RAVAHALSPT NLRPRVLIAD AVGLGKTLEI GMLLAELTRR GRADRVLVVT PRHVLEQMQH ELWCRFGLPL VRLDSDGLQR VRQNLPASRN PFTFYRRIIV SIDTLKSPRY RSFLERHRWD VVVIDESHNL TNTGTLNNEL ARVLAPNTEA LVLASATPHN GKKESFAELL RLLDPTAVGP DGEYDVADVE RLFIRRHRHS PEVAAEVGAD WALRPEPVVI PVAASPAEDA IATEISRTWL YPQGPSPVAG RGSALFPWTL AKASLSSPAA LLETTEARLK RLASRSGGGA GERNGGGDHE LERRALERLR DLTELALAGE SAKLTALAEY LRTIGVGARS ATRAVLFAER VATLRWLASE LPERLGLAKD QIAVMHGGLP DVEQERIVDD FKTTASPVRL LITGDVASEG VNLHAQCHHL VHVDIPWSLI RIEQRNGRID RYGQKHPPQI AALALVPSDD RFSGDVRVLQ RLLAKEHLAH TTLGDAATLM HLHSASGEED AIRDALARGQ NLDEVVPDPG SGQGSEEFFG FFDEEFAAAG DDLPPAPPDR PRESLYPTDA DFLADAVAEV YDDPARAPDD KDPARGGVGW KVFRDKSLIA LRPPRDLRVR LDALPASYVA ERGIREQLLL AVTPSVALDG LRAAREGQTG GGPGRPALVA AATTAAATTA AGTTAAGTAA SGPGGVTPAG RRGRPARARE VTAPTPSTWP EAHFLSPLHP VLDWAADKVL AAGGRNEVPL VRGPVDVPRV LVIATLMNRR GQVVTRQMVV VEFPTGRADL PIAQVVEGLE LFAGTGLIPG PGEREPAVNP GAAVATDELR ALVPAAIDAA ARDLDMAEDI QHSDLERRLA DWSTRRTRWR EQAAQLELEM TGPGLAKVRR LSKQVSLEEE IARSLRASQR LLRPLAVVIP ATAGGRSDGD AAGTEIGTGT TDGGNV
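
As a rough consistency check between the two sequences, the first 30 bases of the gene can be translated by hand. This is a minimal sketch, not a full translation tool: it assumes translation table 11 assigns the same amino acids to codons as the standard code, and that an initiating GTG codon is read as methionine. The helper name `translate_cds` is hypothetical.

```python
# Standard codon-to-amino-acid assignments, built in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0:
            # Assumption: the initiator codon (here GTG) encodes Met.
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
first_bases = "GTGACTACGT TCGCGCCGGG TTCGATCGTG"
print(translate_cds(first_bases))
```

The result can be compared against the first ten residues of the protein sequence listed above (MTTFAPGSIV).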