Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5135 |
Symbol | |
ID | 5673469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6151638 |
End bp | 6153506 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243985 |
Product | type III restriction protein res subunit |
Protein accession | YP_001509399 |
Protein GI | 158316891 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0782203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCCGC GGTGCGCGGC GTCGTCGTCC GGCTGGCGCA CCACCATAGA GTCTGATGTC GTGAGAGTCG GACCCCTGCC TCAGAGCTCC AGCGGGCAAG GCCCCAGCGG ACGAGGCCCC CGCGCGCGCG GCGAGGCCCG GCCCCTGCGG GCATGGCAAC GCGCAGCCCT GGAGACCTAC CGGTCGCGCA GCGCGTCCGG CGGCCGCGAC TTCCTGGCCG TCGCGACCCC GGGCGCCGGC AAGACGACGT TCGCGCTGGA GATCGCCGCC GACCTGTTGG CCGCGGGCGA GGTGCGTTCG GTGACCGTGG TCGCCCCGAC CGAGCACCTC AAGCGGCAGT GGGCCAACGC CGCCTCCGCG GTCGGGGTCG ACCTGGACCC GACCTTCCGC AACTCGGCCG GCGCCACCGC GTCGGACTAC ACCGGGGTCG CGGTCACCTA CGCCCAGGTC GCCGCGCACC CCGCCCTGCA CCGCATGCGC ACGGCCGCGC GCCGCACGCT GGTGATCCTC GACGAGATCC ACCACGCGGG CGACGCCCTG TCCTGGGGCG AGGCGGTCCG GGAGGCGTTC GAGCCGGCCG CCCGCCGGCT CGCACTCACC GGAACCCCGT TCCGGTCCGA CGTCAACCCG ATCCCGTTCG TGACCTACCT GCCCGACGCC GAGGGCGTGA CGCGCAGCGT CGCGGACTCC TCCTACGGTT ACGCCGAGGC GCTGCGTGAC GGTGTGGTCC GTCCCGTGCT CTTCCTCGCC TACTCGGGTG AGATGTCCTG GCGCACCAGC GCCGGCGCGG AGCTCAGCGC CCGGCTCGGT GAGCCGCTGA ACAGCGAGCA GACCGCGGCA GCGTGGCGCA CGGCCCTCGA CCCCCGCGGA GACTGGATGC CCGCGGTCCT GGCCGCCGCC GACACCCGGC TCTCCCAGGT GCGCCGGGGC GGGATGCCGG ATGCCGGAGG CCTGGTCATC GCGACCGACC ACACCAACGC CCGCGCCTAC GCCGGCCTGC TGCGGCGGAT CACCGGCGCG TCTCCCGTGA TCGTCCTCTC CGACGACCCG ACCGCCAGCA CGAAGATCGC CACGTTCCGT GAGTCGACGG ACCGGTGGAT GGTCGCCGTC CGCATGGTCA GTGAGGGCGT GGACGTCCCC CGCCTGGCGG TCGGCGTGTA CGCCACCTCG GCCTCGACGC CGCTGTACTT CGCCCAGGCC GTCGGCCGGT TCGTCCGTGG CCGCGGACGC TCGGAGACGG CGTCGGTGTT CCTGCCCAGC GTCCCGTCGC TGCTGGCGCT GGCCGGCGAG ATGGAGGTCC AGCGCGACCA CGCGCTCGAC AAGCCGCAGC GCGAGCCCGA CGCGTTCGAC GACGACGCGC TGCGCGAGGC CAACCGCCGC CGTGACACCC CCGACAAGCC CGACACCCTG TTCACCGCGC TCGGCTCCTC CGCCCAACTC GACCGGGTGA TCTTCGACGG CGGCGAGTTC GGCACGCCGG CCGCCTCCGG CTCCCTCGAG GAGGAGGACT TCCTGGGTCT GCCGGGCCTG CTCGAGCCCG ACCAGGTCGC GACCCTGCTG CGCCAGCGCC AGGCGGCGCA GCAGGCCGCC GCGGCGAAGG CGCAGTCCGC GGCCGGCGAA CCCGTGGTGC CGGCCGCCCG GCAGGGGGAG GCGGGCACCG ACCCGGGCGA CCGGCCCGTC CACGAGCAGA TCGGTGACCT GCGGCGCGAG CTGAACAAGC TTGTCGCCGC GCACTACCAT CGCACCGGAA AGCCGCACGG GATGATCCAC GCCGAGCTGC GCCGCTCCTG CGGCGGCCCG CCGAGCGCCC AGGCCAGCAC GGCCCAGCTC CAGGCCCGGA TCGACACGAT GCGCCGCTGG GCCGGCTGA
|
Protein sequence | MAPRCAASSS GWRTTIESDV VRVGPLPQSS SGQGPSGRGP RARGEARPLR AWQRAALETY RSRSASGGRD FLAVATPGAG KTTFALEIAA DLLAAGEVRS VTVVAPTEHL KRQWANAASA VGVDLDPTFR NSAGATASDY TGVAVTYAQV AAHPALHRMR TAARRTLVIL DEIHHAGDAL SWGEAVREAF EPAARRLALT GTPFRSDVNP IPFVTYLPDA EGVTRSVADS SYGYAEALRD GVVRPVLFLA YSGEMSWRTS AGAELSARLG EPLNSEQTAA AWRTALDPRG DWMPAVLAAA DTRLSQVRRG GMPDAGGLVI ATDHTNARAY AGLLRRITGA SPVIVLSDDP TASTKIATFR ESTDRWMVAV RMVSEGVDVP RLAVGVYATS ASTPLYFAQA VGRFVRGRGR SETASVFLPS VPSLLALAGE MEVQRDHALD KPQREPDAFD DDALREANRR RDTPDKPDTL FTALGSSAQL DRVIFDGGEF GTPAASGSLE EEDFLGLPGL LEPDQVATLL RQRQAAQQAA AAKAQSAAGE PVVPAARQGE AGTDPGDRPV HEQIGDLRRE LNKLVAAHYH RTGKPHGMIH AELRRSCGGP PSAQASTAQL QARIDTMRRW AG
|
| |