Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7063 |
Symbol | |
ID | 5675373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8618727 |
End bp | 8620721 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641245908 |
Product | restriction endonuclease |
Protein accession | YP_001511299 |
Protein GI | 158318791 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGT CAGGTGGCGG TCAGTCGGAA TGGCAGCAGC GACGCGCCGC GGCTGCAGGT GCTGAGCGGG AGGCTGCGCG ACAACGTAAA CGCGAAGCGA CCGCGCGAGC AAAGCAGGAC GAGCGCGAGC GGGAGCAGGC CCAGCGCCAG CAGTCGGTGG ACGACGACAA CGCTGCCGCA GCCGCTCACA TCGCAGAGCT GGGCAGGAAC CTGCTACTCA ACGTCTTGGG CCTCCCTGCC TTCACCGTGG CCGCGCTGGA GGTAGTACCG GAGCAGTCGA TCTTTCAGCC GGGATCGCTG GCAGTCGCTG GTAGGGCGCC GGATTGGGAG CAATACAAGC CTCCGGCTCC TGGTCGGGCC AGCCGGCTGG TGGGCGGGTC GGGTCGATAC CAGCGGGAGT TCGACGCGGC CCGAAGCAAG TGGGAAGCCG ACACGGCCGA GTTCCAGCGG GTCGAGACCG ACCGGCTACG ACAACTCGCT GCGGCTCGTA CCAGACACGG CCAGCAGGTG GCCGCGGCCC TCGACCGTGC CGCGGCGCAC AACGCCCGCA TCGCGGCACA GTGGGCGGCG TGTCTGGACG GTGACCCGGA AGCGGTCGAG TGGTTCGTCG GCCAAGCGCT CGCGGCCACG TCCTATCCTG ACGGCTTCCC CGTAGCACGC AAGGTCGCCT ACCGGCCCCA GGAACACGAC ATCGTCATCG AGATCGAGTT CCCACGACGA TCCGTGATCC CCGAAATCAG GGGATACAAG CTATTCAAGA CCGCACCCGA GGTCAGGCCG GTAAAGTGGA AAGAGTCTGA AGTCAAGAAG CTATATGCGC AGCTCGTTGC CTGGATAACG TTGCGGATAG TGCACGAGGT TTTCGAGGCT ACGAAGGCGC TTGACCTCAT CGAGGTTGTT GTCTTCAACG GGACCGTCAT CGATGTAGCA CCCACGACCG GCAAAGATAC TCTCTACCAT CTGGTAAGTC TCGAACCCGA ACGGTCCCTG TTCGAGGCGA GCCTTGAACT CGACCGGGTC ACCGATCCGA TCGGATGCCT GCGCGAGCTG GGCGCGAAAG TCTCACCCAA CCCTTACGAC CTAGAAGCCG TGAAACCCGT TGTCACGTTC GATCTCCGCC GATTCAGGCT CGCCAACGAT GCGGCCGAAC TAGCCGATCT AGACTCCCGA CCGAACCTCA TGGAACTCAC CCCGAGCGAG TTCGAGAAAC TCATCGAAAA ACTTCTCAAG GCCATGGGTA TCGAAGCGTA TCGCACCATC GACTCCCGCG ACGATGGCAT AGACGTCGTC GCGACTAAAG ACGACATCAT CTTTGGTGGT GTCTGCCTGG TACAGGCTAA ACGTACCAAG AACCGAGTCG AGCTCGGGAC GGTTCAAGCG GTCGCTGGCT CCATGAACGA CCACAACGCC GCTACCCCCG ATGTTCAGCA GCAGTCGGTA CAGGCCCGTG GCCGGTCGAA CAGCCGGCAG CGCCTCGGTG GCGGCCAGGG CATCAGCCGA GGCGGGAATG CGCAGACCGT CAGCGTCCAG ATCGCCGAAG TAGGCGATCG AGGTGATGCT GGGCAGCTCT GCGACGGTAA CGGCGGTCAC CGCCTCGTCG AGGCCTTCCT GCGCGCGACG CTGGTCAGCC TGAGCCTCCA GCAACAGCTG CTGCCCATGG GCGACATCGC GAGGCCGCCG GTCGACGGGA AGTGCGGTCC ACACCTTGCG GTCGCGGTCG CCGGGGCTCG CCTGCTCCAG CTGAGTGCGC AGCGCCGAGG TCCGCTCGAG GGCAGTGTTG TGGACGGCGG CGTTCCGGTC GCGGTCACGC CGAGCCGGGC TACGGCGCTG GCCCGGCCCT CGCGGTCAGC GCCCTCGGTG CTGGCGAGCA ACCACTCGGC ACGGGCCACG ATCTCCGCCG GTCGGCCAGC GACCGCGGCG CGGGCACGGG CTGCCACGGT CTCGGCTGAC CTGGCCTCCA ATCGAAGATC TTGCCCCACC TCTACCGCCA GGTAG
|
Protein sequence | MAQSGGGQSE WQQRRAAAAG AEREAARQRK REATARAKQD EREREQAQRQ QSVDDDNAAA AAHIAELGRN LLLNVLGLPA FTVAALEVVP EQSIFQPGSL AVAGRAPDWE QYKPPAPGRA SRLVGGSGRY QREFDAARSK WEADTAEFQR VETDRLRQLA AARTRHGQQV AAALDRAAAH NARIAAQWAA CLDGDPEAVE WFVGQALAAT SYPDGFPVAR KVAYRPQEHD IVIEIEFPRR SVIPEIRGYK LFKTAPEVRP VKWKESEVKK LYAQLVAWIT LRIVHEVFEA TKALDLIEVV VFNGTVIDVA PTTGKDTLYH LVSLEPERSL FEASLELDRV TDPIGCLREL GAKVSPNPYD LEAVKPVVTF DLRRFRLAND AAELADLDSR PNLMELTPSE FEKLIEKLLK AMGIEAYRTI DSRDDGIDVV ATKDDIIFGG VCLVQAKRTK NRVELGTVQA VAGSMNDHNA ATPDVQQQSV QARGRSNSRQ RLGGGQGISR GGNAQTVSVQ IAEVGDRGDA GQLCDGNGGH RLVEAFLRAT LVSLSLQQQL LPMGDIARPP VDGKCGPHLA VAVAGARLLQ LSAQRRGPLE GSVVDGGVPV AVTPSRATAL ARPSRSAPSV LASNHSARAT ISAGRPATAA RARAATVSAD LASNRRSCPT STAR
|
| |