Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4039 |
Symbol | |
ID | 5672397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4817470 |
End bp | 4818603 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641242915 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001508332 |
Protein GI | 158315824 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0972683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAAGC GGGCGTACCG TTACCGCTTC TACCCGACCC CAGAGCAGGC CGAGCAGCTC GCCCGCGCCT TCGGCTGCGT CCGCTACGTC TACAACTGGG CACTCGCCGA GCGGTCTCGC GCGTGGTTTC AGGAACGGTG TCGCATCACG CACGCCGAGA CCGACAAGAT GCTCACCGCG TGGAAACGGG ACCCGGAGAC GGCATGGCTC GCCGAGCCGT CGAAAGGACC CTTGCAGGCC ACGCTACGGC ACCTTCAGTC GGCGTTCGTG AACTTCTGGG AGAAGCGGGC CGGCTACCCG TCTTTCAAGA AGAAGGGCAA GACCCTCGAG TCGGCGACCT ACTTCCGGAA CTGCTTCAGC TACCGGAACG GCGCTGTCAC CCTCGCCAAG CAGGACCGGC CGCTGGACAT CGTCTGGTCG CGTCCGCTGC CCGACGGCGC GGACCCGTCG CAGGTGACGG TGTCGCGGAA CGCCCGCGGC CAGTACCACA TCTCGATCCT GGTCGAGGAG ACCATCACCA CCCTGCCTCC GACATCGGGG CAGGTGGGGA TCGACGCGGG GATCACGAGT CTGGTCACTC TGTCGACCGG GGAGAAGGTC ACCAACCCCC GGCACGAGCG TGCCGACCGG GCTCGGCTCG CACGCGCACA GTGGGAATTG TCCCGCAAGG AGAAGGGCTC AGCGAACCGG GCGAAAGCCC GCGCGAGGGT CGCGAAGGTC CACGGCCGTA TCCGGGACCG TCGTCGGGAT CATCTGCACA AGCTGTCCAC GAGGATCATC CGCGAGAACC AAACGGTGGT CATCGAGGAC CTGTCCGTCC GCAACATGGT CCGCAGCCAT TCGCTCGCAC GGGCGATCTC CGACGCATCG TGGTCGGAGC TGCGGACGAT GCTGGAGTAC AAGGCCGGCT GGTACAGCCG CACCGTGATC GCGATCGACC GTTTCTACCC GAGCAGCAAG ACCTGTTCGG TGTGCGGGTC GATCGTCGAG AAGATGCCGT TGAACGTCCG GGAATGGGCC TGCCGCGGCT GCGGCACAGT CCACGACCGG GACGTGAATG CGGCGCGGAA CATTCTGGCC GCGGGGCTCG CGGTGGCTGC CTGTGGAGAT GGAGTGAGAC CGCCTCGCTC CTGA
|
Protein sequence | MVKRAYRYRF YPTPEQAEQL ARAFGCVRYV YNWALAERSR AWFQERCRIT HAETDKMLTA WKRDPETAWL AEPSKGPLQA TLRHLQSAFV NFWEKRAGYP SFKKKGKTLE SATYFRNCFS YRNGAVTLAK QDRPLDIVWS RPLPDGADPS QVTVSRNARG QYHISILVEE TITTLPPTSG QVGIDAGITS LVTLSTGEKV TNPRHERADR ARLARAQWEL SRKEKGSANR AKARARVAKV HGRIRDRRRD HLHKLSTRII RENQTVVIED LSVRNMVRSH SLARAISDAS WSELRTMLEY KAGWYSRTVI AIDRFYPSSK TCSVCGSIVE KMPLNVREWA CRGCGTVHDR DVNAARNILA AGLAVAACGD GVRPPRS
|
| |