Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2165 |
Symbol | |
ID | 5670565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2599255 |
End bp | 2600388 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241086 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001506507 |
Protein GI | 158313999 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.896467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAAGC GGGCGTACCG CTACCGCTTC AACCCGACCC CCGATCAGGC CGCCCAGCTC GCGCGAACCT TCGGCTGCGT CCGCTACGTG TACAACCGGG CGCTCGCCGA ACGGCACCGG GCCTGGTTCC AGGAGCAGCG GCGGGTCACC CACGCCGAGA CCGACCGGAT GCTCACGGCG TGGAAACGCG ACCCGGAAAC GGAATGGCTC GCCGAGCCGT CGAAAGGCCC GCTTCAGGCC ACGCTGCGGA ATCTCCAGAC CGCGTATGTG AACTTCTGGC AGAAACGCGC CGGCTACCCG ACGTTCAAGA AGAAGGGCAG GACCCTCGAC TCGGCGACCT ACTTCCGGAA CTGTTTCAGT TTTCGGGACG GTCGGATCAC GCTGGCGAAG CAGGACGCGC CGCTGGCGAT CGTCTGGTCG CGTCCGCTGC CCGAGGGCGC GGAGCCCTCG CAGGTCACGG TGTCGCGGAA CGCCCGCGGC CAGTACCACG TCTCGATCCT GGTCGAAGAG AAGATCACTA CGCTTCCCGC GTTGCCCGGG CGGGTGGGGA TCGACGCGGG GGTCACCTCG CTGGTCACCC TGTCGACGGG GGAGAAGGTG GCCAACCCGA AGCACGAGCG TCGGGATCGG GCCCGGCTGG CCTGTGCGCA GCGGGACCTG TCCCGGAAGG TGCAGGGGTC GGTGAACCGG GCGAAGGCCC GAGCGAGGGT CGCCCGGGTG CACGGGCGGA TCGCCGACCG GCGTCGGGAT CATCTCCACC AGCTGTCCAC GAGGATCATC CGCGAGAACC AAACGGTGGT CATCGAGGAT CTGTCCGTCC GCAACATGGT CAGGAACCAT TCGCTCGCGC GGGCGATCTC CGATGCTTCG TGGTCGGAGT TGCGGCGGAT GTTGGAGTAC AAGGCCGGCT GGTACGGTCG CACCATCATT GCGATCGATC GGTTCTATCC GTCGTCCAAA ACCTGTTCGG TGTGCGGGTC GATCGTGAAG GAACTGCCGC TCAACGTCCG GGAACGGGCC TGCCGTGGTT GCGGCACGGT CCACGACCGG GACGTGAACG CGGCGGTCAA CATTCTGGCC GCGGGGCTCG CGGTGGCTGC CTGTGGAGAT GGAGTGAGAC CGCCTCGCTC CTGA
|
Protein sequence | MVKRAYRYRF NPTPDQAAQL ARTFGCVRYV YNRALAERHR AWFQEQRRVT HAETDRMLTA WKRDPETEWL AEPSKGPLQA TLRNLQTAYV NFWQKRAGYP TFKKKGRTLD SATYFRNCFS FRDGRITLAK QDAPLAIVWS RPLPEGAEPS QVTVSRNARG QYHVSILVEE KITTLPALPG RVGIDAGVTS LVTLSTGEKV ANPKHERRDR ARLACAQRDL SRKVQGSVNR AKARARVARV HGRIADRRRD HLHQLSTRII RENQTVVIED LSVRNMVRNH SLARAISDAS WSELRRMLEY KAGWYGRTII AIDRFYPSSK TCSVCGSIVK ELPLNVRERA CRGCGTVHDR DVNAAVNILA AGLAVAACGD GVRPPRS
|
| |