Gene Information |
Locus tag | Franean1_0685 |
Symbol | |
ID | 5669102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 803665 |
End bp | 804993 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239612 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001505050 |
Protein GI | 158312542 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.289129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGT TGCAGGCGTA CCGGTTCGCG CTCGACCCGA ACGACGCCCA GGCCGCGAAC CTGCGTCGCC ACGCGGGGGC GGCCCGGTTC GCCTACAACT GGGGCCTGGC CCGGGTGAAG GCCGCGTTCG CGCAGCGCGA CGCGGAGAAG TCCTACGGCC TGGACGGCGA CCTGCTCACC CCGGTGCCGT GGACGCTGCC CGCGCTGCGG CTTGCCTGGA ACGCCGCCAA GCGGGACGTC GCGCCCTGGT GGGACGAGTG CTCGAAGGAG GCGTACTCGG CCGGGCTGGA CCAGTTGGCC CGTGCGTTGA AGAACTTCAC CGACTCCCGG AAGGGAAAGC GCAACGGCCG CCGGGTCGGT TTTCCCCGGT TCAAGAAGCG CGGGAAGGCC CGCGACTCGT TCCGCTACAC GACCGGTGCC TACGGCCCCG CGACCGATCT GTACGTGAAA CTGCCCCGGA TCGGCCGGGT CAAGGTCGGC GAGCCGATGG GCGCGCTCAC GTCGCGGCTG GCGGATGGCC GGGCGCGGCT GGGTGGCGCG ACGGTGTCCC GGACGGCTGG CCGCTGGTTC GTGGCGTTCA CCGTCGACAC CGACCGGGAC GTTTCCGAAC GGCCGACCCG CCGTCAGTGG ACGGGCGGCA CGGTCGGCGT CGACCTGGGC GTGAAACACC TCGCGGTCCT CTCCACCGGC GAGACGGTGG CGAACCCGAA ACGGCACGCC GCCGCGCTGC GGAAACTGCG CCGCGCGTCG CGGGCCTATG CCCGGTCGAA GCCGGGTAGC GCTGGGCGCC GGCAGCGCGC CGCCGGGCTC GCGACGATCC ATGCCCGGGT CGCGAACCAG CGCCGCGACG GGCTGCACAA GCTCACGACA CGGCTCGCCC GGTCCCACGA CGTGATCGTG GTCGAGGATC TACACGTCGC CGGGATGGTC CGCAACCGGC GGCTCGCCCG CGCCGTCTCG GACGTCGGGA TGGGTGAGAT CCGCCGGCAA CTCGACTACA AGACCCGCTG GTACGGTTCG CGGCTGCACG TCGCGGACCG CTGGTATCCG TCCTCGAAGA CCTGTTCCGG CTGCGGCTGG CGAAACCCAA GCCTGACGCT GTCGGACCGC ACGTTCCGCT GCCAGTCCTG CGGGCTGGTG GCCGACCGCG ACCACAACGC CGCGATCAAC CTCAGACACC AGGTCGCCGC CAGTACGTCG GAGACCGTAA ACGCCCGTGG AGCCGACCAT AAGACCCGCA CGAGCGGGCA GGTGGCTGGG AAGCGGGAAC CTGGCACGGC CAAGGCCGGT CAGACCAGGA GTGCCGGCGC GCAAGTGCCG GCGGCGTGA
|
Protein sequence | MRTLQAYRFA LDPNDAQAAN LRRHAGAARF AYNWGLARVK AAFAQRDAEK SYGLDGDLLT PVPWTLPALR LAWNAAKRDV APWWDECSKE AYSAGLDQLA RALKNFTDSR KGKRNGRRVG FPRFKKRGKA RDSFRYTTGA YGPATDLYVK LPRIGRVKVG EPMGALTSRL ADGRARLGGA TVSRTAGRWF VAFTVDTDRD VSERPTRRQW TGGTVGVDLG VKHLAVLSTG ETVANPKRHA AALRKLRRAS RAYARSKPGS AGRRQRAAGL ATIHARVANQ RRDGLHKLTT RLARSHDVIV VEDLHVAGMV RNRRLARAVS DVGMGEIRRQ LDYKTRWYGS RLHVADRWYP SSKTCSGCGW RNPSLTLSDR TFRCQSCGLV ADRDHNAAIN LRHQVAASTS ETVNARGADH KTRTSGQVAG KREPGTAKAG QTRSAGAQVP AA
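
The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS ends in a stop codon (the record lists the gene as unspliced), and that the sequences are shown in space-separated blocks of 10. All function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(803665, 804993)
residues = protein_length(length)
print(length, residues)  # prints: 1329 442
```

Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content, and `len(strip_blocks(...))` applied to either sequence field can be compared with the Gene Length and Protein Length fields.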