Gene Information |
Locus tag | Franean1_4789 |
Symbol | |
ID | 5673130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5716390 |
End bp | 5717370 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243645 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001509061 |
Protein GI | 158316553 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
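
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_length // 3 - 1


# Values taken from the record for Franean1_4789.
start_bp, end_bp = 5716390, 5717370
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 981 326
```

Under these assumptions the computed values match the Gene Length (981 bp) and Protein Length (326 aa) fields.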
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.232227 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCTC GGGTAGTGAA GCGGGCATAT CGTTATCGCT TCTACCCGAA TCCCGAGCAG GTCGCGCAGC TGGCTCGTAC CTTCGGCTGT GTCCGCTACG TCTACAACCG GGCGCTGGCC GAACGGTCCC GCGCCTGGAC CCAGGAGCAG CGCAGGGTCA CGCACGCGGA GACCGACAGG ATGCTCACCG TGTGGAAGCG GGACACGGAG ACGGCGTGGC TGGCCGAACC GTCGAAAGGG CCACTGCAGG CCGCGCTGCG CCATCTGCAG GCGGCGTTCG TGAAATTCTG GGAAAAACGG GCCGGATATC CGTCTTTCAA GAAGAAGGGC AGGAGCCTCG ATTCGGCGAC CTATTTCCGG AACTGCTTCA CCTACCGGAA CGGGAACATC ATACTGGCCA AGCAGGACCG GCCGTTGGAC ATCGTCTGGT CGCGTCCGCT GCCCGACAGC GCAGCGCCCT CGCGGGTGAC GGTGTCGCGG AACGCCCGCG GCCAGTACCA CGTCTCGATC CTGGTCGAGG ACACCGTCAC CAGCCTCCCG CCGGCTGGGG GGCAGGTCGG GATCGACGCG GGCATCACAG CGCTGGTCAC CTTGTCGACC GGGGAGAAAG TCACCAACCC CCGGCACGAG CGCCGCGACC GTGTCCGGCT GGCGCGGGCG CAGCGGGACC TGTCCCGCAA GGCGAAGGGC TCGGCGAACC GGGTGAAGGC CCGCGTGAGG GTCGCGGAGA TTCACGGCAG GATCGCGGAT CGGCGCCGGG ATCATCTGCG CAAGCTGTCC ACGAGGATCA TCCGCGAGAA CCAAACGGTG GCCGTCGAGG ACCTGTCCGT CCGCACCATG GTCCGCAACC ACTCGCTGGC CCGTGCTATC TCCGACGCAT CCTGGTCGGA GTTGCGGGCG ATGGTGGAGT ACAAAGCCGA CTGGTACGAC GGAACATTCT GGCCTCGGGG CTCGCGGTGT CCGCCTGTGG AGATGGAGTG A
Protein sequence | MGSRVVKRAY RYRFYPNPEQ VAQLARTFGC VRYVYNRALA ERSRAWTQEQ RRVTHAETDR MLTVWKRDTE TAWLAEPSKG PLQAALRHLQ AAFVKFWEKR AGYPSFKKKG RSLDSATYFR NCFTYRNGNI ILAKQDRPLD IVWSRPLPDS AAPSRVTVSR NARGQYHVSI LVEDTVTSLP PAGGQVGIDA GITALVTLST GEKVTNPRHE RRDRVRLARA QRDLSRKAKG SANRVKARVR VAEIHGRIAD RRRDHLRKLS TRIIRENQTV AVEDLSVRTM VRNHSLARAI SDASWSELRA MVEYKADWYD GTFWPRGSRC PPVEME