Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5114 |
Symbol | |
ID | 5673449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6124272 |
End bp | 6125657 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243965 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001509379 |
Protein GI | 158316871 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.279407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.776114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGTTC TCCAGGCGTA CCGGTTCGCA CTCGACCCGA ACCAGGCGCA GCTTGCCGAC CTGCGGCGTC ATGCCGGGGC GGCCCGGTTC GTGTTCAACT GGGGTCTGGC CCGGGTGAAA GCCGCCCTGT CGCAGCGAGA CGCCGAGCAG TCCTACGGCC TGACCGGGGA CATGCTCACC CCGGTGCCGT GGACGCTGCC CGCGCTGCGC CTCGCCTGGA ACGAGGCCAA GAGCACCGTC GCCCCGTGGT GGTCGGCGTG CTCGAAGGAG GCGTACTCGT CCGGCCTCGA CCAGCTCGCC CGCGGGTTGA AGAACTTCAC TGATTCCCGC ACGGGGAAGC GGAAGGGGAA GCGGGTCGGT TTCCCCCGGT TCAAGAAGCG TGGCAGGGCC CGCGACTCGT TCCGGTACAC GACCGGCGCC TACGGCCCGG CCAGCAACCT ACAGGTGAAG CTGCCCCGCC TGGGCCGGGT CAAGGTCCAC GAGGCGATGG GTGCGCTCAC CGGCCGGTTG GCGGCCGGGT CCGCCCGGCT GCTCGGTGCG ACCGTGTCGC GCACGGCGGG CCGCTGGTTC GTGGCGTTCA CCGTCGAGAT CGATCGTGAG ATCCCCCAGA ACCCGTCGGC CCGCCAACGT GCGGGCGGCG CGGTCGGTGT CGACGCGGGA GTGAAGCACC TCGCTGTCCT GTCGACCGGT GAGCAGGTCG ACAACCCCAG GCCACTCACC CGCTCGCTGC GCAGGCTGCG CACTGCGTCC CGGGCCTGTG CCCGCTCGAA GCCGGGCAGT GCCGGCCGCC GCCAGCGCGC TGCCGCACTG GGCCACCTGC ACGCCCACAT TGCCCACCAG CGGCGCGATG GGCTGCACAA GCTCACCACC CGGCTCACGA AGAACCACGA CGTGATCGTG GTCGAGGATC TGCATGTCGC CGGGATGGTC CGTAACCGCC GGCTCGCCCG CGCGGTCTCG GACACCGGGA TGGCCGAGAT CCGGCGCCAA CTCACCTACA AGACCGTCTG GTACGGATCG CGGCTCGTCG TCGCGGACCG GTGGTATCCG TCGAGTAAGA CCTGTTCCGG CTGTGGCTGG CGAAACCCAA GCCTCACCCC GGCCGACCGC ACGTTCGCCT GCCAGTCCTG CGGGCTGGTG ATCGACCGCG ACCTGAACGC CGCGATCAAC CTGCGCAACC TCGTCGCCGC CAGTACGTCG GAGACGGAAA ACGCCCGTGG AGCCGACCGT AGGACCCAGC CTGCTGGGCG GGTGGCTGGG AAGCGGGAAC CCGGCACCGT GGGCATGACC CCGTGCGAGC GGGTCAGACC GGGGGTGCCT CACCGAGAGG CGAGGCGGCA TGACCGGGCG CTACCAGGCG CTCACATGCA ACGGCGGTGG CACTGA
|
Protein sequence | MKVLQAYRFA LDPNQAQLAD LRRHAGAARF VFNWGLARVK AALSQRDAEQ SYGLTGDMLT PVPWTLPALR LAWNEAKSTV APWWSACSKE AYSSGLDQLA RGLKNFTDSR TGKRKGKRVG FPRFKKRGRA RDSFRYTTGA YGPASNLQVK LPRLGRVKVH EAMGALTGRL AAGSARLLGA TVSRTAGRWF VAFTVEIDRE IPQNPSARQR AGGAVGVDAG VKHLAVLSTG EQVDNPRPLT RSLRRLRTAS RACARSKPGS AGRRQRAAAL GHLHAHIAHQ RRDGLHKLTT RLTKNHDVIV VEDLHVAGMV RNRRLARAVS DTGMAEIRRQ LTYKTVWYGS RLVVADRWYP SSKTCSGCGW RNPSLTPADR TFACQSCGLV IDRDLNAAIN LRNLVAASTS ETENARGADR RTQPAGRVAG KREPGTVGMT PCERVRPGVP HREARRHDRA LPGAHMQRRW H
|
| |