Gene Information |
Locus tag | Franean1_2998 |
Symbol | |
ID | 5671381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3524959 |
End bp | 3526182 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241901 |
Product | transposase IS111A/IS1328/IS1533 |
Protein accession | YP_001507321 |
Protein GI | 158314813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACTG TGTTCATTAG GCCGCTGATG GTTCTCCCGC TGGCTGCTCA CGGTGATCGG CATGAAGGGT GGGAGAGGCG TTGCGGCTGT TCATCGGGGA CGGGTTGGGT GGAAGACCAC CACGATGTGG AGGTGATGGA TGCCTCGGAG CGCCGGCTGG CGAAGGCCCG GCTGCCCGAG GGAGTGACCG GCATGGCCCG GCTGCACGCG CTGGTCGGCG AGCACGCCGG CGACGCCGCC GATGAGGTCG AGGTGTTGGT CGGGATCGAG ACCGACCGCG GCCCGTGGGT GCAGGCGCTG CTCGCCGCTG GCTACACCGT GTTCGCGATC AACCCGCTGC AGGCGTCCCA CTACCGGGCG CGGCACGCCG TGTCGGGCGC CAAGAGCGAC GCCGCCGACG CACACGCGCT CGCGGACATG GTCCGCACCG ACTCCCACCA GCTGCGGCCG GTCGCCGGGG ACACCGCCGC CGTCGAGGCG GTCAAGGTCG TGACCCGCAC CCACAAGACC CTGATCTGGG AACGCACCCG CCACACCCAG CGGCTGCGAC ACGCGCTGCG GGACTACTTC CCCGCCGCGC TGGCCGCGTT CGACGACCTG GACGCCGCCG ACACCCTGGA ACTGCTGGCC AAGGCACCGA CCCCGGCCGA GGCGGCCAGG CTGACCGTCA GCCAGATCAG CGCCGCGCTG CGCCACGCCC ACCACCGCGA CGTCCCCACC AAGGCCGCGG CGATCCAGGC CGCGCTGCGC GCGGAGCACC TCGACCAGCC CGACGTCGTC ACCGCCGCCT ACGCCGCCTC GGTCCGCGCG CTGATCGCGG TCCTGGCCGT GCTGAACGAG CAGGTCAAGA CCCTGCAAGG GCAGGTGGCC GAGCATTTTG GCGACGACCC CGACCGGTAC ACCACGGCCA AGGCCCGCAA GAACTACGCC GGTACGTCCC CGATCACCCG CGCCTCCGGC AGGAAGAAGA CCGTCTCCGC CCGGTTCGTC CACAACGACC GGCTCGTCGA CGCCCTCATC ACCCAGGCCT TCTCCAGCCT GCAGGCGTCC CCCGGAGCCC GCGCCTACTA CGACCGGCAA CGCGAACGCG GAGCCAACCA CAACGCCGCC CTGCGCCAGC TCGCCAACCG CCTCGTCGGC ATCCTGCACG GCTGCCTCAA GACAGCCACG CCCTACGACG AAGCGACCGC CTGGCCGCGT CACGCCGAAC AGCTCGCTGC TTGA
Protein sequence | MSTVFIRPLM VLPLAAHGDR HEGWERRCGC SSGTGWVEDH HDVEVMDASE RRLAKARLPE GVTGMARLHA LVGEHAGDAA DEVEVLVGIE TDRGPWVQAL LAAGYTVFAI NPLQASHYRA RHAVSGAKSD AADAHALADM VRTDSHQLRP VAGDTAAVEA VKVVTRTHKT LIWERTRHTQ RLRHALRDYF PAALAAFDDL DAADTLELLA KAPTPAEAAR LTVSQISAAL RHAHHRDVPT KAAAIQAALR AEHLDQPDVV TAAYAASVRA LIAVLAVLNE QVKTLQGQVA EHFGDDPDRY TTAKARKNYA GTSPITRASG RKKTVSARFV HNDRLVDALI TQAFSSLQAS PGARAYYDRQ RERGANHNAA LRQLANRLVG ILHGCLKTAT PYDEATAWPR HAEQLAA
| |
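The numeric fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(sequence: str) -> float:
    """GC content in percent; whitespace between 10-base blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the Gene Information table above.
length = gene_length(3524959, 3526182)
print(length)                  # 1224, matches "Gene Length"
print(protein_length(length))  # 407, matches "Protein Length"

# Only the first two 10-base blocks of the gene sequence, as a small demo.
print(gc_percent("GTGAGCACTG TGTTCATTAG"))  # 45.0
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content. The sequence begins with GTG, which translation table 11 accepts as a start codon, consistent with the protein sequence beginning with M.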