Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3412 |
Symbol | |
ID | 5671783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4041045 |
End bp | 4042106 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242300 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_001507720 |
Protein GI | 158315212 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.406116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCCAGG TCACAGGCTC AAGCCGACTT CTGGCACCCC ACAGGCTGGC CGAGCATCCC GGCGCGCAGG TGATCTGCCG GGACCGGGCC GGTGCCTACG CCGAGGGCGC CCGCACCGGT GCCCCGGACG CGGTGCAGGT GGCGGACCGC TTCCATCTGT GGAGCAATCT CGCCGGATAC GTCGAGACGA CGGTCGCCCG GCATCGTTCC TGCCTGGCGC AACCACCGGC CACTGACGAG GCTGCGGACG AGCCGCGTGC CGACCTTGAT GGCGCGGTGG CCGCAGCGCG GGCAGCGTCC TTCGAACAGC GGGCGTTCGT GCGACACGCC CGCGAGCGGT ACGCCGCCGT CCAGGAACTC AAAGCCGCAG GCGTGGGCAT CAAACCGATC GCCGCCCGAC TCGGCCTGGC CCGAGGAACG GTCCGCAAGT ACTACCGTGC CACCAGCGTC GACGACGTCC TGGCCAAGGC CCGCGACGGC CGCGGCTCGA TCCTGCGGCC GTGGGAGCCC TACCTCACCG AGCGGGTCAA CGCCGGGATC ACCAACGGCA GCCAGCTGTT CGGGGAGATC CGCGACCAGG GATACACCGG GAGCAAAGCC GTGGTCCTGA CCTACCTGCG CCCCCTCCGC GCCGGCGGCA GTACAGCCGC TCCCGCGACG CGGACGGCGC CGAAGGTCCG CACCGTCACC CGCTGGATCC TCACGCACCC CGACCACCTG GATGAACAGG ACACCCTCGC ATTGCAGCAG GTCCTCACCC GCTGCCAGGA CCTCCGGAAG ACCGCCGACC ATGTCACCGC GTTCGCGCAG ATGCTCACCG GCCGCCACGG GGAGCGGCTC AACGGGTGGA TCGCCGCCGT CGACGCCGAT GACCTGTCCG ATCTTCACCG CTTCACCCGC GGCCTCCTAC GCGACCACGA CGCCGTTCTC AACGGACTGA CCCTGCCGCA CAGCTCCGGA CAGGTCGAAG GCACCGTGAA CCGCATCAAA ATGATCAAGC GGCAGATGTA TGGCCGGGCG AACTTCGACC TGCTCCGCAA ACGAGTTCTC CTCGCGACCT GA
|
Protein sequence | MPQVTGSSRL LAPHRLAEHP GAQVICRDRA GAYAEGARTG APDAVQVADR FHLWSNLAGY VETTVARHRS CLAQPPATDE AADEPRADLD GAVAAARAAS FEQRAFVRHA RERYAAVQEL KAAGVGIKPI AARLGLARGT VRKYYRATSV DDVLAKARDG RGSILRPWEP YLTERVNAGI TNGSQLFGEI RDQGYTGSKA VVLTYLRPLR AGGSTAAPAT RTAPKVRTVT RWILTHPDHL DEQDTLALQQ VLTRCQDLRK TADHVTAFAQ MLTGRHGERL NGWIAAVDAD DLSDLHRFTR GLLRDHDAVL NGLTLPHSSG QVEGTVNRIK MIKRQMYGRA NFDLLRKRVL LAT
|
| |