Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2958 |
Symbol | |
ID | 5671344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3483192 |
End bp | 3484643 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241864 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001507284 |
Protein GI | 158314776 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGGTT CGGTCGAGGT CGTCGTGGAC GACGCCGAGC AGATCCTGGA TCGGGTGGCG GCGATCGACG TCGCGAAGGC GTCCGGGAAG GTGTGCGTGC GGGTCCCGCA CGACAGCCGG GAAGGCAGGC GAGTCACCCG CGTCTTCGGT GTCACCGCGA CGGTCCCGGC GGTGGAGGAA CTCGCCGACC ACCTGGTCTG CCAGGGCGTC CAGCGGGTCG TCGTCGAAAG CACGTCTGAC TACTGGCGGG TGTTCTACTA CCTGCTCGAG GAGCGGGGCC TGACAGTGTG GCTGGTCAAC GCCCGGGACG TCAGGAACGT CCCCGGAAGA CCCAAGACAG ACAAGATCGA CGCTGTGTGG CTGGCGAAGT TGAACGAGCG GGGGATGCTG CGGCGGTCGT TCGTGCCGCC GGTGGCGATC CGCCGGATGC GGGATGTGAC CCGGATGCGG GTCGACCTCG TCGCGGACCG CACGCGGGTC AAACAGCGCG CAGAGAAACT ACTCGAAGGC GCGCTGATCA AACTGTCGTC GGTGGTCTCG GACGTTTTCG GGGTCGCCGG CCGGCGGATC CTCAACGCAC TGATAGCCGG GGAACGCGAC CCGCGCCGGC TCGCGGCACT CGGGACGGGT CTGAAGGCCT CGCCGGCAGC ACTGACCGCG GCGCTGACCG GCCGGTTCAG CGACCACGAC GCGTTCATGC TCACGATCTA CCTGGAGCAG ATCGACGCGC TCGACAAGCA CCTCGCCACG CTGTCCGCGC GGATCGACCA GATGACCGCG GCGATCCCCC TTCCCACCCG TCGCACCGAC ACCCCCGGCG TCGCTGAGAT CACCACGGTG CCCGGGGTCG GGACGGTCTC GCCGGTCACC GGTGAGATCA TCCCGCCCGC CGACGCCCCT CCCGCAGGTG GCACGCCCCC ACCCGCCGGT GGCCTGCCGG GGCCGCAGAC GGTCGCGGAT CTCGTCGACC TGCTCGACGC GATCCCCGGG ATCGGCAGGG ACGCCGCACA ATTGATCCTC GCGGAGATCG GCACGGACAT GGCCCCGTTC GCAACCTCCG GGCACCTGGC GTCGTGGGCG AAGCTGACCG CGCGCACGAT CCAGACCGGG GCGTCGCTAC GGATGGGCCG GACCGGGCGC GGGAACCGGT ACGTGCGCCG CACACTCGGC ACGGCCGCGG CCTCCGTCGC ACGCACCAAC ACCTTCCTCG GGGCACGTCA CCGGCGGTTA CGCGCCCGCC GCGGCGCGCT GAAGGCTCTC GTCGCTACCA GCCGCACCAT CCTAGAGATC ATCTGGCGGA TGGTGCACGA CCAGGTGCCG TTCCGGGAAC TCGGCGCGGA CTACCACACC CGCCACCAGG ACCCGGACAA GCGCAAGCGA ACGCTGGCCC GGCAGATGAA AAACCTCGGG CTCTCGCCCG AGGAAGCCGC CGCCATGCTC GCCGCAGCCT GA
|
Protein sequence | MDGSVEVVVD DAEQILDRVA AIDVAKASGK VCVRVPHDSR EGRRVTRVFG VTATVPAVEE LADHLVCQGV QRVVVESTSD YWRVFYYLLE ERGLTVWLVN ARDVRNVPGR PKTDKIDAVW LAKLNERGML RRSFVPPVAI RRMRDVTRMR VDLVADRTRV KQRAEKLLEG ALIKLSSVVS DVFGVAGRRI LNALIAGERD PRRLAALGTG LKASPAALTA ALTGRFSDHD AFMLTIYLEQ IDALDKHLAT LSARIDQMTA AIPLPTRRTD TPGVAEITTV PGVGTVSPVT GEIIPPADAP PAGGTPPPAG GLPGPQTVAD LVDLLDAIPG IGRDAAQLIL AEIGTDMAPF ATSGHLASWA KLTARTIQTG ASLRMGRTGR GNRYVRRTLG TAAASVARTN TFLGARHRRL RARRGALKAL VATSRTILEI IWRMVHDQVP FRELGADYHT RHQDPDKRKR TLARQMKNLG LSPEEAAAML AAA
|
| |