Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0474 |
Symbol | |
ID | 5668894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 559194 |
End bp | 560261 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239404 |
Product | integrase catalytic region |
Protein accession | YP_001504842 |
Protein GI | 158312334 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCC GCCTGCTCTA TCTGATCTTC GTGAGGGTGT GCGGCTGGCT GGTCCTCCTC GGCCGTTCGT CAGCGTCCAA GGACATCGAG TTGCTCGTGT TGCGGCACGA GGTCGCGGTG CTGCGTCGTA CCCAGCCCAA GCCCCGGTGG GACTGGGCGG ACCGGGCGGT CCTCGCCGCA CTCATCCGAC TCCTGCCCAA GACGCTGCGA GCCCACCGGC TGGTCACCCC GGGGACCGTC CTACGGTGGC ACCGCCGTCT GATCACACGG AAATGGACCT ACCCGCAGCG GACGGGACGA CCTCCGGTCA GCCCGGAGAT CGCCGCACTG ATCGAGCGGC TCGCGACCGA GAACACGACG TGGGGGTACC AGCGGATCCA GGGCGAGCTC CTCAAGCTCG GCCACCGGGT CAGCGCGTCC ACGATCCGCC GCGTCCTGAA GTCCCTGGGT CTCCCACCGG CGCCCAAGCG GCAGACCGAC ACGACATGGC GACAGTTCCT ACGCACCCAG GCATCGACCA TGCTGGCCGT CGACCTCTTC CACGTCGACT GCGCCGTGAC GCTCCAGCGT CTGTACTGCT TCTTCGTCCT GGAAGTCGGC ACCCGCACCG TGCACATCCT CGGGGTCACC GCCCACCCCG ACGGCCCGTG GACCACCCAA CAAGCCCGGA ACCTCCTCAT AGACCTCGGC GACCGGGCCG CCGACTTCCA GGTCCTGATC CGCGACCGCG CTGGGCAGCT CACCGCCTCC TTCGACGCGG CCCTCGCCGA TGCCGGCATC ACCGCGGTCA AGATCCCACC CCGGACTCCT CGGGCGAACG CCTACGCGGA ACGGTTCGTC CACACGGTCC GGACCGAGGT CACCGACCGC ATGCTGATCG TCGGTGAGCG GCACCTGCGT ACCGTTCTGG CCGAGTACGC ACGGCACTAC AACGGACGAC GACCCCACCG CGGTTGTGGC CTTCAACCGC CTCGGCCCGA CCACCCTGTC GCCGACCTGG CCAGGGAACG GATCAAGCGC CAGCCCGTCC TCGGCGGCCT GATCAACGAA TACGAACGAG CCGCCTAA
|
Protein sequence | MSIRLLYLIF VRVCGWLVLL GRSSASKDIE LLVLRHEVAV LRRTQPKPRW DWADRAVLAA LIRLLPKTLR AHRLVTPGTV LRWHRRLITR KWTYPQRTGR PPVSPEIAAL IERLATENTT WGYQRIQGEL LKLGHRVSAS TIRRVLKSLG LPPAPKRQTD TTWRQFLRTQ ASTMLAVDLF HVDCAVTLQR LYCFFVLEVG TRTVHILGVT AHPDGPWTTQ QARNLLIDLG DRAADFQVLI RDRAGQLTAS FDAALADAGI TAVKIPPRTP RANAYAERFV HTVRTEVTDR MLIVGERHLR TVLAEYARHY NGRRPHRGCG LQPPRPDHPV ADLARERIKR QPVLGGLINE YERAA
|
| |