Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3768 |
Symbol | |
ID | 5672133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4468115 |
End bp | 4469239 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242649 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001508069 |
Protein GI | 158315561 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.36553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAT GGCGGCGCAG CTCGCTCGCT GACCTGGTTC GGCTGCGGCG GGGATTCGAC CTGCCGGCGC CGGAACGCCG GGCTGGTTGT TTTCCGGTGG TCGGCTCCGC CGGGGTCAGC GGGTGGCACG ACCGGGGGCC GATCGCCGGG CCGGGGATCA CTCTCGGGCG CAGCGGTTCG TCAATCGGCA CGGTGACGTA TGTGCCATCT GACTACTGGC CGTTGAACAC CGTGCTGTTC GTCGAGGACT TCCAGGGCAA CGATCCGCGT TTTCTCTATT TCCTGCTGCG GACGATCGAC TTCGCCCGGT TCAACTCGGG GAGCGCGCAG CCCTCGCTCA ACCGCAACTA CATCGCCGCC GTCGAGCTGC GTGCGCCGGA GTATCCCGAA CAGCGGGCGA TCGCGGCGGT GCTGGGCGCC CTCGACGACA AGATCGCGCT GAACCATCGG CTTGCCTCGA CTGCCCGGGA GCTCGCCGAG GCACGGTACG CGGCGGCGAC GCGCGGGCCG GGTCGCCGGG AGCTCAGACT CGGTGACCTG GTCGAGACGC TGACCCGGGG GATCACGCCC CGGTACACGG CGGACGACTC CGCGCTCGTC GTGCTCAACC AGAAGTGTGT GCGCGCCGGC CGGGTCGACC TCGCACCGGC CCGCGGGACG GATCCGGCCA CGGTCCCCGC CGCGAAGCGG CTCAGAGCGG ACGACGTCCT GGTCAACTCG ACGGGTATCG GCACCCTGGG CCGAGTGGCC AGGTGGGTGC ACGCGACGCG GGCGACCGTC GACTCGCATG TCACCGTCGT CCGGCTCGCG CCGGACCGGC TGGACCCGGT GTGCGGGGCG TTCGCCCTGC TGGCCGCGCA GCCGCGGATC GCGTCGCTGG GTGAGGGCAG CACCAGCCAG ACCGAATTGA GCAGGGCCGC GTTGAACGAC CTGGTGATCG CGGTGCCGGC GGCCGAGCGG TGTGCCGAGA TCGGCGCCGA GCTGGCGGCG CTGGACGCGC GCGGCGAGGC GGCGCACGCG GAGTCGGCGG CGCTGGCCCG GCTGCGGGAC GCGCTGTCGC CGAAGTTGAT GTCGGGGGAG ATCCGTGTCC GGGACGCGGA GCGGACCGCG GGAAGCCTGG TTTGA
|
Protein sequence | MPEWRRSSLA DLVRLRRGFD LPAPERRAGC FPVVGSAGVS GWHDRGPIAG PGITLGRSGS SIGTVTYVPS DYWPLNTVLF VEDFQGNDPR FLYFLLRTID FARFNSGSAQ PSLNRNYIAA VELRAPEYPE QRAIAAVLGA LDDKIALNHR LASTARELAE ARYAAATRGP GRRELRLGDL VETLTRGITP RYTADDSALV VLNQKCVRAG RVDLAPARGT DPATVPAAKR LRADDVLVNS TGIGTLGRVA RWVHATRATV DSHVTVVRLA PDRLDPVCGA FALLAAQPRI ASLGEGSTSQ TELSRAALND LVIAVPAAER CAEIGAELAA LDARGEAAHA ESAALARLRD ALSPKLMSGE IRVRDAERTA GSLV
|
| |