Gene Information |
Locus tag | Franean1_4861 |
Symbol | |
ID | 5673201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5829681 |
End bp | 5831099 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243716 |
Product | HNH endonuclease |
Protein accession | YP_001509132 |
Protein GI | 158316624 |
COG category | |
COG ID | |
TIGRFAM ID | |
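The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a CDS whose length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(5829681, 5831099)
print(length)                     # 1419, matching "Gene Length"
print(protein_length_aa(length))  # 472, matching "Protein Length"
```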
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.254841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGAAC AGGTACCTTC CCCGACAGCC GCGTCCCCCA CGCCTGACAT CGGCCTGATG CCGGGCCTGG ACGCGGAGAC CGCGCCGCTG GCGGATCTGG AAGCCGCGAT CTGCGGGTGG GCAGGGCGGC TCGCGGCTGC GACCTGCGGC TGGCTGGTCC TGCTCTCCGC CTTCGAGCGG CGGGGCGGCT GGTCGGGGAT CGGGCTGCGC TCATGTGCGC ACTGGCTGTC CTGGCGCTGC GGGATCGGGC TGCGTGCCGC CCGGGAGCAC CTGGCCACGG CCCGCGCCCT CGAACAACTC CCCGCGGTGC GGGCGGCGTT CGCCGACGGA GCGGTCTCCT ATTCGAAGGT CCGGGCGATC ACCCGGATCG CCGACCCGAC CACCGAACTG CTCTGGCTCG AACACGCCCT GCACTGCACC GCAAGCCAGC TGGAACGCCT CGTCCGCACC CTCCGCCAGA CCACCACCGA CCCCGCCGAC CGTGCCAGGA CGCAGGCGGC CCGGCGGGTC TCCTGGCGCA CGGACGACGA CGGCATGCTG CACCTGACCG CGGTCCTGCC TCCCGACGAA GGCGCCCAGC TCGTCGCAGC GCTCGACGCC GCCCGCGCCA GCCTCGACAC CACCACCACC GGCACCACCG GCACCGGCAC CGGCACCGAC GCCGACGCCG ACGCCGACGC CGGCCAGCCG CCTCCCGACG GGGAGGTCGT TGCCGCGCCG CGGGACCGCC GACGCGACGC CGACGCGCTC GTCGCTCTCG CCGAGGGGTT CCTGCAACGG CCAGCTCCCG GACTGACCTC GCCCGCCCAC ACGCTCACTG TGCACGTCGA CGCGGCGACC CTGCTGGACG CCGCACGGCC ACCGCGACCC GGGCCCGGGT CGCGCGCGGA GATCTCACCC GGGATCGGCC TGTCCTCCGC CGTCCTGCGC CGGCTCGGTT GCGACGGGCT GATCCGCGCC CTGGTCACCG ACACCCACGG CAACCCGCTG CGGCTGGGCC GGCGCCGCCG GCTGCCGAAC CGGCAGCTCC GGGACGCGGT CCACGCCCGG GACAGGGGCA CCTGCCAGTA CCCGGGCTGC GCACACACCC GGTGGCTGCA CATCCACCAT CTCGTTCCCT GGATCGAGGG CGGCGGCACC GACATCGACA ACCTCACCCT CGTCTGCGGC GCACACCACC GCACCCTGCA CGACGAGGAC ATCAAGCTCC GCAGAACCAC CACCGGGCGG ATCGTCGCCC TGCTTCCCGA CGGCCGCACG CTCGACCCGG CGCCGCCCGC CAATCCGGGG GCCCGACCCG CCGAGGTCCT CGCCGAGGCC ACTCGGCACG TGGCGCCAGA CGCGATCGTC ACCTGGAACG GCGGCCCGTT CCACCTCGAC GACTCGATCC GCGCACTCCT GCAGGATCAG GCCGCGTGA
|
Protein sequence | MIEQVPSPTA ASPTPDIGLM PGLDAETAPL ADLEAAICGW AGRLAAATCG WLVLLSAFER RGGWSGIGLR SCAHWLSWRC GIGLRAAREH LATARALEQL PAVRAAFADG AVSYSKVRAI TRIADPTTEL LWLEHALHCT ASQLERLVRT LRQTTTDPAD RARTQAARRV SWRTDDDGML HLTAVLPPDE GAQLVAALDA ARASLDTTTT GTTGTGTGTD ADADADAGQP PPDGEVVAAP RDRRRDADAL VALAEGFLQR PAPGLTSPAH TLTVHVDAAT LLDAARPPRP GPGSRAEISP GIGLSSAVLR RLGCDGLIRA LVTDTHGNPL RLGRRRRLPN RQLRDAVHAR DRGTCQYPGC AHTRWLHIHH LVPWIEGGGT DIDNLTLVCG AHHRTLHDED IKLRRTTTGR IVALLPDGRT LDPAPPANPG ARPAEVLAEA TRHVAPDAIV TWNGGPFHLD DSIRALLQDQ AA
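The relationship between the two sequences above can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it strips the 10-base display spacing, computes the GC fraction, and translates with the amino-acid assignments of translation table 11, reading an alternative start codon such as GTG as methionine at the first position. The helper names are hypothetical. The record's first 18 bases, GTGATCGAAC AGGTACCT, translate to MIEQVP, which matches the start of the listed protein.

```python
from itertools import product

# Amino-acid assignments for translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons that table 11 permits as translation starts (read as Met when first).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, spaced as displayed in the record.
print(translate_cds("GTGATCGAAC AGGTACCT"))  # MIEQVP
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` would give values to compare with the GC content and protein sequence fields; rounding of the GC percentage is left to the caller.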