Gene Information |
Locus tag | Franean1_0892 |
Symbol | |
ID | 5669306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1038925 |
End bp | 1041597 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239819 |
Product | hypothetical protein |
Protein accession | YP_001505254 |
Protein GI | 158312746 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
|
|
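The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length (both are assumptions about this database's conventions, not statements from the record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values copied from the record for Franean1_0892.
length = gene_length_bp(1038925, 1041597)
print(length)                     # 2673
print(protein_length_aa(length))  # 890
```

Both results agree with the Gene Length and Protein Length fields listed in the record.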
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.608011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCCG ACTCGGTACT CGACCTGCTC GCCGCGCAGA TGGCGCAGGA CGCCCTGCCG GCGCGGGTCC GTGACCTGCT ACTGGCCGCC CTGGACGGCG ACGGCCCGCT CGCCGTGGAG CTCGCCCGAC GCGGCGGGAC GCCTGGTACC AACGAGCCGG GCGATGGCGC CACCTCAGAT GACGGCACCG CCCCAGGCGG CGGCACCGCC CGGCTGGGCG GGACGGGGTC GGGTGAGCCG GCCGTCCACC TGGAGTCGAT CGGCGTGCAG GGGTTCCGTG GGATCGGGCC GCTCGCGGTG CTCCCCCTCC GGCCCGGGCC GGGTCTGACT CTCGTCACGG GACGTAACGG CTCGGGCAAG TCCAGCTTCG CGGAGGCCGC CGAGATCGCG CTGACAGGCG ACACCCGCCG CTGGTCACGG CGGGCGGCCG TCTGGCGCGG CGGCTGGCGC AATCTGCACA GCACCGGGCC CGCGCGGGTC GAGATCACCT GCACGGTGGA GGGTCGCCCC GAGCCGGTCA CCGTCACCCG GACCTGGCCG GAGGGCGCCG GTCTGGACGA GGGCGCCGGC GAGATCCGCG CCTCGGACGG CGGGGCGTGG TCACCGTCCG GGTGGGACTG GGCTCTGTCC ACCTACCGGC CGTTCCTGTC CTACGCCGAA CTGGGCGATC TCCTCGGCGG ACGTCCCAGC GCGATGTTCG ATGCGCTGCA CGCGATCCTC GGGCTGGACG AGATGGTCGA CACGGCGCGA CGCCTGAGCG CGGCGCGCAG CCGGTTCACC GAGGCCGCGC GCCGCTCGCG GGAGGAGCAC ACCGCCCTGC TCGCGGCGCT GCGGCGCAGC ACGGACCCGC GGGCAGTCGA GGCCGCCGAC GCGTTGCGGG ACGCCGCACG TCCCGATCTC GACCGAGCCT GGGCCGTCGC GGCGGGCCTG CCGGCGCCGA GCACCGCGCC GCCCGCGAAG CCGGCGGTGC TGCGGGAGCC GGCGGTGCTG CGGGAACTGC TGGGGCTGCG GGAACTGCTG GGGCTGCGGG TGCCGGCGGA GTCCGACGTC CGGGAGCTCG CCGTGCGACT GCGCGCGCTC GCGACCGAGG AGACGAGGCT GGCGGCGACG GCGGCCGGCG ACGCGGCCCG CCTCACCGAC CTGCTGGCCG CCGCGCTCGA CCACCACGAG CATCAGAACG CCGTCCACGC GGGTGGTCCG TCATCTGAGG GTGCCGAGGA CGGCGATCGG GGCCGGGGTG GGAGCGGGAG CCGGCCGTGC CCGGTCTGCG GGGTCGGGCG GCTCGACGAG CGGTGGTGGC ACGCGACACG GGCGGAGGTC ACCCGCCAGC GGGAGCGGGC CGCGGCCGTG CTGGTCGTGC GCGCCGGCCT CACCGCCGCG CTGGCCGACG TCGCGCTGCT GCTCGCACCC GCCCCCGAGC CGTTGACCAC CCCTGATCCC CTGGCTCCCT CCGGGCCCCT GGCTCCCTCC GGGCCGCTGA CCGCCGGTGG GAGCACGGGC GTGCTGGCCG CGCTGGCCGC GGCCGCGGGC ATGGCCTGGC GCCGGTGGGC ACGGCTGCGC GACGACTCCG ACCCGCTGGC CGTCGCGGAC GGGCTGGAGT CCGCCCACCC TCCGCTGCTG CGGGCGGTCG AGGCCCTGCG GGCGGCCGCC CGCGACGAGC TCGACCGCGC CGACGCCGAC TGGCGTCCGC TCGCTGCCCG GGTCACCGCC TGGGTCGGGG CGGCGCGGGC GGTCGATGCC GACGCCGAGG CGCTGGCCGA CGTCCGACAG GCTCTTGACT GGCTGCGGGA GGCGACCCAA 
CGGCTGCGGG ACGAGCGGAT GCGGCCCTTC GCGCAGCGGT CGGCCCAGAT CTGGTCGATG CTGCGCCAGG AGAGCAACGT CGATCTGGGC CCGGTGCGCC TCACCGGCAG TGCCAACCAG CGGCGCGTCG ACCTCGACGT CACCGTCGAC GGGGTGGACG GCGCCGCGCT CGGCGTGATG AGCCAGGGCG AGCTGCACGC GCTGGGGCTG GCGCTGTTCC TGCCCCGCGC GACCAGCGAC GCGAGCCCGT TCCGCTTCCT CATCATCGAT GACCCGGTGC AGTCGATGGA CCCGGCGAAG GTCGACGGGC TGGCGCGCGT ACTCGCCCAG GTTGCCGAGA CTCGGCAGGT CGTCGTCCTC ACCCATGATG ATCGGCTGGC CGACGCGGTG CGCCGGCTGC GGCTGCCGGC GACGGTGCTC GACGTGGTCC GTCGCGAGGG GTCGCTGGTG CGGCTGCGCG GGAACCTCGA TCCGGTCGGG CGCCATCTCG CGGACGCGCG GGCGCTCGCC CGGACCGGTG ACCTGCCCCG CGACCTGGCG ATGATGCTCG TCCCGGCGAT GTGCCGCTCG GCGGTGGAGA CTGCGTGCAA CGAGGTGGTC CGGCGGCGCC GGCTCGGTGC GGGAGCGCGC CACGCCGAGG TCGAGGCGGC GCTCGCGGCG GCGCACTCGG TGAGCGAGAA GGCCGCCCTG GCCCTGTTCG ACGACGCGCG CCGGGCCCGC GGCGTCCTGC GCCGCCTCGA CGGCCACGCG CCGTGGGCCG CCGACACGTT CCGGGCCGTC CGGGACGGCG TGCACGTCGG GTACGACGGC AACCTGCTGA GCCTGGTCCG CGACACCGGC CGGCTCACGG ACCACATCCG GACCCTGGCT TGA
|
Protein sequence | MAPDSVLDLL AAQMAQDALP ARVRDLLLAA LDGDGPLAVE LARRGGTPGT NEPGDGATSD DGTAPGGGTA RLGGTGSGEP AVHLESIGVQ GFRGIGPLAV LPLRPGPGLT LVTGRNGSGK SSFAEAAEIA LTGDTRRWSR RAAVWRGGWR NLHSTGPARV EITCTVEGRP EPVTVTRTWP EGAGLDEGAG EIRASDGGAW SPSGWDWALS TYRPFLSYAE LGDLLGGRPS AMFDALHAIL GLDEMVDTAR RLSAARSRFT EAARRSREEH TALLAALRRS TDPRAVEAAD ALRDAARPDL DRAWAVAAGL PAPSTAPPAK PAVLREPAVL RELLGLRELL GLRVPAESDV RELAVRLRAL ATEETRLAAT AAGDAARLTD LLAAALDHHE HQNAVHAGGP SSEGAEDGDR GRGGSGSRPC PVCGVGRLDE RWWHATRAEV TRQRERAAAV LVVRAGLTAA LADVALLLAP APEPLTTPDP LAPSGPLAPS GPLTAGGSTG VLAALAAAAG MAWRRWARLR DDSDPLAVAD GLESAHPPLL RAVEALRAAA RDELDRADAD WRPLAARVTA WVGAARAVDA DAEALADVRQ ALDWLREATQ RLRDERMRPF AQRSAQIWSM LRQESNVDLG PVRLTGSANQ RRVDLDVTVD GVDGAALGVM SQGELHALGL ALFLPRATSD ASPFRFLIID DPVQSMDPAK VDGLARVLAQ VAETRQVVVL THDDRLADAV RRLRLPATVL DVVRREGSLV RLRGNLDPVG RHLADARALA RTGDLPRDLA MMLVPAMCRS AVETACNEVV RRRRLGAGAR HAEVEAALAA AHSVSEKAAL ALFDDARRAR GVLRRLDGHA PWAADTFRAV RDGVHVGYDG NLLSLVRDTG RLTDHIRTLA
|
| |
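The sequences above are printed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a block-formatted sequence might be cleaned up, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene are used as input. The codon-to-amino-acid assignments of translation table 11 match those of the standard code (the tables differ in permitted start codons), so the standard table is built here.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping before the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record (a fragment only).
fragment = clean("ATGGCCCCCG ACTCGGTACT CGACCTGCTC")
print(len(fragment))        # 30
print(translate(fragment))  # MAPDSVLDLL
```

The translated fragment matches the first block of the listed protein sequence. Run on the full gene sequence, the same functions could be used to check the listed GC content and protein length.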