Gene Information |
Locus tag | Franean1_4913 |
Symbol | |
ID | 5673253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5899607 |
End bp | 5900776 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243768 |
Product | DNA polymerase III subunit epsilon |
Protein accession | YP_001509184 |
Protein GI | 158316676 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
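
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 5899607
end_bp = 5900776

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon encodes no residue.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values match the Gene Length (1170 bp) and Protein Length (389 aa) fields.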
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.157801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00553803 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCGTCG ACGAGCGACT TAAAACAACG GACACACAAT ACCATCCGCG ATCACGGCCG GATTCGTTGT TTCCCTCGCG TTGCCCTCCA TCCGCACCAG GTTCGCTCCG TCCCCGCACT GCTCCGGGCA GCGCGGCGCC GGACCGTCGG GGGTCTGCGC CATCATCTGC GGCTGTGAGT GCTGAGGTTC TTGCCCCGCC CGGCTTCGCC GCACGTTTTC GTCGGCCGTA CGCGGTGGTG GACGTCGAGA CCACGGGCCT GTCCCCGACG AGGGACAGGG TGCTGTCGGT AGCGGTCGTC CTGACCGCCG CCGACGGCAC CGTCGAACAC CGCTGGTCAA CCCTCCTGGA TCCCGGATGT GATCCCGGCC CCGTCCATAT TCACGGTCTG ACCAGGCAGC GCCTGGCCGG GAGCCCGACG TTCGCCGCCG TCGCCGACGA GGTCGCCGGC CTGCTCGCCG GGCGGGTGCT CGTCGCGCAC AACGCGGCCT TCGACTGGCG GATGCTAGCC GGCGAGGCCA TGCGGATCGG GACGACGATC CCGGTCGAGT GGCGGCTGTG CACGCTGACG TTGGCCAGTC GGCTCGGCCT GGAGCTGCCG AGCCTGCGCC TGGCCTCGCT CGCCGCGTAC TGGGGCGTCG TCCAGCGGCG GGCCCATGAC GCGCAGGATG ACGCCGAGGT GCTGGCGGCC TTGTTGCCCC GCATCCTGCA ACGTGCCGCC GACCAGTACC TGGAGCTGCC GCTCACGCGC TGCGGCGTCG ACCACGACGG CGTCACGGTG CGCCCGCCGC GCTTCGCCGC CGCGGGCCGC ACGCCGCCGT GCCGTTATCG AAACCCCGGG CGGCTCGATC CCGGTGCGGC GCTGGTCCAG GGAATGCGCG TCGCTTTCAG CGGCCCCACG CGGACGGAAC GCGGCGAGCT CGTCGGCCGG GCGGTCGCCG CAGGCCTGCA TGTGACGGAA ACCGTGAGTC GGCGCACAAG TCTTCTGGTG ACCAATGACG CTCGCGGCGT GACCCGGAAG GTCCGCACCG CCGCGTTGTT GAACACACCC GTACGAGGAG AGGAAGATTT CCTGGAATTG CTCGCGGCCG TTCGCCCAGG AATTCTCGTC CCGGCTGCCG CACCCGTTCG CCGTGACCGC CGGCCGCGGC GCCCGACCCC GGACGCGTGA
|
Protein sequence | MAVDERLKTT DTQYHPRSRP DSLFPSRCPP SAPGSLRPRT APGSAAPDRR GSAPSSAAVS AEVLAPPGFA ARFRRPYAVV DVETTGLSPT RDRVLSVAVV LTAADGTVEH RWSTLLDPGC DPGPVHIHGL TRQRLAGSPT FAAVADEVAG LLAGRVLVAH NAAFDWRMLA GEAMRIGTTI PVEWRLCTLT LASRLGLELP SLRLASLAAY WGVVQRRAHD AQDDAEVLAA LLPRILQRAA DQYLELPLTR CGVDHDGVTV RPPRFAAAGR TPPCRYRNPG RLDPGAALVQ GMRVAFSGPT RTERGELVGR AVAAGLHVTE TVSRRTSLLV TNDARGVTRK VRTAALLNTP VRGEEDFLEL LAAVRPGILV PAAAPVRRDR RPRRPTPDA
|
| |
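
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with GTG and ends with TGA), uses the amino-acid assignments of translation table 11, and treats only ATG, GTG and TTG as initiator codons, which is a subset of the start codons that table permits. The function name `translate` and the constants are hypothetical helpers written for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, built in the
# conventional T, C, A, G base order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiator codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading an initiator codon as Met."""
    cds = cds.replace(" ", "").upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
head = "GTGGCCGTCG ACGAGCGACT TAAAACAACG"
print(translate(head))
```

The first ten residues produced for this prefix agree with the start of the listed protein sequence (MAVDERLKTT). Note that the leading GTG is read as methionine only because it is the initiator codon; elsewhere in the sequence GTG encodes valine.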