Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3104 |
Symbol | |
ID | 3904230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3675959 |
End bp | 3677713 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880425 |
Product | hypothetical protein |
Protein accession | YP_482190 |
Protein GI | 86741790 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.197929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.651768 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCCCAG ATCCCGCCCC GGACCGTCCC CGGCCGCCCC GACAGGGCAG GCTGGATGAT CTCGGTCGTC CGTTGGCGGA CGTGACCTTC GTCGTGTTCG ACCTGGAGAC GACGGGCACC TCCCCGGGAC GCGACGAGAT CACCGAGATC GGCGCGGTCC GGGTCCGCGG CGGCCGGATC CTCGCCGAGA TGGCCACCCT GGTCCGGCCG GGCGTCGGCA TCCCTCCGAT GGTCTCGGTG CTCACCGGCA TCACCGACGT GATGGTGGCG ACGGCGCCGC CGGTCACGCA GGTGCTGCCT ACCTTCCTGG AGTTCGCCCG CGGCGCCGTT CTCGTCGCCC ACAATGCCCC GTTCGACCTC GGCTTCCTGC GCGCCGCCGT CGAGCTCTGC GGCTATCCGG TACCCGTCTG GGAGTATCTG GACACGTTGC GTATCGCCCG GCGGGTGGTC ACGAGAGACG AGAGCCCCGA CTGCCGGCTC ACGTCGCTGG CCTCGCTGTT CCGCAGCCCG GTCGAGCCCC GCCACCGGGC GCTGGCGGAC GCCCGGGCCA CCGTCGACGT GCTGCACGGG CTGTTCGAAC GGCTCGGCAA CGCGGGCGTG ACCACCCTGG AGGACCTGCA CGACTACAGC TCCCGGGTGT CGCCGGCCCA GCGACGCAAA CGGCATCTGG CCGACGGCCT GCCGACGGGC CCGGGTGTCT ACATCTTCCG GGACGCCGAC GAACGAGCCC TGTATGTCGG CACCTCGCGT TCGGTGCGCT CCCGGGTCCG TACCTACTTC ACCGCCAGCG AGCCCCGGAC GCGGATGGCG GCGATGGTGG CGCTGGCCGA GCGGGTTGAC GCGATCGGAT GCGCGCACGC CCTGGAGGCC GAGGTCCGGG AACTGCGGCT GATCGCCGAG TACAAGCCGC CGTACAACCG CCGATCCCGC TTCCCGGAGC GCTCCGTGTA TCTCAAGCTC ACCGACGAGC CATTCCCACG GCTTTCACGG GTGCGCGCCG CCCGTGACGA CGCGACCTAT CTCGGGCCGT TCGGCAGCGT CCGCGCCGCC GACGCCGCCG CCGAGGCGCT GCTGGCCGCG GTGCCGCTGC GCCAGTGCTC CGGGCGCCTG TCCCCGCGCG TGCGCCGGTC CGCCTGTACC CTCGCCGACC TCGGCAGATG CGGAGCGCCG TGCGACGGCC GGGAGGACGT GGCGAGCTAC GGCCGGCACG TCGCCGCCGC CCGGGCCGCC ATCACCGGCG ATCCGGGCCG GGTCATCGCC GCCTCGACGC GGCGGATCGA CCGGCTGGCC GCCGAGCGGC GCTACGAGGA GGCCGCCGTC CAGCGGGACC GGATGATCGC GTTCGTCCGC GCAGCCGCAC GTGCCCAGCG GCTGTCGGCC CTCACCGGGG TCGCCGAGCT CGTCGCCGCG GCCCCGACCG CCGAGGCCGG CTGGGATCTG GCCGTAGTGC GTCACGGTCG GCTGGTGTCG GCGGCGAGCG TGCCGCCCGG TGTCGATCCG CGGCCCTGGG TCGACGCCGC GGTCGCCAGC GCCGAGACGG TGCGGCCGCG GCCCGGTCCG GCCCCGTGCG CCTCCGTCGA GGAGACCGAA CGCATCGCCC GTTGGCTCGG TGGGCCTGGG GTGCGGCTCG TGCGGCTGGA GGGCGAGTGG AGCTGGCCGG CCGCGGGCGC CATCCGCGCC GCAGCCGGAT TCGGTGCGGC CCCCGGCCGG TCGGTACGTG CGTATGACGG TGACGGATGG TTCCCGTCGG CCTGA
|
Protein sequence | MRPDPAPDRP RPPRQGRLDD LGRPLADVTF VVFDLETTGT SPGRDEITEI GAVRVRGGRI LAEMATLVRP GVGIPPMVSV LTGITDVMVA TAPPVTQVLP TFLEFARGAV LVAHNAPFDL GFLRAAVELC GYPVPVWEYL DTLRIARRVV TRDESPDCRL TSLASLFRSP VEPRHRALAD ARATVDVLHG LFERLGNAGV TTLEDLHDYS SRVSPAQRRK RHLADGLPTG PGVYIFRDAD ERALYVGTSR SVRSRVRTYF TASEPRTRMA AMVALAERVD AIGCAHALEA EVRELRLIAE YKPPYNRRSR FPERSVYLKL TDEPFPRLSR VRAARDDATY LGPFGSVRAA DAAAEALLAA VPLRQCSGRL SPRVRRSACT LADLGRCGAP CDGREDVASY GRHVAAARAA ITGDPGRVIA ASTRRIDRLA AERRYEEAAV QRDRMIAFVR AAARAQRLSA LTGVAELVAA APTAEAGWDL AVVRHGRLVS AASVPPGVDP RPWVDAAVAS AETVRPRPGP APCASVEETE RIARWLGGPG VRLVRLEGEW SWPAAGAIRA AAGFGAAPGR SVRAYDGDGW FPSA
|
| |