Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2839 |
Symbol | |
ID | 3904751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3343446 |
End bp | 3344696 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880160 |
Product | McrBC 5-methylcytosine restriction system component-like |
Protein accession | YP_481926 |
Protein GI | 86741526 |
COG category | [V] Defense mechanisms |
COG ID | [COG4268] McrBC 5-methylcytosine restriction system component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.117814 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCGC CGGTGGAGCT GACCGAGGGC GCCGGGTGGC AGCGGCGGAA GCTGAGCCCG GGCCAGGCCG ATGCGCTCGA TGCCAGCGAG GTGGCGCAGG TGCGGCAACG GCGTGCCGAC GGTACCTGCG AGGTCAAGGA CAACGCCCTG GTCGGCACCG TGCGCCTCGG GTCGGGCGAG GATACGTTCG AGGTTCGCAT CCGACCCAAG GTCACCATCC GCCGTCTGCT GTTTCTGCTT GGCTACGCGC AGGATCGCGG CAGGTGGTTC GAGGACGAGG TCCAGGCGGC GGAGGAACCG GATCTTCTGC CCGCGGTCGC CGCGGCGTTC GCCCGAACCG CCTCGCGGGC GCTCGCGCAC GGGGTGCCGC GAGGCTATCG GCAGGTGGAT GCGGCGCTTC CCGTCCTTCG CGGCCGGCTG CGCGAGTCCG CGCAGCTCCG GCAACGGTCC GGGGTGATGT TCCCCCTCGA GGTGCGCTAT GACGAGCGCA CCGTCGACAC CGCCGAGAAC CGGTTGCTGC TCGCCGCCAC CCGCTCGCTG CTCGCCCTGG CCGGGGTGGC ACCGGCCACC GCCCAGGAGC TGCGCCGTAT CGCCGCCGCC CTGGACGGTG TGGCCGAGCC GGCGCACGGC CCCGTCAAGC CGCCGGACTG GGTGCCAACC CGGGTGAACG CGCCTTACCA TGCGGCGCTC CGGCTCGCCG AGACGGTCTT GCGCTCGTCT TCCTTCGAAC GGGAAGACGG GGAGACGCTC CGGGTGGACG GCTTCGTGGT GAAGATGTGG GAGGTCTTCG AGGACTTCGT GACCCACGCC GTCGACGAGG TCCTCACCCA CCGCGGCGGT GAGGTCCGCC TGCAGGACCG CACCCACCAC CTCGACGAGG ACCGGACGCT GGAGATGTGC CCCGATCTCG TGCTGTACCG GCCGGAGGGC CCGGGCGGGC GGATGATCCC GGCGGTTGTC CTCGACGCGA AGTACCGGCT CGCGATCCGA CAGGGCGCGC GCGCACACGT GTACCACCAG ATGATCGCCT ACTGTGCCCG GCTCGGCGCC CGGCAGGGAT GGCTCGTCTA CGCCGGCTCG GAGCGGGCCG ACGGCCAGCC CGGCGGTCGT GGCGACGTCA TCCGGAGCCG GATCGGGGGT CCCACGCCCA TCGGGCTCGT GACGTACGTG CTCGACCTGA GGCTCCCCCT GGCCGAGTTG CGGGCCAGGA TCGAGCGGAT CGCCGACGAT ATGGTCACCC CGTCCGTCTG A
|
Protein sequence | MLAPVELTEG AGWQRRKLSP GQADALDASE VAQVRQRRAD GTCEVKDNAL VGTVRLGSGE DTFEVRIRPK VTIRRLLFLL GYAQDRGRWF EDEVQAAEEP DLLPAVAAAF ARTASRALAH GVPRGYRQVD AALPVLRGRL RESAQLRQRS GVMFPLEVRY DERTVDTAEN RLLLAATRSL LALAGVAPAT AQELRRIAAA LDGVAEPAHG PVKPPDWVPT RVNAPYHAAL RLAETVLRSS SFEREDGETL RVDGFVVKMW EVFEDFVTHA VDEVLTHRGG EVRLQDRTHH LDEDRTLEMC PDLVLYRPEG PGGRMIPAVV LDAKYRLAIR QGARAHVYHQ MIAYCARLGA RQGWLVYAGS ERADGQPGGR GDVIRSRIGG PTPIGLVTYV LDLRLPLAEL RARIERIADD MVTPSV
|
| |