Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1963 |
Symbol | |
ID | 5670364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2359471 |
End bp | 2360640 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240884 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001506306 |
Protein GI | 158313798 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1533] DNA repair photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.003672 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAACG TGTGTTCTAA TGGCGGGGTG CGGTGGGAGA ACCTGAGCCT CGTCGAGTCC GCGGACACCC CGGTGCGCGG GCTGCTCGAC CGCGCCGCGG TGACCCGGAC GTTCGACACC CCGGGCTTCG CCGGAATGAC CTTCTACGAG ATCCACGCGC GGTCCGCGCT CAATCGCGTC CCGGCGGTAT CGCGCGTCCC GTTCCGGTGG ACGCTGAACC CCTACCGGGG CTGCTCACAC GCCTGCCGAT ACTGCTTCGC CCGCAACACC CACTCCTACC TCGACCTCGA CACCGGTCTC GACTTCGATT CCAGGATCGT CGTCAAGGTC AACGTGGCCG AGCGGCTACG GGCCGAGCTG GCCGCGCCGA AGTGGCGCGG CGAGTCCGTG GCGATGGGTG CGAACGTCGA CCCTTATCAG CGGGTGGAAG GGCGCTACCA GCTCATGCGG GGCGTGCTCG GCGTCCTGCG CGACGCGGCG AACCCGTTCT CGATCCTCAC CAAGGGCACC CTGATCCTGC GTGACCTCGA CCTGCTGGCC GAGGCCGCCG CCGTCACCGA GGTCCGCGTG GCGGTCTCGG TCGGTTTCGT CGACGACGAC CTGTGGCGCA CGGTCGAGCC GGGAGCCCCC CGCCCGGAAC GCCGGCTGGA GGTCTGTGCG GCGCTCGGCG CCGCCGGCAT CGAGTGCGGG GTGCTGATGG CGCCCGTACT CCCCGGCCTG AGTGATTCGC CGGCGGCGCT GGAGCGCGCG GTGCGCCGCA TCGCCGAGGC GGGTGCGGCC AACGTGACCC CGATCGTGCT GCACCTGCGG CCCGGGGCGC GGGAGTGGTA CCTCGGCTGG CTCGGGGAGC ATCACTCCGA CCTCCTCCCT CTGTACCGAT CCCTCTACGG CGGGGGCTCC TACGCGCCGC GGGCCTACAG CGAGCGGATC TCCGCCTTGG TCCGGGACCT GGCCCGCCGG TACGGCATCG CCGGTGCCGC CGCGCCGGCC GCCTCACCGG CCTCCTCTCG GTGGGCGCCG GCGGAGGCGC GTGGGGTGAG TGGGGTGCGG GGTGGGCGTG CGGCGGTCAT CCACCGTGGT CCGACACGTG CCGTTACGGC GGTGTCCGCG GCGGTGGCCG GCCGGGCGCC GCTGGAGGAG CAGCTCCTCC TGCCCGGCTT CGGTTCCTGA
|
Protein sequence | MSNVCSNGGV RWENLSLVES ADTPVRGLLD RAAVTRTFDT PGFAGMTFYE IHARSALNRV PAVSRVPFRW TLNPYRGCSH ACRYCFARNT HSYLDLDTGL DFDSRIVVKV NVAERLRAEL AAPKWRGESV AMGANVDPYQ RVEGRYQLMR GVLGVLRDAA NPFSILTKGT LILRDLDLLA EAAAVTEVRV AVSVGFVDDD LWRTVEPGAP RPERRLEVCA ALGAAGIECG VLMAPVLPGL SDSPAALERA VRRIAEAGAA NVTPIVLHLR PGAREWYLGW LGEHHSDLLP LYRSLYGGGS YAPRAYSERI SALVRDLARR YGIAGAAAPA ASPASSRWAP AEARGVSGVR GGRAAVIHRG PTRAVTAVSA AVAGRAPLEE QLLLPGFGS
|
| |