Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6741 |
Symbol | |
ID | 5675054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8199843 |
End bp | 8200694 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641245590 |
Product | putative DNA processing smf-family protein |
Protein accession | YP_001510981 |
Protein GI | 158318473 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000806567 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0643644 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACC CTGTGAATGC CGACCGGCGT GCCCGCGCCG CGCTGACCTG CCTACCGCCC GACCGTCACC GCACCCAACG CATCCACCGC CACGGCCCCA TCACCGTCTG GGAGAGGATC GCCGGCCGCT ACCCCGACAT CGACCCGGGC CGTCTCCTGG AGCAGGCCAC CGAGAGCGGC TGGCGGCTGG TCATCCCCGG CGACCCCGAC TGGCCCGCCA CCCTCGACGT TCCCGGCCGG CCGCTGGGCC TGTGGGTGCA CGGCGCCGGG GACCTCCCCG GGCTGCTGCG TCGGGCGGTG ACCCTCACCG GCGCCCTCGC GGGAAGCGCC TACGGGCGCA CGGTCGCCGC CGACCTCGCC ACCAAGCTGA CCACGCAGCC CGGTGGGGAG TCGGTCACGG TCGTCGCCCA CGGCGGCGGG AACGGCACCG ACGTCGCCGC CCTGACCGCC GCCGCCCGCC GGCCGGCGGC GGTCGCGGTC CTCGACACCC CCACCGATCT GTTCGGCCAG CCCGACCTGC TTCGTACGGT CGCCGCCGGG GGGCTACTCG TCAGCGCCGC CGCGCCCGAC GCAAACCCCA CTCCCGGCCA TGTGCTGGCC CGCATCGAGC TGCTCGCCGC TCTCGCTCAC GCCACCGTCC TCATCGAGGC CCGCGCCAGC GACGACGACG CGCTTTCCGT CGCCTACACC GCCCACTTCC GGCGCCGCCG GCCGGTCCTG GCTGTCCCCG GCCCGATCAC CGCCCTGGCC AGCGCCGGCC CGCACGCCCT CCTCCGCGAC GGCACCGCCC GCTGCGTCAC CACCGCCGGC CACATCACCG CGCACCTCCC CGACCCGCCC GCCCCCCGCT AA
|
Protein sequence | MTDPVNADRR ARAALTCLPP DRHRTQRIHR HGPITVWERI AGRYPDIDPG RLLEQATESG WRLVIPGDPD WPATLDVPGR PLGLWVHGAG DLPGLLRRAV TLTGALAGSA YGRTVAADLA TKLTTQPGGE SVTVVAHGGG NGTDVAALTA AARRPAAVAV LDTPTDLFGQ PDLLRTVAAG GLLVSAAAPD ANPTPGHVLA RIELLAALAH ATVLIEARAS DDDALSVAYT AHFRRRRPVL AVPGPITALA SAGPHALLRD GTARCVTTAG HITAHLPDPP APR
|
| |