Gene Information |
Locus tag | Franean1_6742 |
Symbol | |
ID | 5675055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8200811 |
End bp | 8201719 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245591 |
Product | SMF family protein |
Protein accession | YP_001510982 |
Protein GI | 158318474 |
COG category | [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0382369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0640366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCACCC AGCGCCTCCT CTGCCACCCG TTGGCCGCGA GACCGGCCCG GGAAACCCGA AAGGCCATCG TGATGACCAG CGACGACGAG CGGCACGCCC GCGCCGCCCT GACCGCCCTC CCCGCCCGCA TTTGGCCCTC GGCCACCGAC AGCCGGACTG ATCCGGTCGA GGTCTGGAAG GGCCTCGCTC CCGAGCACCC CGGGGTCGAC CCCGCCCGAC TGCTCGCGAC CGCCACCGAC GCCGGCTGGC GTTTCGTGAT CCCCAGCGAC CCCGGCTGGC CTGCCACCCT GACCCCCGGC ACCGGTCCGC TTCCGGTCGG CCTCTGGGCA CGCGGCACCG GAGACCTGAC CGGCCTCACC CGCCGGTCGG TCACCATCAC CGGCGCCCGG CTCTGCAGCG AGTACGGGCA GGCGGTCACC GCCGGCCTCG CCTATGACCT GACCACCCCG CTGGCCGCCC AGCCGGTCAC CATCACCGCC GCCGGCATCG GCGAGGGGAT CGATGCCGCC GCCCTGCGGA TCGCCGCCAC ACACCGGCGG GCCGTCGCGG TCCTGACCGC CACCACGGGC ATCACCCGCT ACAACCGCAC CCTCCTCACC GACGTCGCTG ACGGCGGGCT GGTGCTCTGC CTGGCCGCGC CCGGCCTCCC GCCGCGCTCG GGTCACCTCC TCGCCCGCGT GCGCCTACTC GCCACCCTCA CCCGGGCCAC CGTCCTCGTG GAGTCGACCA CCCGCGGGTA CGCGTTCGCG ACCGCGCAGG CCGCCCGGCT GCGACGCCGC CCCGTCATGG CCGTCCCCGG ACCGATCGGC TCCACGCTCA GCGCCGGCCC GAACGCCCTG CTCACTGAGG GCACCGCCCG CCTGGTCACC AGCGCCGAGG ACATCCGCGC CGTCCTCGAC CAGACCTGA
|
Protein sequence | MLTQRLLCHP LAARPARETR KAIVMTSDDE RHARAALTAL PARIWPSATD SRTDPVEVWK GLAPEHPGVD PARLLATATD AGWRFVIPSD PGWPATLTPG TGPLPVGLWA RGTGDLTGLT RRSVTITGAR LCSEYGQAVT AGLAYDLTTP LAAQPVTITA AGIGEGIDAA ALRIAATHRR AVAVLTATTG ITRYNRTLLT DVADGGLVLC LAAPGLPPRS GHLLARVRLL ATLTRATVLV ESTTRGYAFA TAQAARLRRR PVMAVPGPIG STLSAGPNAL LTEGTARLVT SAEDIRAVLD QT
|
| |
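The length fields in this record can be cross-checked against the coordinates with a short Python sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers written for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between sequence blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(8200811, 8201719)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates listed in this record, the sketch gives 909 bp and 302 aa, matching the Gene Length and Protein Length fields. `gc_percent` can be applied to the Gene sequence field as pasted, since it strips the spaces between the 10-base blocks.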