Gene Information |
Locus tag | Franean1_4264 |
Symbol | |
ID | 5672619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5090279 |
End bp | 5092189 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243137 |
Product | nucleic acid binding OB-fold tRNA/helicase-type |
Protein accession | YP_001508554 |
Protein GI | 158316046 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00811307 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.186762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTGC CAACCCTGCT CGACTACCAG CAGGCCGTCC AGATCCCGCG GCTGGCCTTC CTGGACGACC AGCTGCGGGA GAGCGTGCCG CGGCTGACCC CGCTGGGCAT GCCGGCGGTC GCCACCGGCG GGTTCGCCCT GACCTTCGAC GTAAGCCTGG GCGGCCGGCG CTACGCCGTG CGCTGCTTCC ACCGGCACAG CGACAACCTG GAGCTGCGCT ACGCCTGCAT CGCCGAGTTC GTCCGCTCGG CCGCGCTGGA CTTCCTGGTC GGCGTGGACT ACCAGCCGGC CGGAATCCGG GTGCGCGGCC ACTGCTGGCC GATCGTGCGG ATGGAGTGGA TCGACGGCGC GCGGCTCGAC GACTGGGTCC AGGAGAACCT GGACCGGCCC GCGCACCTCG ACCGGGCCCG GCTCGGCCTG GGCTTCGCCG TCGCCGAGCT GCGCCGCCGC GGTGCCGCGC ACGGCGACCT GCAGCACGGC AACATCCTCG TGCTGCCGGA CAGGTCCGTA CGGCTCGTCG ACTACGACGG GATGTACCTG CCGGGGCTGA GCGGGCTCGG CGCCTCCGAG CGGGGCCACC GCAACTACCA GCATCCGGAC CGCTCGAACC AGTACGACGT CACGCTCGAC CGCTTCGCCG AAGAGGTGAT CACGGTGTCG CTCGCCGCCC TCGCCCGGAA TCCGGGCCTG TGGCGTGAGT TCAACACCGG CGAGAACCTG ATCTTCTCGG CCGCGGACTT CGCCGACCCG TCCGCCTCGG CCCTGTTCGC ACGCCTCGAG CGGATGCCCG GGGTCGCGGC CGCCGCGCGC CGGCTGCGGG ACGCCTGCCT GGTCGACTTC CAGGACGTGC CGGCGGTGCT CGACGCCGAC GCGCTCGCCC CGTACCGGTC GGGCGCGGTG GGCGGGGCCG GCTGCGCCGG GGTGACCGGC CCGGCACTCA CGGCCGGCCC GGCGCCCGCG GGCCGCGCGG CACCCGTGGA CCGTTCAGCG CTTCCGGCCG GCCCGGTGCT CCCGGCGGCT CCGCCGCGAC GGCAGCATCA GCCACATCAG CCACATCAGC CACATCCGGT GCGACCTGTG CAGCCGGCGC GGCCGGCTGG TCCGGTGCCG GTGGGCGGGC ACGAGGCGGT GACACGGCAC CCGCGTCCGG GCGGCGCCGT GGGTGTGGTC AGCACGGTGA GCGCGGCGCG TTACCTGCGC ACCCGGCCGC TGCGGCCGGC CCCGGTGCCC GTCGGTGGGC CCGGGGTGCT CTCCGCCCTC GACCGGGCCG GTCTGCTCGC CCGCCGGGGG GAGGAGATCA CCGTCGTCGG CCGGGTCGTC CGGGTCCGGC AGCTGGAACG CACCGGCATG ATCACGGTCC TGGAGTTCGG GGACGACCGT GGCGGCGGCT TCAGTGTCGT CGGGTGGGAC CGGGTCAGCC GGGAGCTCAT CGCGACCCAC GGCGACCCTG CGCGGCTCGC GGGCACCTGG GTGCGGGTGA TCGGGCAGTT GGCCGTGGAC GACCGGGGGA GCTCGTTTCA CGGGCCCACC GCGACGTCGC CGGCGCGGCT CGTTCGGCCG ACCGGGCCGA GCTGGTCCAG CCAGTCCAGC GGCTCGGGCG GCGACGGCCG CGACGGCCGG GGGAGCCGCG ACGGCAGCGG GAGCCGGGAC AACCGGCGGG GCCGATCCGA CGCTGCCGCC GGGCGGCCGC CCGCGCCCCG GATCGAGCTG CGCCGGGGGA GCCTGCTGCG GGTGCTGACC GAGCGCCAGG CCGAGAGCCT GCTGAACCGG CCCGGGACGG TGCCCGCCGC GCCCGCGACC 
CGCTCGGCCT ACGGATCGGG GCCGGTCCTG CCGCCCCGCC GGCCGCTCCC TCCGCCGCCG GTGCCGGGGC CGACGCTGAC CCGTTCGCCG TCGTGGCGCG GGTGGAGCTG A
|
Protein sequence | MKLPTLLDYQ QAVQIPRLAF LDDQLRESVP RLTPLGMPAV ATGGFALTFD VSLGGRRYAV RCFHRHSDNL ELRYACIAEF VRSAALDFLV GVDYQPAGIR VRGHCWPIVR MEWIDGARLD DWVQENLDRP AHLDRARLGL GFAVAELRRR GAAHGDLQHG NILVLPDRSV RLVDYDGMYL PGLSGLGASE RGHRNYQHPD RSNQYDVTLD RFAEEVITVS LAALARNPGL WREFNTGENL IFSAADFADP SASALFARLE RMPGVAAAAR RLRDACLVDF QDVPAVLDAD ALAPYRSGAV GGAGCAGVTG PALTAGPAPA GRAAPVDRSA LPAGPVLPAA PPRRQHQPHQ PHQPHPVRPV QPARPAGPVP VGGHEAVTRH PRPGGAVGVV STVSAARYLR TRPLRPAPVP VGGPGVLSAL DRAGLLARRG EEITVVGRVV RVRQLERTGM ITVLEFGDDR GGGFSVVGWD RVSRELIATH GDPARLAGTW VRVIGQLAVD DRGSSFHGPT ATSPARLVRP TGPSWSSQSS GSGGDGRDGR GSRDGSGSRD NRRGRSDAAA GRPPAPRIEL RRGSLLRVLT ERQAESLLNR PGTVPAAPAT RSAYGSGPVL PPRRPLPPPP VPGPTLTRSP SWRGWS
|
| |
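The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, both of which are consistent with the values listed above. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table above.
length = gene_length(5090279, 5092189)
print(length)                  # 1911, matching "Gene Length"
print(protein_length(length))  # 636, matching "Protein Length"
```

Because the gene lies on the minus strand, the listed gene sequence would correspond to the reverse complement of the replicon between these coordinates; the length calculation is unaffected by strand.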