Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1656 |
Symbol | |
ID | 5670058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1978461 |
End bp | 1980530 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240574 |
Product | hypothetical protein |
Protein accession | YP_001506000 |
Protein GI | 158313492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.329518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.419184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGTTGCC TGATCGTCAC GCTGTCAGAG CGCCGATCAA GCCGAAAAGA GCAGCCGGTC AGTAAAGTTT CGTGGGTGTT TCGACCGGAC CTGGAAAGCC GTCAGATTTC TCTCCCCGAG CAGACCGTCG AGCTGCCGGC GGACCTCCTG GATCCGCTCG CGCACGTCGT CGTGGAGGTA TTTCCGCAGC TGGTGGCCCA TATTCCGCCC ACCCCCGCGG CTGAGCTCGG CCGGCTCGGC GCGACGGCGC CCGCTCCGGT CCCGCTCCCG CCCGCACCCG CCGGACCGGT GTTGACCGCG TTGCGTTCGG CCGGGCCGCG CCTACCGGAA CGACTGCTCG GCGCGCTCGA GCTCGCCATC GACGAGCTTC CGCTGCGCGC CCCGCGCGGC CTGGCCAGGG GCCTGGTCGA CGCCCCGCCG AACGGGCCGC GCGGCAGGAA CGGATCGCGC GCGTCCGCGG CCGCCGCAGG CGAGTCCGAG GCGGTCCAGG CGGTCGCGAT CCTGGATGAG GTCCATCCCG GGGTGTCCGA TCTGATCCAC AGCTACCTGC GCGCGCTGGT CGAGCATCCC GCCGTGGCCC CGCTGCTGGC GGCGGACGAC CCTGCCGCGG CCGGCCCGCT CGACGGCCAC GGCTCGGACG ACCGCCCGCT CGACGGCACC GCATTCGGCA GCACCGCGAA CGGCGGGTCC GCGTTCGGCG GGTCCGCGTT CGGCGGGAGC CTGTTCGGGC CGCCGGACGA GCCGGCCGAC GAGCCCACCC TGGCCGCCCG GCACGGGGCC GCCCACCTGG CGCTCGCGGT CACCGTCGCC GCCGCCGTCC TGCGGGAGCT GGACCCGCCG ATGATCGGCA CCGGCGCGCC GGGGATCGTC GGCACGGCCG TGGGGTCCGT CGCGCTGGTG CTGCCGGGCC GGCCGATGCC GCTCGCCTAC CCGGCCGCGC TGCTGGCCCG GCGCCGCGCG GAGTACCGGC TGCCGCGGCA GGCCGCCGGC TGCGTCACCG TCGACGGGCA CTGCTTCGGG CTGGTGGAGG GCGAGCTGCC GAATCCGCCG AGCTTCGCCC GCAACGGCCT GGTGGAGGCC GTCTCCGGCG GTGTCCTCGT CCGGACCGGC ATGGGCACGG GCCGGGTACG GGTGTCGCTG CGGGTGCTGG CGACACCGCC GGCCCCGCCG GTCCCCGCCG ACGCCGTCCG CTGGGACGAG ATCGTCGACG TGAGCTGGAC GGCCGCGAAC GGCGCCGCCG CGGTGGCCGG CGCCGGAGCG AGCACCGCCG GCGGCACCAC AACAACGGGC GTCGGCGCCG GGGCCGGCTC AGCCACCGAC CTCGGCCACC TGACGACACC GCCCTGGCCC GGGGACTACC GGCTGCGGGT TTACGCCCAC GGCCGGGACG GCGCCGGCGA GGACGAGACC TACGAGCTGG TGGTGTGGAG CGCCCCGGCC GCGCCCGAGA CCGTCCACCG GCGCACCGAC CAGCTGGGGC ACCGGCTGCG CGGCGAGGAG CTTCCGCCGG TCGTGACGGT GCCGGAGACC CGCTACCGGT GGGTGCGCCG GCGCAGCGCC TTCCGCGAGG CCGCCACGTT CACCATCGTG GTCGGCGCCT CGCCCGAGGA CGTCGTGCGC TGCTTCGACG CCGATCCCGG CGCGCCGTGC TCGCTGTCCC GGCTGCGCGC CGACCGGCGC ACCGACCCGT ACGTGCTGGT CCTGCCGCTC GACGGCGACG ACCGCGCGGT GCTCGCCGTC GAGGACAACG GCTTCCAGGG CTCCCGGCAC CCGGTGCTGT CCGCGGTCTC CCGGCACGGC CTGGCGGCGA GCATGTTCTG GAACATCAAC GCGCTCACCC GGCTCTCGCT CGCCCGGGAC GGCGAGGTGC TCGCCGCGTT CGAACCGGGG CCGGACGCCG TCCCCGACGC GGTCGTGCCG CTCCTGCGGG ACGTCGACCT GGCCGGCGCT ACGGACCGGG TCGCCAAGGG CCTCGTCGTC GTCGAGCGGT TCACCGGCCA TCCGGTCCTC TCCGAGCACC TGGACCGGAT CATCGAGAAC GACGTGGCAT ACCTGATCAA CCAGCACTGA
|
Protein sequence | MCCLIVTLSE RRSSRKEQPV SKVSWVFRPD LESRQISLPE QTVELPADLL DPLAHVVVEV FPQLVAHIPP TPAAELGRLG ATAPAPVPLP PAPAGPVLTA LRSAGPRLPE RLLGALELAI DELPLRAPRG LARGLVDAPP NGPRGRNGSR ASAAAAGESE AVQAVAILDE VHPGVSDLIH SYLRALVEHP AVAPLLAADD PAAAGPLDGH GSDDRPLDGT AFGSTANGGS AFGGSAFGGS LFGPPDEPAD EPTLAARHGA AHLALAVTVA AAVLRELDPP MIGTGAPGIV GTAVGSVALV LPGRPMPLAY PAALLARRRA EYRLPRQAAG CVTVDGHCFG LVEGELPNPP SFARNGLVEA VSGGVLVRTG MGTGRVRVSL RVLATPPAPP VPADAVRWDE IVDVSWTAAN GAAAVAGAGA STAGGTTTTG VGAGAGSATD LGHLTTPPWP GDYRLRVYAH GRDGAGEDET YELVVWSAPA APETVHRRTD QLGHRLRGEE LPPVVTVPET RYRWVRRRSA FREAATFTIV VGASPEDVVR CFDADPGAPC SLSRLRADRR TDPYVLVLPL DGDDRAVLAV EDNGFQGSRH PVLSAVSRHG LAASMFWNIN ALTRLSLARD GEVLAAFEPG PDAVPDAVVP LLRDVDLAGA TDRVAKGLVV VERFTGHPVL SEHLDRIIEN DVAYLINQH
|
| |