Gene Information |
Locus tag | Franean1_1547 |
Symbol | |
ID | 5669950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1848044 |
End bp | 1850014 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240466 |
Product | hypothetical protein |
Protein accession | YP_001505892 |
Protein GI | 158313384 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
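The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (reasonable for a non-spliced CDS, but an assumption about this record). The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1848044
end_bp = 1850014
gene_length_bp = 1971
protein_length_aa = 656

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon, so the
# protein has one residue per codon, minus the stop.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.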
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.736781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.90669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCATCG CGAGCAGATC GGGCCCACTG CCGGGCCCGT TGACGACGTC GGACCGCCGG GAGCCAGACG GCCCGGGCGC CCGCGGTAGT GGTCGGCGGG ATGACCGGCG CGCCGACAGA CCGCGGGCCG CCGACAGCAC TCGGCCGGCC GACGGCCCAC GGATCACCCG CGGGCCGCGG CGTGCCGACA GCCCGCGGCG TACCGAAGGC CCATGGCGTG CCAGCGGGCC ACGGGTCGCC GCGCTGGTGA CACTGCTGCT CGGCCTCGTG GACATCGCGG CCGTGTTGAC GCCGGGGTGG CACTCGCGGC TGGAGGCGCT GCGGGAGCTG CTGCCCGCCG CCGCCTCCCG GCAGGCCGCC GCGTTCACCG TCGTGGTCGG AATGCTGCTG GTGCTGCTCT CGGCCGGGCT CCGGCGGCGC AAACGGCGAG CGTGGCGAGC CGTCGTGATA CTCCTCGGCT CGAGCGTGAT CCTGCACCTC GCCCGGGGGC TGGACTACGA GGAGGCTGCC GGGTCGGCCG CGCTGTGCGT CGCCCTGCTG CTGGCCCACA GGCAGTTCCA GGCCAAGGGC GATCCGACGA CGCGCTGGCG GGCGGCCGGC GTGGGGCTCC TGCTCACGGT CGTCTCGATC GGGGTCGGCC TGCTGCTGCT GAACCTGCGC GGCAGTCGGA TCGCCGGCCC CCATCCACTG TCCGCGGAGC TGGAGCAGAT TGTCCTGGGG CTGGTCGGCA TTCCGGGCCC GCTGGGGTTC AGCTCCGCCC GCTTCGCCGA CCTGGCCAAC CGGATGCTGC TGACGATGGG TGTACTGACC ATCGGGTCCA CCGCCTATCT GGCGCTTCGC CCGCCCGAGC CACGGCCGAG ACTGACCGAC GCGGACGAGG CCCGGGTGCG GGACCTGCTG GCCGGTCACG GCTGCGCCGA TTCGCTCGGG TATTTCGCGC TGAGATCCGA CAAGTCGGTG ATCTGGTCGC CGACCGGGAA GTCCTGTGTG GCCTACCGCG TGGTCTCCGG GGTGATGCTC GCCAGCGGCG ATCCGCTGGG TGACCGGGAG GCGTGGCCAG GCGCCATCAG GGAGTTCCTG CGCGAGGCGG CCGACCACGC CTGGACGCCC GCGGTGATCG GCTGCTCGGA GGCGGGCGGA ACCGCCTGGA CCAGGGCCGG CCTGTCCGTT CTCGAGTTCG GCGACGAGGC CGTCGTCGAG ACGGCCGGTT TCACCCTGGA GGGCCGGACG ATGCGTAACG TCCGGCAGGC CGTCGCCCGA GTCGAACGCG CCGGCTACAC GGTGGACATC CGACGAGTTC GCGACCTGAC GCCGCAGGAT GTCGACCGTC TCAAGGCACA GGCCGCGGCC TGGCGGGGCA CCGAGACCGA GCGGGGATTC TCCATGGCGC TCGGCCGGAT CGGCGGTGCG TCGGACGGCG ACTGCGTCGC CGTGATGGCT TTCTCCACGG ACCCGGACGG CGCGGACCCG GACGGCCCGA ACCCGGACAG CGCGGGCCCG GACAGCGCCG AGCCGCGGCT GCGCGCGCTG TTGCACTTCG TGCCGTGGGG ACGGACGGGA CTTTCACTGG ATGCGATGAT CCGTGACCGG ACGGCGGACA ACGGGCTGAA CGAGTTCCTG ATAGTCAGTG CCCTGCGTCA GGCCGGCGAC CTCGGGGTCG AGAGGCTGTC CCTCAACTTC GCGTTCTTCC GGTCCGCGCT CGAACGCGGT GAGCGCCTCG GCGCCGGGCC GGTGATCCGT CACTGGCGCG GCCTGCTGAT GTTCTTCTCC CGCTGGTTCC AGATCGACAG CCTGTACCGG 
TTCAACGCGA AGTTCCAACC TGTGTGGCTG CCCCGCTACG TCTGCTATCC GACGTCCGCG GAGCTGCCCC GGATCACGCT GGCGATGCTC AGGGCTGAGG CCTTCCTCGT CCGGCCACGC TGGTGCTCCC GCCTCCCCCG GCCCTCCCGG CCTGCCAGGC GCCCGGGGTG A
|
Protein sequence | MAIASRSGPL PGPLTTSDRR EPDGPGARGS GRRDDRRADR PRAADSTRPA DGPRITRGPR RADSPRRTEG PWRASGPRVA ALVTLLLGLV DIAAVLTPGW HSRLEALREL LPAAASRQAA AFTVVVGMLL VLLSAGLRRR KRRAWRAVVI LLGSSVILHL ARGLDYEEAA GSAALCVALL LAHRQFQAKG DPTTRWRAAG VGLLLTVVSI GVGLLLLNLR GSRIAGPHPL SAELEQIVLG LVGIPGPLGF SSARFADLAN RMLLTMGVLT IGSTAYLALR PPEPRPRLTD ADEARVRDLL AGHGCADSLG YFALRSDKSV IWSPTGKSCV AYRVVSGVML ASGDPLGDRE AWPGAIREFL REAADHAWTP AVIGCSEAGG TAWTRAGLSV LEFGDEAVVE TAGFTLEGRT MRNVRQAVAR VERAGYTVDI RRVRDLTPQD VDRLKAQAAA WRGTETERGF SMALGRIGGA SDGDCVAVMA FSTDPDGADP DGPNPDSAGP DSAEPRLRAL LHFVPWGRTG LSLDAMIRDR TADNGLNEFL IVSALRQAGD LGVERLSLNF AFFRSALERG ERLGAGPVIR HWRGLLMFFS RWFQIDSLYR FNAKFQPVWL PRYVCYPTSA ELPRITLAML RAEAFLVRPR WCSRLPRPSR PARRPG
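The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown here, in whitespace-separated blocks of 10 bases. The function name `gc_fraction` is hypothetical, and the demo string is only the first two blocks of the gene sequence, not the full gene, so its result is not expected to equal the 73% listed in the record.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = "GTGGCCATCG CGAGCAGATC"
print(f"GC fraction of demo blocks: {gc_fraction(demo):.2f}")
```

To check the record's value, pass the complete gene sequence string to `gc_fraction` and compare the rounded percentage with the GC content field.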