Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4250 |
Symbol | |
ID | 5672605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5063699 |
End bp | 5065603 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243123 |
Product | hypothetical protein |
Protein accession | YP_001508540 |
Protein GI | 158316032 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTACAGCG GTTCCGCATG GCTAGTGAGT ACTCGGAAGG GCGAGATCGT CCGGGTCAAC GGCCTGGCGG GTCGGGTTGA TGCCGCTCCG GTCGGGATCA CTCGCCCGGG AGACGCCGTT CAGGTCGTCC AGACGGACGA TCTTGTTCTC GTAGCCGCGA ACGCCTACCT GTCCAGCATC GATCCACGGC TGTTGGCCCC GACCCGGGGC GCAAGCCTGC AATCGGCCGG GGCCAGGATT CTCGCCACAG GGAAAAGGGC ATATGTGCTC GATCCTGCCA GCGAGACTGT CTACCGGCTT GATCCTCGGA CACTGCGACC CGCCGGTCCG ATGGTTGCCC TGCCTGGCCG CGCGGGTGAC ACGGCGCTCC AGGAAGGCGG TGAACGGCTC TGGGTGGCCC TGCCCGACCT GGGCGCCCTG ACGCTCGTCG AGAACGACGT GGCCGGTATG CCAGTGCCGG CCGGAGCACC CGGGCCGCAT CTGGTGTTCG CCCGGGTGGC CGGCCAGACC TGGGTTCTTA ATGGCAACGA CGGGACCGCG GGACAGCTGC GGGCCGACGG CGCCACGAGC CGCCGAATAC AGCTCGGCGC CGACTACGAG GGCACCGCGC TGCTTCCCGC CGCGGGGGAC AGCCCGCTGC TGGTTTTCGC CCTGCCAGGT TCACGCCGGC TCGCGGTGCT CGAACCCGGT CGCGAACAGC CTCGGATCGA GAGCATTCCG GTCGCGGTCG GGCCGCTGGG CGCACCACTC GTCGCCGACG GGCTGGTCTA CGTCCCGGAT GAAGGCAGCG GACGCCTCGC TGTATACGAC CTGGCACGCC ATGAGTTCAA ATCGCCCATC TCGGTGACGG CGACCGCCGC CACCGACCTC GAGCTCTTCC GGGCCGGCGG GATGGTGTGG GCCAATGCCA TGTCTACGCC GGATGCGGTG GCAGTGTATG ACGGAGTCGT CCACCGAATC GTCAAGTACG GGGAACCGGC GGCACCCCCG ATCGCTCCGT CGCCGACCAG AACACCTGCC GCCACCGCTC CGCCGAATCC GAATCCGAAT CCGAAAACAC CGGCCAGCGC GATCCCCACG CGTTCGGTGC CTGGTACGCC GTCAGCGGGG CCGGCTGCGG GCGGCGCCGG CCCGTCATCC GCAACACCTC CAGCCCGGGC CACCTCACCC GCCCGGGGTG GCGACGCCGG GCCCTCCGGT GAGGACGGAG CGGACGACGC GGCCACCGGC CCGCCCGAGA GAGTCCCCGA CCTCACCGGC GACGACTCGA CGGACTCGGC GCGCCGGCCG GGCGAGAACG GTGGCATTCC GATCACTCGT CGGAATGGGC CGTATACGCA CCAGACCGCG TTTGGTCGGG TCGTCGTCGA GGCCGGAAAC GGCTACGTGG ATGTTTCGTG GGAGCTTCCG GCTGGCGAGG GCGGCCCAAC CGACCTCGCG TGGATGGCCG GTGCCCGGGG TGGCGGTGGT GCCGGCAACG GCGGGCCGCT GCCGCCCGGC AGCACCTCAA CGCGCTTCAA CGTCACCTAC ACCGGCGACA CGCCGACCCT GACTTTCTCA TCGGCCGCGA ACACCTTCTC CTTTGATATC CGGGCCTGGG AGCTGTGTGA CTTCTGCAAC TACGGACACC CCACCTATAC GGTGCCGCTT CGCGACGCCC CACACGGTAC CCCGATCGGG ACGGCGCTTC CTCCGGTACC GGACGGGCAG ATGGGAAATC AGGTCGAACT GCACTGCGTG GCGGAATCCA CCGTCGAATA CGCTGACCCG GTCTATCCCG GTCGGGTTGG CACATTCGCC TGGTACAAGA TTACCTATCA GGGGGCCACG GGATATGTGC CGACGAACTA TATCAGCATT CCCGACACTG GGATGGAACC ACGATACGCG GTGCGTCCCT GCTGA
|
Protein sequence | MYSGSAWLVS TRKGEIVRVN GLAGRVDAAP VGITRPGDAV QVVQTDDLVL VAANAYLSSI DPRLLAPTRG ASLQSAGARI LATGKRAYVL DPASETVYRL DPRTLRPAGP MVALPGRAGD TALQEGGERL WVALPDLGAL TLVENDVAGM PVPAGAPGPH LVFARVAGQT WVLNGNDGTA GQLRADGATS RRIQLGADYE GTALLPAAGD SPLLVFALPG SRRLAVLEPG REQPRIESIP VAVGPLGAPL VADGLVYVPD EGSGRLAVYD LARHEFKSPI SVTATAATDL ELFRAGGMVW ANAMSTPDAV AVYDGVVHRI VKYGEPAAPP IAPSPTRTPA ATAPPNPNPN PKTPASAIPT RSVPGTPSAG PAAGGAGPSS ATPPARATSP ARGGDAGPSG EDGADDAATG PPERVPDLTG DDSTDSARRP GENGGIPITR RNGPYTHQTA FGRVVVEAGN GYVDVSWELP AGEGGPTDLA WMAGARGGGG AGNGGPLPPG STSTRFNVTY TGDTPTLTFS SAANTFSFDI RAWELCDFCN YGHPTYTVPL RDAPHGTPIG TALPPVPDGQ MGNQVELHCV AESTVEYADP VYPGRVGTFA WYKITYQGAT GYVPTNYISI PDTGMEPRYA VRPC
|
| |