Gene Information |
Locus tag | Franean1_0521 |
Symbol | |
ID | 5668940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 606994 |
End bp | 608250 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239450 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001504888 |
Protein GI | 158312380 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
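
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the stop codon; the record does not state either convention. The function names are hypothetical and used only for illustration.

```python
# Values copied from the record above.
START_BP = 606994
END_BP = 608250


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1257 418
```

Under these assumptions the result agrees with the Gene Length (1257 bp) and Protein Length (418 aa) fields.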
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTTTT CCGCTAAGCG CGACTTTCTG ATTACTGAAA GGAGAGTGGA GTATCCAGTC GCTGGGCGGC GGTTGCGGAT CGCTACTGGG GCCGCAGCGT CGGGGGCTGC GCTGGCGACG GTGGGGATCT TCTACCTGCG CGCCTCGATG CACCCGTCTG GCCTCGGCGG AGAAGCGGAG ACTCGGGCGA TGGTGGAGTG CGTGTTGCTG ATTGCGTCGG TCGCGCTAGC GACGCTGGCC GGCTACCGGG TCGCCGCGGA CGGGATGCGA CAAGCGTACG CGCAGGCATG GAAAGCGGGC ATCCGGGAAA AGGACAGCAA TGCCCACTTT TGTGCCCGGG AGTTCTTCGG CCGCGCGGTC GACCGGCTCG GCAGCGAGAA CAACACGACT CGCCTTGGCG GGATCTACGC CCTGGAGCGG ATCGCGATGG ACTCACCGAG TGACCAGCGC GCGGTCGTCG AGGTCCTCTC CGCCTTCATC CGCACCCGCA GCACGGACCC CACGCTGCGG CCCGCCGTGT CCGGTCCGGT CGTTCCGCTG CGCCCTGCCG TGGATATCCA CGCCGCGGTG GCGGTCCTGG GGCGTCTGTC CGTCCTCGAC GGTGTCCCCC GCGCGGACTT GAGCGGTGCG AAGCTGACAG GTCCCGCCGC CCTGCACTGC ATACAGGCCA GCTATGGCAA CCTCAGCAAC ACCGACCTCA CCGGAGCGGA CCTCAGCCGC GCCCATCTGG GTCGGGCGGA CCTCACTGCC AGCCGGCTGG GCGGCACGGA TCTCACGGGC GCCTCGCTGA ACGAGGCCAA CCTCAGCTAT ACCTGGTTGG GCGGAGCGAA CCTGACCCGC GCCCGGCTAA GCGGAGCGGA TCTCACCGGT GCATCGCTAA GCGGAGCGGA CCTGACCCGC GCCTGGCTGG ACGGCGCGGA TCTCACGGGC GCATCGCTAG GCGGAGCGAA CCTGACCCGC GCCTGGCTGA CCGAGGCGGA CCTGACCCGC GCCTGGCTGG GCGGGGCGAA CCTCATCACT GCGGTGGGAC TGGTCCAGGA TCAGATCGAC GCGGCATACG GCGACGGGTG GACGCGGCTA CCGCCGGAAC TAACGAGACC GGCTTTGTGG ACCTCGGCCG AGGCTGACGA GTACCGCCCG GCAGACCCAC ACCAGGTTGT CGGACAGTGG CATCCGGAGG TGCTGGCGGG GCAGGACAAC GCCCTGTCCT CCCGGGACTA TCACCACACG ATCCTTCCGA AGGTCGTGAT TGTGTGA
|
Protein sequence | MHFSAKRDFL ITERRVEYPV AGRRLRIATG AAASGAALAT VGIFYLRASM HPSGLGGEAE TRAMVECVLL IASVALATLA GYRVAADGMR QAYAQAWKAG IREKDSNAHF CAREFFGRAV DRLGSENNTT RLGGIYALER IAMDSPSDQR AVVEVLSAFI RTRSTDPTLR PAVSGPVVPL RPAVDIHAAV AVLGRLSVLD GVPRADLSGA KLTGPAALHC IQASYGNLSN TDLTGADLSR AHLGRADLTA SRLGGTDLTG ASLNEANLSY TWLGGANLTR ARLSGADLTG ASLSGADLTR AWLDGADLTG ASLGGANLTR AWLTEADLTR AWLGGANLIT AVGLVQDQID AAYGDGWTRL PPELTRPALW TSAEADEYRP ADPHQVVGQW HPEVLAGQDN ALSSRDYHHT ILPKVVIV
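
The GC content and protein fields can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code during elongation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCATTTTT CCGCTAAGCG CGACTTTCTG"
print(translate(prefix))  # MHFSAKRDFL
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the result to be compared with the Protein sequence and GC content fields.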