Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2185 |
Symbol | |
ID | 5670585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2619002 |
End bp | 2620087 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641241106 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001506527 |
Protein GI | 158314019 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.225539 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCATGA CTGACAACGG GGGCACTCGT CGGCCAGCAG GTCGGCGGTC GCTGTGGATC GTTGCCGGGC TCGCCGCGGT CGGCGCGGTT GCCGCCGTGG TGGGGATCTG GCACCTGCCG GACCGGATGT ACCCGCCGGG CACCGACGGT GAGGCGGAGG CGCGTGCGGC ACTGCAGGGC GGTCTGTTGA CGGCGGCCGC CGCGCTGACG GCGGTGGCCG GTGCCCTGAT CGCACTGGAC GAGACCCGGC AGGCCAACGC TGAGGTGCGT CGCGCGAACG CAGAGGTCCG CCGGGCGAAC GAGAACACGC ATGTCCGGGA GTTGTATGCG ACGGCAATCG GGCTGCTGGG GGCGGACACG ATCGACAGCC GCCTTGGTGG GATCTATGCC CTGGAACGGG TCGCTGTCGA CTCACCAGCC GATCAGCGCA CCGTGGTGGA GGTCCTCTCG GCGTTCGTTC GAGTCCACAG CACCGACCCT GCCCTACGCC CTGCTGTCCC TGACCCAGCT TCTCCTGTAC GCCCGGCGGT GGACGTGCAC GCTGCTGTCA CCGTGCTAGC CCGTCTCCCC GTGATCCCCG ACATCCCACG TGCAGACCTG AACGGGGCAA AACTCACCGG TCCGGCCGCC CTCGACCGTC TCCAAGCCGC CCGCGGCAAC CTCGCCCAGG TCGAGCTTGC CGAGGCAGAC CTCCGCGGCG CCCGCTTGGA CGAAGCAGAT CTTGCCGACA TCAAGATGGT CGAGGTTGAC TTCACCGGCG CGCAGATGGT CGGGGCGAAC CTCGCCGGAG CACAGATGGT GGAGGCGAAC TTCGCTTGGG CCGAGCTGAC GAGAGCGGAC TTCAGTGGGG CGCAGCTGGT GCAAGCGGAC TTCACGGAGG CGCAGATGGT CGGGGTGAAC TTCACGGGGG CGCAGCTGGT GCAAGCGGAC TTCACGGGGG CGCGCTTGAA CGGTGCGAAC CTGATGAACG CTGAGGGGGT GTCGCAGGAG CAGGTGGACG TCGCCTTCGG GGACAGCGAG ACTCGCCTGC CGCCGGGGCT GACGCTTCCG GCGTCGTGGA CGGCTGGCGG TGCCAGTGGG TCATGA
|
Protein sequence | MPMTDNGGTR RPAGRRSLWI VAGLAAVGAV AAVVGIWHLP DRMYPPGTDG EAEARAALQG GLLTAAAALT AVAGALIALD ETRQANAEVR RANAEVRRAN ENTHVRELYA TAIGLLGADT IDSRLGGIYA LERVAVDSPA DQRTVVEVLS AFVRVHSTDP ALRPAVPDPA SPVRPAVDVH AAVTVLARLP VIPDIPRADL NGAKLTGPAA LDRLQAARGN LAQVELAEAD LRGARLDEAD LADIKMVEVD FTGAQMVGAN LAGAQMVEAN FAWAELTRAD FSGAQLVQAD FTEAQMVGVN FTGAQLVQAD FTGARLNGAN LMNAEGVSQE QVDVAFGDSE TRLPPGLTLP ASWTAGGASG S
|
| |