Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6261 |
Symbol | |
ID | 5674580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7605861 |
End bp | 7607162 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245113 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001510509 |
Protein GI | 158318001 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.559875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACA ACGGGGGAGC AGGCCGGCGA ACCGCCCGGC GCTGGCTGTG GATCATCGCC GGGATTGCAG CGGCTGGAGC GGTGTGCGCG GTGGTCGGGA TCTGGCACCT GCCCGCACGG ATGTACCCGG ACGCTGGCGA CGCTGACGCG CGGGCAGCAC TGCAGAGCGG ACTGCTCACC GCCGCCGCTG CGCTCACCGC CGTAGCTGGC GGGCTCATCG CCCTCGACGA AACCCGCCAA GCCAACGCCG AGGTACGGCG GGCGAACGCC GAGGTACGAC GGGCGAACGA GAACACCCAT GTGCGGGAGC TGTACGTAGA GGCAGTGAAG CTCCTCAACG ACACCGAGCG AGGGATCCGC CTGGCCGGGA TCTACGCCTT GGAACGCATC GCGGTGGACT CCCCGGCTGA CCAGCGCACG GTGGTGGAGG TGCTGTCGGC GTTCGTGCGT GACCGCAGCA CCGACCCCGC CCTACGTCTG CCTCCAACAC CTGGCGAGGA CGGCACGGTC CCGCCGGTAC GGGCCGTGGC GGACATCCGC GCCGCCGTCC AGGTCCTGGC CCGGCTCCCG GTCCGCAAGG GCATCCCGCG TTCCGACCTG ACCGGCGCCA CTCTCACCGG ACCGGCCAGC CTCGCCCACC TCACCCTCAC CAACGCCAAC CTCACCGGCG CGCGGCTAGA CGGCGCGGAC CTCGCCGGCG TCCGGCTGGA CGGGGCGAGC CTCACCGGCG CCGTGCTGCT CGGCGCGACC CTCACCTTGG CCTCGCTGGA CGGGTCGGAC CTCACCGACG CCGGGCTGGA CGGGGCGAGC CTCACCCACG CCCATCTGAA CAGAGCGGAC CTCACCGGCG CCACGCTGAA CGGGGCGGAC CTCACCTGGG CCCAGCTGAA TGGCGCGGAC CTCACCCGCG CCCAGCTGAA CGTGGATGTG TCCCTTACCC ACGCCTCGCT GCGCGGTGCG ACCCTCACCG GCGCCGAGCT GTCCGGCGCG AGCCTCACCA GCGCCACGCT GGACGGGACG GACCTCACAG ACGCCACGCT GGACGGCGCG GACCTGACCG ACGCCACGCT GAATGGCGCG GACCTCACTC GCGCCTCGCT ATTCCTCGCG ACCCTCACCG GCGCCTCGCT GGACGGCGCG ACCCTCACCC ACGCCCGGCT GGGCTGCGCG ACCCTCACCG GCGCCTTTGG GCTGTCGCAG GGACAGGTGG ATGCCACGCA GGGGGATGAG CGGACACGGT TGCCGGAGGG TCTGGTCCGG CCGGCGTCAT GGCCTCCAGA GGAGCCGCCG GACGGCGGCT GA
|
Protein sequence | MVDNGGAGRR TARRWLWIIA GIAAAGAVCA VVGIWHLPAR MYPDAGDADA RAALQSGLLT AAAALTAVAG GLIALDETRQ ANAEVRRANA EVRRANENTH VRELYVEAVK LLNDTERGIR LAGIYALERI AVDSPADQRT VVEVLSAFVR DRSTDPALRL PPTPGEDGTV PPVRAVADIR AAVQVLARLP VRKGIPRSDL TGATLTGPAS LAHLTLTNAN LTGARLDGAD LAGVRLDGAS LTGAVLLGAT LTLASLDGSD LTDAGLDGAS LTHAHLNRAD LTGATLNGAD LTWAQLNGAD LTRAQLNVDV SLTHASLRGA TLTGAELSGA SLTSATLDGT DLTDATLDGA DLTDATLNGA DLTRASLFLA TLTGASLDGA TLTHARLGCA TLTGAFGLSQ GQVDATQGDE RTRLPEGLVR PASWPPEEPP DGG
|
| |