Gene Information |
Locus tag | Franean1_4644 |
Symbol | |
ID | 5672987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5541797 |
End bp | 5543047 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243502 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001508918 |
Protein GI | 158316410 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
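The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon (consistent with "Is gene spliced | No"). The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Franean1_4644.
length = gene_length_bp(5541797, 5543047)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.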
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA GCGTAGTAGC CCGCCGGCGC ACCGGGCGCC GCTGGCTGTG GATCGTGGCC GGGGTTGCGG CCATCGGGGC GGTGTGTGCC GTGGTGGGGA TCTGGCATCT GCCAGACCGG ATGTACCCGG GTACCTACGC CGAGGTGGGG GAGGCTCGCG CTGCGTTGCA GGCCGGGCTA CTGACTGCTG CCGCCGCTCT GACCGCCGTA GCCGGTGGGC TGATTGCCTT GGACGAGACC CGGCATGCCA ACGCCGAAGT GCGGCGGGCG AACGCGAACA CTCATGTCCG TGAGCTGTAC GCGACCGCGA TTGGTCTTCT CAGCGCGGAT GACATCGATA GCCGCCTTGG TGGGATCTAC GCGCTGGAAC GGATCGCTCG GGATAGCGCG GCTGACCATC GTATCGTCGT GGAGGTGCTC TCGGCATTCC TGCGCGAGCA CACCCAGCCC GCTTCGGTGC TCGAGCAACG GCCACCTCCC GGACGACGTT GGAGACATCC TCCGGTCGGA GCGGGTGGTG ACGACGAGGG CCGCGTCCGA CTGCGGACGG ATATGCATGC CGCGTTCGCG GTCCTGGGGC GGCTCCCTGT CCGGCCCGGA GCGCCCCCCG CTGACCTGAC AGGCCTTCAT CTGGGTGCGG CAGACCTGGC TGACGTTCAG CTGACGGGCG CAGATCTCAC CGGCGCCCAG CTTGCTGGCG CAAATTTGAC CAATGCCTGG CTAAGTGGAG CTAACCTCAC CCGAGCACAT CTTGACGGCG CAGTCTTGAC CGACGCCCGG CTGGATCGGG CTGATCTCAC TCGGGCCCGG CTGGGAGGGG CGGACCTCAC TCGAGCCTGG TTGCAGCATG CCAACCTCAC CCGAGCGCAG CTTGGCGGCG CTAATGTGAC CGACGCTCGC CTGGTTGGCA CGGACCTTAC CGGAGCCCGA CTAGATGGTG CCAACCTCAC CCGCACCTGG CTGGACGGTG CAAATCTCAC CGGCGCCCGA CTGGAAGGGG CGAAACTCGT CAACGCCTGG TTGGAAAGGG CAAACCTCAT CGGTGCCCGG TTGATTGGAG CGGATCTTGA TGGGGCATGG CTCAATGGAG TGGACCTTTT GGGTGCCTGG CTGAACGGAG CGGACCTCGC TCGCGTTGTG GGATTGTCGC AGAGCCAGCT GGATGAGGCG CGGGGCAACG ACGAGACGCG GATACCAGAC GGATTGGTAC GGCCAGAATC ATGGACGTCG GGGGACGGCA GTGGGGGATG A
|
Protein sequence | MADSVVARRR TGRRWLWIVA GVAAIGAVCA VVGIWHLPDR MYPGTYAEVG EARAALQAGL LTAAAALTAV AGGLIALDET RHANAEVRRA NANTHVRELY ATAIGLLSAD DIDSRLGGIY ALERIARDSA ADHRIVVEVL SAFLREHTQP ASVLEQRPPP GRRWRHPPVG AGGDDEGRVR LRTDMHAAFA VLGRLPVRPG APPADLTGLH LGAADLADVQ LTGADLTGAQ LAGANLTNAW LSGANLTRAH LDGAVLTDAR LDRADLTRAR LGGADLTRAW LQHANLTRAQ LGGANVTDAR LVGTDLTGAR LDGANLTRTW LDGANLTGAR LEGAKLVNAW LERANLIGAR LIGADLDGAW LNGVDLLGAW LNGADLARVV GLSQSQLDEA RGNDETRIPD GLVRPESWTS GDGSGG
|
| |
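The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for amino acids. The `translate` helper is hypothetical, and only the first 30 bases of the record are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGGCCGACA GCGTAGTAGC CCGCCGGCGC"
print(translate(prefix))
```

The result for this prefix matches the first ten residues of the listed protein sequence (MADSVVARRR). Passing the full gene sequence in the same way would allow the whole protein to be compared.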