Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4278 |
Symbol | |
ID | 5672633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5116082 |
End bp | 5117317 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641243151 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001508568 |
Protein GI | 158316060 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0535647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.226875 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA ACGCCGGTGT GGCCCCCGGC GCTCATGGAA CCTGCTCCTG CGGCGGCGCG GCCGGCGTCG CCACCAGCGG CGCCGCGCGA CGGTCCGCGG GCCGGCGCTG GCTGTGGATC GTCGCCGGGC TGGCCGCGGC CGGCGCGGCG GCGGCCGCGG TCGGGATCTG GCATCTCCCC CCGCGGATGT ACCCGGACCC AGGCGACACC GACGCGCGGG CGGCCCTGCA GGGCGGCCTG CTGACCGCGG CCTCGGCCCT CATCGCCGTG GCCGGCGCCC TGGTCGCCCT GGACGAGACC CGGGTGGCCA ACACCGAGAC CCGGCGGGCG AACGAGGCGG CCGACGAACG CGAGCGGCAG GCCTACGCGA ACACCCACGT CCGCGAGCTC TACACCCGGG CGATCGACCA GCTCGGCTCG GACAGCGACA CGATCCGCCT GGGCGGCATC TACGCCCTCG AACGGATCGT CGCCGACAGC CCCGCCGACC GGCGGGCCGT CGTCGAGGTC CTCGCCGCCT TCGTCCGCAC CCTCAGCACC GATCCCCGGC GCGCCCCGGC ACCCGCCGCA CCCGCCGCGC CGTCCGCCAA GCCCGGGCGG CGCGGGCCGT CCCGGCCGCC CGCCGTCGAC ATCCGCGCCG CCGTCGGCGT CCTCGCCCGG CTCCCGCACC CCGCGGACCT CACCGGCACC AACCTGACCG GGCTCACCGG CCTCACCGGC CACGCGGATC TTCCCGGTGC CCCCAGCCTC GCCCACCTGA CGCTCACCAA CGCCACCCTG GCCGACGCCC GGCTGGCCGG GGTCGACTTC ACCGGCGGCA GCCTGGACGA CGTCGATCTC GCCCGCGCCG ACCTGCGCCG GGCGAACCTC ACCGACGCCG AGCTTGTCGA CGCGGACCTC ACCGGCGCCC GGCTCGCCGA CGCGACCCTT GCCGGCGCCC TGCTCTTCCG GGCGACCCTC ACCGGCGCCC AGCTGGGCCG GGCCGATCTC ACCGGCGCCC AGCTCGGCGG CGCCGACCTC ACGAACGCCG TCCTGGACGA GGCGATCCTC GCCGACGCCG TCCTCTCCGG GGCGAACCTC ACCAACGCCC GACTGGACGG CGCCGACCTC ACCGCCGCCA CCGGCCTGGC CCAGAAGCAG GTGGACTCCG CGCGCGGCGA CCGGCGGACC CACCTGCCGG CGGGCCTGGC CCGGCCGGCG TCATGGGACA CCGCGGAAGG GCCGGCCGGG CAGTAG
|
Protein sequence | MTDNAGVAPG AHGTCSCGGA AGVATSGAAR RSAGRRWLWI VAGLAAAGAA AAAVGIWHLP PRMYPDPGDT DARAALQGGL LTAASALIAV AGALVALDET RVANTETRRA NEAADERERQ AYANTHVREL YTRAIDQLGS DSDTIRLGGI YALERIVADS PADRRAVVEV LAAFVRTLST DPRRAPAPAA PAAPSAKPGR RGPSRPPAVD IRAAVGVLAR LPHPADLTGT NLTGLTGLTG HADLPGAPSL AHLTLTNATL ADARLAGVDF TGGSLDDVDL ARADLRRANL TDAELVDADL TGARLADATL AGALLFRATL TGAQLGRADL TGAQLGGADL TNAVLDEAIL ADAVLSGANL TNARLDGADL TAATGLAQKQ VDSARGDRRT HLPAGLARPA SWDTAEGPAG Q
|
| |