Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6262 |
Symbol | |
ID | 5674581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7607226 |
End bp | 7608452 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641245114 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001510510 |
Protein GI | 158318002 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.104366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.857591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACA ACGGGGAAGC ACGCCGTCGG GCTGGTCGGC GCGGGCTGTG GATCGTCGCC GGGATCGCCG CGGTGGGGGC AGTCTGCGCG GTGGTGGGGA TCTGGCACCT GCCGGACCGG ATGTACCCGC CAGGAACCGA CGGTGGGGCG GAAGCACGAG CCGCGTTGCA GGGCGGGCTA CTCACGGCGG CGGCTGCGCT CACCGCCGTG GCTGGCGGTC TGATTGCCTT GGACGAGACC CGGCGGGCCA ACGCCGAAGT GCGGCAGGCC AATGCCAACA CCCACGTCCG CGAGCTCTAC ACGGCCGCGA TAGGGCTGCT GAGCTCGGAT GCGATCGACA GTCGGCTAGG TGGGATCTAC GCCCTCGAAC GGATTGCGTG GGATAGTCCT GCCGACCAGT CCACTGTCGT CGAGGTCCTC TCCGCGTTTG TCCGCGAGCA CGCCCGACCC CTCACAGATG CGCCGGCCGG CCTCCCGGCC GAGATTCGGG GCCGAGGTGG TGGGGGTCTG CGCAGTCGAC GTCGTCGTGG TCACGCCGCT GGCAGGAGGT CGGAGGTCCG CCACCGGCTA CCGCCCTGGG ATCGATTTAT CCAGATAGGC CCATGGAGTA ACGAGGCTCC GCCACCCACT GATGTGCAGG CGGCTCTTAC CGTCCTAGGA CGTCTACCCG ACCTCGGAGG CTTTCGCGCC GACCTCACCG GAGCGAATCT TACCGGTGCC GAGCTAGAAG GCGCGAATCT CTTTCCCGCA CGGCTGACTA GGGCCACTTT TACCGGATCA CACCTAGGCA GGGTAAACCT TAAGTACGCC CAGTTGTACT TGACAAATTT CACCGATGCC ACCATAAATT CCATAAACTT TACCCGCGCA CAAATACAAA ATACAAACTT TACGGGCACG CTAATGATGG GTGCAGACTT TAGTGAAGCA CTGATCTCCG ACGCAGATTT CACCGACGCC TTCCTGACTG CAACGGTTCT CACCGACGCT ATAATAAGCG CAAACTTCAC GCGCGCATTT ATTGTGGAAG TGGATTTCTC CGGATTAAAT ATCGGCGGGA TAAACCTCAC CGATGCTCGA CTGCAAGCAG TGAATTTCTC CGGCGCTAAA GGTCTTACAC AGGAACAGGT GGATAGCGCC CAGGGCGACG GGCGGACGCG GTTGCCGGCG GGTCTGGTGC GGCCGGCGTC GTGGGGTCCG GAGGAGCCGC CAGTCGGGGG CGGCTGA
|
Protein sequence | MVDNGEARRR AGRRGLWIVA GIAAVGAVCA VVGIWHLPDR MYPPGTDGGA EARAALQGGL LTAAAALTAV AGGLIALDET RRANAEVRQA NANTHVRELY TAAIGLLSSD AIDSRLGGIY ALERIAWDSP ADQSTVVEVL SAFVREHARP LTDAPAGLPA EIRGRGGGGL RSRRRRGHAA GRRSEVRHRL PPWDRFIQIG PWSNEAPPPT DVQAALTVLG RLPDLGGFRA DLTGANLTGA ELEGANLFPA RLTRATFTGS HLGRVNLKYA QLYLTNFTDA TINSINFTRA QIQNTNFTGT LMMGADFSEA LISDADFTDA FLTATVLTDA IISANFTRAF IVEVDFSGLN IGGINLTDAR LQAVNFSGAK GLTQEQVDSA QGDGRTRLPA GLVRPASWGP EEPPVGGG
|
| |