Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6284 |
Symbol | |
ID | 5674603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7631856 |
End bp | 7633001 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245136 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001510532 |
Protein GI | 158318024 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACA ACGGGGGAGC ACGCCGAGTC GGTCGGCGGT GGCTGTGGAT CTGCGCTGGG ATCGCCGCGG CCGGCGCCGT GGCCGCAGTC GTGGGAATCT GGCACCTGCC CGACCGGATG TACCAGGGCG AGGAGGCACG GGCGGCGCTG CAGGGCGGCC TGCTGACGGC GGCCGCCGCG CTGACCGCGG TGGCCGGTGG TCTGATCGCG TTGGACGAGA CCCGGCAGGC TAACGCGGAG ACCCGGCGGG CGAACCGGGA TACCCACGTG CGGGAGTTGT ACGTGGAGGC GGCGAAGCTA CTCAACGACC CGGACAACCT CGGGGTCCGC CTGGCCGGGA TATACGCCCT GGAACGGATC GCGGTGGATT CCTCGGTGGA TCAGCGCACG GTCGTGGAGG TGCTCTCCGC GTTCGTGCGA ACCCGCAGCA CCGACTCCGC GCTACGCCCA CCATCACCCG CCGATAGTCA GGAATCACCA CTGGTGCGGC CGGCAGCGGA CATCCGTGCC GCCGTCCAGG TCCTGGGTCG CCTCCCGGCC CTCGATGGTG TCCCACGCTG CGACCTGGAC GGCGCGGACC TCACCGGTCC CGCCGGGCTC GGGGGCCTCG ATCTTTCTGA GGCCAACCTT CTGGGCGCCC AGCTGGCTGG GGCGGAACTT ACCTACGCCG TGCTGCACGG AGCGAATCTC ACTGGCGCCC GGCTGGACGG TGCGGACCTC ACCTCCGCCG TGTTGATAGG AGCGCACCTC GCTGGCGTCA AGATGGACGG GGCGAACCTT AGCGATGTCC GGCTGGTGGG TGCGGACCTG ACCTTCGCTC AGCTGGGCGG AGCAAACCTC ACCAACGCCT TCCTCGCCAT GGCCACCATG ACCTATGCCG TGTTGGAGGG AGCGCACCTC GGCGGCGCCC TGCTGGCCGG AACGAATCTC ACCGGCGCCC GGCTAGTTGG TGCAGATCTC ACCGGCGCCC AGCTGGTGAA TGCGGATCTC ACCGCTGCCC AGATGGAGGG GACGGATCTC ACCGGCGCCC GGGGCCTGGC GGCGGAGCAG GTGGCAGCGG CGTCCGGGGA TGCGCGGACG CGGTTGCCGG ACGGAGTGGA ACGGCCTGCG TCCTGGCCGC CGTACGAGCC GCCTCCGGAG CAGTAA
|
Protein sequence | MADNGGARRV GRRWLWICAG IAAAGAVAAV VGIWHLPDRM YQGEEARAAL QGGLLTAAAA LTAVAGGLIA LDETRQANAE TRRANRDTHV RELYVEAAKL LNDPDNLGVR LAGIYALERI AVDSSVDQRT VVEVLSAFVR TRSTDSALRP PSPADSQESP LVRPAADIRA AVQVLGRLPA LDGVPRCDLD GADLTGPAGL GGLDLSEANL LGAQLAGAEL TYAVLHGANL TGARLDGADL TSAVLIGAHL AGVKMDGANL SDVRLVGADL TFAQLGGANL TNAFLAMATM TYAVLEGAHL GGALLAGTNL TGARLVGADL TGAQLVNADL TAAQMEGTDL TGARGLAAEQ VAAASGDART RLPDGVERPA SWPPYEPPPE Q
|
| |