Gene Information |
Locus tag | Franean1_3773 |
Symbol | |
ID | 5672138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4472984 |
End bp | 4474201 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242654 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001508074 |
Protein GI | 158315566 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
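
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the final
    codon is a stop codon, which is not counted as an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4472984, 4474201)
    print(length, protein_length(length))
```

Under these assumptions the values agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields of the record.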
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGAG CGATCAGGGG CGATCAGAAC GAGGCGTCGG CGCCTCAGCA CAACATGATC ACCGAGTGGC TCGAGGCCGA CGCCGAGCCG CGGCCGTGGC GGAAGCCAAG ACGGCGGAAG GTCCTGTTCG CGCTGGCCAC GGTCCTGCTG ATCTCACTGA CCGCGCTGAC CGCGCTGCTC TGGCCCGGCA GTGAAGCGGA TGCCCCCCTG CCCAGTGCGA CCCCCGAGGA GTGGTCCCAC GCCGCCCCGC TCGAAAAGAC CCTCTACGGC CATACCGGCT GGGTGCGGTC GGTGGTGTTC TCCCCGGACG GGCGCACGCT GGCCAGCGGC TCCGACGATG ACACGGTGCG GTTCTGGGAC ATGTCCGACC CCACCGACCC CACCCCCCTC GGCAAGCCCC TCACCGGCCA CACCGACCCG ATAGCGGCGG TCGCGTACTC CCCCGACGGA CGCACCCTCG CGACCAGCTC CACCGACCGC ACGGTGCGGT TCTGGGACAT GTCCGACCGC GCCGACCCCA CCCCCCTCGG CAAGCCCCTG ACCCGCCACT CCACGGACCC CACGAGCTGG GTGGCCGCGA TGGCGTACTC GCCTGATGGC CGCACCCTCG CCACCGGCGG CCTCGACAAC ACGCTACGGC TGTGGGACGT CACCGACCGT ACCAACCCCA CCCCGCTCGG CCCGCCCCTC ACCGGCCACA CCAAAACAGT GGCGTTCCTG GCGTACTCCC CGGACGGGCG CACCCTCGCC ACCGGCTCCT ACGACCGCAC GGTGCGGTTG TGGGACGTCT CCGACCGCAC CAGTCCCGCC CCCCTCGGCC CACCCCTCAC CGGCCACACC AACCCGCTGA TGTCGCTCGC GTACTCCCCC GACGGGCGCA CCCTGGCCAC CAGCGCCTCC GACTACTCGA CGCGGTTCTG GGACGTCTCC GACCGCACCA GGCCCACTCC CCTGGGCGAA CCGATCTTCC TGGGCGACAG CCCGAGCGTC TGGTGGGTGG TGGGTCTGGC CTACTCCCCT GATGGGCACA CCCTCGCCAC CATCTCCGGC GACCGGCTGG TGCGCTTCTG GGATGTCACC GACCGCACCA GACCCACCCT CCGCGGTCAA ACCCTCACCG ACCAGACCGA CCGACTGGAG TCCGTGGCGT TCTCCCCGGA CGGACGCACC CTCGCGACCA GCTCCGATGA CGGCACGGTG CGGTTGTTGA CGCCGTGA
|
Protein sequence | MTRAIRGDQN EASAPQHNMI TEWLEADAEP RPWRKPRRRK VLFALATVLL ISLTALTALL WPGSEADAPL PSATPEEWSH AAPLEKTLYG HTGWVRSVVF SPDGRTLASG SDDDTVRFWD MSDPTDPTPL GKPLTGHTDP IAAVAYSPDG RTLATSSTDR TVRFWDMSDR ADPTPLGKPL TRHSTDPTSW VAAMAYSPDG RTLATGGLDN TLRLWDVTDR TNPTPLGPPL TGHTKTVAFL AYSPDGRTLA TGSYDRTVRL WDVSDRTSPA PLGPPLTGHT NPLMSLAYSP DGRTLATSAS DYSTRFWDVS DRTRPTPLGE PIFLGDSPSV WWVVGLAYSP DGHTLATISG DRLVRFWDVT DRTRPTLRGQ TLTDQTDRLE SVAFSPDGRT LATSSDDGTV RLLTP
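
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own procedure: `translate` and `CODON_TABLE` are hypothetical names, the amino-acid assignments are those shared by NCBI translation tables 1 and 11, and the special handling of alternative start codons in table 11 is ignored because this gene begins with ATG. The sequence is assumed to be given 5' to 3' on the coding strand, as listed in the record.

```python
from itertools import product

# Amino-acid assignments for codons enumerated in TCAG order
# (first base varies slowest). Shared by NCBI tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGACGAGAG CGATCAGGGG CGATCAGAAC"))
```

Applied to the first three 10-base groups of the gene sequence, this yields the first ten residues of the listed protein sequence; the final 18 bases translate to the closing residues followed by the TGA stop codon.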