Gene Information |
Locus tag | Franean1_0494 |
Symbol | |
ID | 5668913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 574870 |
End bp | 576030 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641239423 |
Product | WD40 domain-containing protein |
Protein accession | YP_001504861 |
Protein GI | 158312353 |
COG category | [R] General function prediction only |
COG ID | [COG3211] Predicted phosphatase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
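
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information rows above.
start_bp = 574870
end_bp = 576030
gene_length_bp = 1161
protein_length_aa = 386


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codon count, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print("length matches record:", computed_length == gene_length_bp)
print("protein length matches record:", computed_protein == protein_length_aa)
```

Under these assumptions both checks agree with the record: 576030 - 574870 + 1 = 1161 bp, and 1161 / 3 - 1 = 386 aa.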
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGGC GAACGTTCTT GCGTTCCGCT ATCGCGGGAA CCGGCGTGGT CGCCTTCTCC GGGGCGATAT GGGACACGGC CCTGGCCGCT CCCGCCCAGA ACGGCTCCAG CCCCTATGGA TCGCTGTTGG CGGCTGACGC CAACGGGGTC ATGCTCCCCT CCGGTTTCAC TAGCCGCATT GTCGCCCGTT CTGGTCAGGT GGTTTCCGGA ACCAGCTACA CCTGGCACAA CGCCCCGGAC GGCGGCGCGG TTTTCCTGAA CGGCACGGGT TGGATGTACG TCTCCAATTC GGAGGTCGGC AGCAGCGCGG GCGGGGCTTC GGTGCTGCGC TTCGACTCCA GCGGAACCGT CACCTCCGCC CAGCGCATTC TTTCAAACAC GAGCAGCAAC TGTGCGGGTG GGGCGACTCC GTGGGGCACG TGGCTGTCCT GCGAGGAGAC CTCCAACGGG CGGGTGTGGG AGACCTATCC GGCCACCGGC GCGTCGGCTG TCAGCCGGCC GGCCATGGGC CGTTTCAAGC ACGAGGCGGC TGCGTGCGAC CCGGTCCGCC AGGTCATCTA CCTGACCGAG GACCAGACTG ACGGCTGCTT CTACCGGTTC CGTCCGACCA CCTGGGGAAA CCTCTCCTCC GGCACGCTCG AGGTGTTGGT TGCGGGATCT GGAACATCCG GTACGGCAAC CTGGCAGGTC GTCCCGGACC CGGACGGTTC ACCGACCGCC ACCCGCAGCC AGGTTTCCGG AGCGAAGCAC TTCAATGGCG GCGAGGGCTG CCACTACGCG AACAACACCG TCTGGTTCAC CACCAAGGGC GACAACCGGG TGTGGGAGGT TCACGTCGAC ACCAACACAT TCGAACTTGC ATATGACGAC TCGCTGGTAT CTCCTGGACC TGCCCCGCTG ACCGGTGTCG ACAACATCAC CGGATCGACG TACGGGGACC TTTACGTTGC TGAGGATGGC GGGAACCTGG AGATCTGCAT CATCACTCCG GACGACATCG TGGCCCCGAT CCTGCGGCTG GTCGGGCACA ACTCATCCGA AATAACCGGA CCAGCATTCT CCCCGAACGG CCAGCGGCTG TACTTCTCGT CCCAGCGCGG CACGTCCGGC TCGTCCTCCG GCGGCATCAC CTTCGAGGTC ACCGGTCCGT TCCGGACCTG A
|
Protein sequence | MERRTFLRSA IAGTGVVAFS GAIWDTALAA PAQNGSSPYG SLLAADANGV MLPSGFTSRI VARSGQVVSG TSYTWHNAPD GGAVFLNGTG WMYVSNSEVG SSAGGASVLR FDSSGTVTSA QRILSNTSSN CAGGATPWGT WLSCEETSNG RVWETYPATG ASAVSRPAMG RFKHEAAACD PVRQVIYLTE DQTDGCFYRF RPTTWGNLSS GTLEVLVAGS GTSGTATWQV VPDPDGSPTA TRSQVSGAKH FNGGEGCHYA NNTVWFTTKG DNRVWEVHVD TNTFELAYDD SLVSPGPAPL TGVDNITGST YGDLYVAEDG GNLEICIITP DDIVAPILRL VGHNSSEITG PAFSPNGQRL YFSSQRGTSG SSSGGITFEV TGPFRT
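
The GC content and Protein sequence fields can in principle be recomputed from the Gene sequence row. The following is a minimal sketch using only the Python standard library, not the database's own method. It assumes the sequence is pasted in the 10-base blocks shown above, and it relies on translation table 11 using the same amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the Gene sequence row, for a quick spot check.
prefix = "ATGGAACGGC GAACGTTCTT"
print(translate(prefix))  # compare with the start of the Protein sequence row
print(round(gc_percent(prefix), 1))
```

Passing the full Gene sequence row to `gc_percent` and `translate` allows a comparison with the 65% GC content and the 386 aa protein listed in the record.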