Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5076 |
Symbol | |
ID | 5673411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6076136 |
End bp | 6077092 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641243927 |
Product | hypothetical protein |
Protein accession | YP_001509341 |
Protein GI | 158316833 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.839992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0147635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCG TTGGTGCGAC GGTCCGTGAC TGCGTCTCCG TTCTCGAAGC ATGCTTCCCG CCGGCCTGGG CGGAGTCGTG GGACGCGGTG GGCCTCTCGG TCGGTGACCT GGACGCCCCC GTGACCGACG TCCTGTTCGC GGTCGACCCG ACGCCCGCCG TCGCCGCCGA GGCGGCGAGT GGCTCCCAGC TGCTGGTCAC GCACCACCCC CTGTTCCTGC ACGGCGCGCA CACGGTCGCC GAGACGACCG CCGGCGGGCG AGTGGTGGCG ACCCTGGTCC GGGCCGGTGT CGCGCTGTTC ACCGCGCACA CCAACGCCGA CGTGGCCGAC CCGGGCGTCA GCGACGCGCT CGCCGCTGCG CTCGGCCTGC GAGAGGTCCA CCCGCTGGCA ACGGGATCGA CCGGCGCGGG GGCCCGCGCC GACGCGCGGG CCCGCGGGCG GACCGACGCC GGCACCGAGT GCCGCGGCCT GGGCCGGGTC GGCGTCCTGC CGGCCCCCGA GCCGCTCGGC GCCTTCTGCG AGCGGGTGGC GCGGGCCCTG CCGGCGACCG CCGGCGGCGT GCGGGCGACC GGTGCGGCGG ACCGGCGGGT CCACCGGGTG GCCGTGTGCG GGGGCGCGGG CGGCGAGCTC GCCGGCGCCG CCGCGGCGGC CGGGGCGGAC GTCCTGGTGA CCGCGGACGG CCGGCACCAT CACACCCTCG ACGTGGTCGG TGCCCACCCG CTCGACGTCG TCGACGTGGC GCACTGGGCC AGCGAGTGGC CCTGGCTCGC CGGCGCCGCC GACCGGCTGC GCGCCGGCCT CGCGGCCCGG GGACGTACGG TGAGCACCTC GGTGTCCACA CTCGTCACCG ATCCCTGGCA GCTGCATGTG AGCGCCGCGC CCACGTGGCG CGAGCCCGCC ACCGGCCTGC CAGTCCCATC CCATGCGCCC GGCGCACGCC GGGAAGAAAG GCTCTAG
|
Protein sequence | MTSVGATVRD CVSVLEACFP PAWAESWDAV GLSVGDLDAP VTDVLFAVDP TPAVAAEAAS GSQLLVTHHP LFLHGAHTVA ETTAGGRVVA TLVRAGVALF TAHTNADVAD PGVSDALAAA LGLREVHPLA TGSTGAGARA DARARGRTDA GTECRGLGRV GVLPAPEPLG AFCERVARAL PATAGGVRAT GAADRRVHRV AVCGGAGGEL AGAAAAAGAD VLVTADGRHH HTLDVVGAHP LDVVDVAHWA SEWPWLAGAA DRLRAGLAAR GRTVSTSVST LVTDPWQLHV SAAPTWREPA TGLPVPSHAP GARREERL
|
| |