Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3483 |
Symbol | |
ID | 5671854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4141672 |
End bp | 4142793 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641242371 |
Product | hypothetical protein |
Protein accession | YP_001507791 |
Protein GI | 158315283 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0462629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.03688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGATC TCGCCGAGCT GAAGACCTTC GTCGTGGCAC ACGCGGTGTC GCAGGGGCTG CCCACCGGGC ACTACGACCC GCTGCTGGCC CGCATCCACC ATGACGAGGA CGGCGTCCCG GGCTCGTGGG CGTTCGAGTG GAGCGCGCTG GCCGACGGCC TGGCCGCCGA GGGCCGGCCG CTGGAGGCCT GCGTGCACTA CACGATGGCC CGGTTCCCGT TCGTCGACGG GCCGGCGCGG GCGCGAGCAC TCGAGCGGGC GACCGGGGAG TTCGCCCGCT GGAGCGCCGC GCACCCGGCG CTGCGCGGCC TGGACGTCGA GCTGCCCGCG GGGCGGGTGC GCTGCTGGAC GACCGGCCTG GACGCCCGCG ACGCCGGCGA CGCCGAGGAC CCCGCCGGGC CGCGGCCGCT GCTGGTCATG ACCGGGGGCA TCGTGTCGAC CAAGGAGCAG TGGGCGCCGG TGCTGCTGGG CCTGGCCGAG CTGGGCTTCG CCGGGCTGGT CACCGAGATG CCGGGCGTCG GCGAGAACAC GCTGCCCTAC CGGGCCGACA GCTGGACGCT GTTCCCCGCC CTGCTCGACG CGATCGGCCG GCCCGCCGGC ACCGCCGACG TCTACCTGCT GGCGCTGAGC TTCAGCGGTC AGCTGGCGCT GCGGGCCGCG CTGCACGACG ACCGGATCGC CGGGGTGGTG GGCGCCGGCG CCCCGGTGCG GGAGTTCTTC ACCGACACCG CCTGGCAGCG CCGGGTGCCC CGGGTCACCA CCGACACCCT GGCGCACCTG ACCCGGACCA GCGCCGACGA GGTCTACCCG ACCGTGCGGG ACTGGGCGCT GCGGGAGGAC GAGCTGGCGG CGCTGCGGAT TCCGGTCGCG CACGTGACCA GCCTGCGCGA CGAGATCATC CCGCCCGGCG ACGCGCGGCT GCTGCGCCGG TTGGTGCCGC GGATCCGGCT GCTCGCCCAT GACGACGTGC ACGGCGCGCC GTCGCACTTC GCCCAGACCC GGCTGTGGAC GCTGCTGTCG GTGCTGCGCA TGCACGGCGG CAACGCCCCG ACCCGGCTGG CACTGACCCG GCAGTTCGCC CGGCTGCGCT ACGCCGACCC GGCGGTGCGC TCCGCCGCCT GA
|
Protein sequence | MNDLAELKTF VVAHAVSQGL PTGHYDPLLA RIHHDEDGVP GSWAFEWSAL ADGLAAEGRP LEACVHYTMA RFPFVDGPAR ARALERATGE FARWSAAHPA LRGLDVELPA GRVRCWTTGL DARDAGDAED PAGPRPLLVM TGGIVSTKEQ WAPVLLGLAE LGFAGLVTEM PGVGENTLPY RADSWTLFPA LLDAIGRPAG TADVYLLALS FSGQLALRAA LHDDRIAGVV GAGAPVREFF TDTAWQRRVP RVTTDTLAHL TRTSADEVYP TVRDWALRED ELAALRIPVA HVTSLRDEII PPGDARLLRR LVPRIRLLAH DDVHGAPSHF AQTRLWTLLS VLRMHGGNAP TRLALTRQFA RLRYADPAVR SAA
|
| |