Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3896 |
Symbol | |
ID | 5672257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4660545 |
End bp | 4661762 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242775 |
Product | hypothetical protein |
Protein accession | YP_001508192 |
Protein GI | 158315684 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.103005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCACG GTGGTGACGG GACGGGCCAG CACGCGGACG AGGGCGACAT CCCCGCCCAC CGCTCACCGT GGGCACGGCC CGCATCGGGC GAGTGGCCCG CATCGGGCGA GTGGCCCGCG CCGGGTGGCG ACCAGCCGAC GATGGACCTG CACCACCGCG CCGGCGCGGC GGCCGCCACC GGCCCGCGCT CGGGGCAGCC GGCCCCCCCG CCCCCCGCCG GGGCGTTCCA ATCGGGCCGG CCGGCGTGGC CGGCGGCCGG CGGAACGGGG GGCACGGACC GGTGGGAGGA GAACCACCAG CCCGACGAGC CCGACGAGCC CGGCCGCCCG CCGATCCCTC CGACGATGGC CTACGGATCC GCCCCGGACG ATCACCGGCT GCTGGCCGGG ACGACCCCGT CGGCCGGTAC GACTCCGGCG CGCCGGCGCC GGCGCCGGCG CCGGTGGCTG TGGGCCCTGG TCGCGGCGGC CGTCGGCTTC GTGCTGCTGG TGGTGGGTGA CCGGGCCGCG GTCTCGGTCG CCGAGGGCCA GATGGCCAAG CAGATCAAGG TCAGTGTGGC GGAGAGCCTG GAATGCGGTG CCCCCGCGCC GACCGTGCGC GACGTGCACA TCGGCGGCTT CCCCTTCCTC ACCCAGATCC TGCTCGGCAA GTTCAAGGAG ATCGGGGTGA CGATCGAGGG AATCGCCACC CCGGGGCCCC GGATCTCCGC CGTGCAGGCG CAGCTGTCCG GCATCCACGT GCCGCTCGGG GACATGATCT CCGGGTCGGT CGGCGCGGTG CCTGTTGACG ACATCCGGGC GACTGTCCGG CTTGACTACG CCGACCTCAA CACCTACCTG GCCGGGCTGC CCGGCGCCCT CCAGGTGAAC CCGGTCGACG GCGGGCGGCG GGTCGAGATC TCCGGGCGCA CCGACCTGTG GCTGTTCGGC TCACAGGAGA TCGGGGGCGT CACCACCTTC GAGGTCCGTG ACAACGTGCT CACGCTCGTC CCCAGTGAGG TGACGCTGCG CGGGGCCATC AACGCCACGA TCCCCGTGCC CGTGGGCGGC CTGCTGCCCC CGATCAAGAT CCCGGTCGGC CAGCTGCCGC TGGATCTCGA CATCGTCGAG GCGTCGACGG GCGGGTCCGG GCTGTCGCTG ACCGCCGCCG CCCACGACGT CGTCCTGCCC GCGGCGGAGC AGCCCGCGCC GCGTCAGTGC CCGCCCGGCA ACACCTGA
|
Protein sequence | MNHGGDGTGQ HADEGDIPAH RSPWARPASG EWPASGEWPA PGGDQPTMDL HHRAGAAAAT GPRSGQPAPP PPAGAFQSGR PAWPAAGGTG GTDRWEENHQ PDEPDEPGRP PIPPTMAYGS APDDHRLLAG TTPSAGTTPA RRRRRRRRWL WALVAAAVGF VLLVVGDRAA VSVAEGQMAK QIKVSVAESL ECGAPAPTVR DVHIGGFPFL TQILLGKFKE IGVTIEGIAT PGPRISAVQA QLSGIHVPLG DMISGSVGAV PVDDIRATVR LDYADLNTYL AGLPGALQVN PVDGGRRVEI SGRTDLWLFG SQEIGGVTTF EVRDNVLTLV PSEVTLRGAI NATIPVPVGG LLPPIKIPVG QLPLDLDIVE ASTGGSGLSL TAAAHDVVLP AAEQPAPRQC PPGNT
|
| |