Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0219 |
Symbol | |
ID | 5668644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 266560 |
End bp | 268047 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239148 |
Product | SCP-like extracellular |
Protein accession | YP_001504592 |
Protein GI | 158312084 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.20701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCAG AGCGTCCTGA ACGTGGCCGG GCTGCACCGT CCGGTCCGGG CTTGCGTGTA CGGCCGGGCT CGAATGCACG ACCGAGCGCG CATGCGCGGC CGGCCCTGCG TGCGCGGAGT TCGCGCGCCG CGGACCCGCC CGGCCCCGCG GGCCCGGTCC AGGACGTGGT CCGTGGCCGG CGGCACCGAA GACGTCCCAC GCCCAGGCGG GCGGGCGGGC TGCGCGCCGC CGCGGGGCTG GTGGCGCTGA CGCTGGTCAC GCTCGGGCTC GTGGCGGCCT CGGGCGAGGT CCCCTCGACG TCGAGCGCGG CGGGCCGTGG CCTCTCCGGG TCCAACAGGG TTGTCGCGGA CGGGTCCGTC GCGGGCGGGA CCGGTGCGGG CACCGCGGAT CACACCGCGG GCACGGATGC CGTCGGCGCA CCGCCTGGCA GTGCGTCGCG GAGCGGGCGG GGCGCCCTGG CGGGGTACGG CGGGTCGTCC GCCGATCCGG TGTGGCCGGC CGACCGGGGG CCGGTGAGCG GTGTGCTGGT TCTGCGGGCC GATCCCGCGC TGCTCGCGAT CATGGCCGCC GACGGGCAAC CGGGCTCGGC ACCGGGCGAG CCCTGGGTCC TCTACCTGCT CTCGGGCCCC GTGAACGTGA TCCGGTGGGC GATGGGCGAG CCGTTCGAGG CAAGCATCGA CACCCGGCAG CTGCCGAACG GCGACTACAC CCTGTCCGAG GTGATCTTCC GTGCGACGCA CGCCCCGCTG GTCCGCACCG GCCGGGTGCC CGTCGCGAAC CCCCTCCCGC CCGGGGCGGA CCAGACGGCT CCGGGCGGAG CCGCGCCGCG GTCAGCGGCG GCCGCCGCCG CCGATCCGGG CGACGGCACG GCCCCGGGTA CGCCGGCCGG CCCTGCCGGG TCGGCCTCCG CGGGAGTGTC CGGCTCCCCG ACGGGCGCCG GTTTGCCGTC GGGCGCCGAG CCGGCGGCGG GCGCGAGCAC CGCGGCGGGC GCGAGCACCG CGGCGGGCGC GGTGGCCGGG CCGGCACCGG CCCCCGTCGC TTCCGGCGCC CGGCTCGCCG GGACCCCGAC CGGCGCGGGC GGCGGTGCCG GGTCGGCGGC GACGGCCGCG CTGATCGAGG AGGTCGTCAC GCGGACCAAT GCCCAGCGCT CGGCCGCGGG CTGCCCGGCC CTCACGGTCG ACGCCCGCCT GGCGGCTTCG GCCCAGGAGC ACAGCGCGGA CATGGCGGCC CGGAACTACT TCGACCACAA CGGCCGGGAC GGGCGGTCGC CCTTCGACCG GATCGCGGCG GCGGGCTACG TCTTCTCGAT CGCGGCGGAG AACATCGCGG CCGGCCAGCG GACCCCGGCC GACGTCGTCG CGGACTGGAT GGCGAGCCCC GGCCACCGGG CGAACATCCT GAACTGTTCG CTCAGCCAGA TCGGTGTCGG GCTCGCCACC GGAGGTGACT ACGGCACCTA CTGGGTCCAG GATTTCGGCT CGCCGTAA
|
Protein sequence | MHSERPERGR AAPSGPGLRV RPGSNARPSA HARPALRARS SRAADPPGPA GPVQDVVRGR RHRRRPTPRR AGGLRAAAGL VALTLVTLGL VAASGEVPST SSAAGRGLSG SNRVVADGSV AGGTGAGTAD HTAGTDAVGA PPGSASRSGR GALAGYGGSS ADPVWPADRG PVSGVLVLRA DPALLAIMAA DGQPGSAPGE PWVLYLLSGP VNVIRWAMGE PFEASIDTRQ LPNGDYTLSE VIFRATHAPL VRTGRVPVAN PLPPGADQTA PGGAAPRSAA AAAADPGDGT APGTPAGPAG SASAGVSGSP TGAGLPSGAE PAAGASTAAG ASTAAGAVAG PAPAPVASGA RLAGTPTGAG GGAGSAATAA LIEEVVTRTN AQRSAAGCPA LTVDARLAAS AQEHSADMAA RNYFDHNGRD GRSPFDRIAA AGYVFSIAAE NIAAGQRTPA DVVADWMASP GHRANILNCS LSQIGVGLAT GGDYGTYWVQ DFGSP
|
| |