Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2813 |
Symbol | |
ID | 5671202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3327822 |
End bp | 3329387 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641241722 |
Product | integrase domain-containing protein |
Protein accession | YP_001507142 |
Protein GI | 158314634 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAACA GCGCGTCCTG GATAGTTGAA TTCTATGACC TTTCAGTCAA TCTTCCTCAG GTTGAGGTTG AGGGGATCGC CGGGCTGGGT GATGTGCACG AGCGGGCGGT GCGTAACGGC GCGCGGCACG GCACGCCACT GATTATGAAA ACGTCGGGTA TGGTCGATTC CCGGGTTAAC CTGTTCTTCC GCACCGGCCC CATGGCCGCG GCGCGACCAT CGACGTGGCG GCGTTACGCG TACTCGCTGG TGGTCTGGCT GAACTTCCTG GAGGTGTTCG GGCGGCGCTG GGATGCGGCG ACTGCGGGCG ATGTGGAGGC GTTCAAGGAC TGGCGGGTGA CCGACCGGTC GAACGACGGC AGGGTCGCAC CGGCGAGTTT CGAGGCTGAC AGGGCGGCGC TGAACTGCTT CTACTCGTGG GCGGCGGTGC GCTACGGCGT GGTCAACCCG GTTCCCTCCG GCCGCTGGGT GCGTGCTGGA CGGCCAGGTG ACGACGTCGA TGGCCGGCGT GGGTCTCGGG ACCCGCTGCG CCCGGCGGGG GCGAGACGGC GGCAGGTGAA GTGGCTGTTG CGGACGGCGT TCGAGCAGTG GCGGGACATC GGCCTGCGCG GCTACGGCTT CGACGGCGTC CGGCGCCCGG ACTGGTGCGG CCCGTATGAG GACCGGGATG CCGCGTTCGT CGACGGCCTG TACGGGACGG GGCTGCGGCT GACCGAATGG GCGAGCATCC TCGACGTCGA GCTACCGGGC GAGGGTAACG ATCGTTTTCT CCGGGCGTGG CTGGCGGCGG CGTGCGCGAA GGGCGGCCGG CACGGGCGGG TCTACCGGAT ACCGCGTCGG GTGCTGACGG CGGTGCGCGG CTATCTGGAC CCTCAGGAGG GTTCACGGGG CGAGAGGGTC CGCTGGGCAC AGCGAGCCGG CCGCTACGAA CAGCTGACCG TCCGGAGGAT CGTCACCGGC TACAACCCCC GCTCCCGCAT CCTCACCTTG GAGGGCACGG CCGGGCCGGT CCGGATGTCG GTGGACGTGC TGGGCCCGCA CGAGCGCCGC CACTTGTTCC GCCGGACTGA CAACGGTCTG GAGCCGCTGG CTGTGTGGCT GGGACGCGAT GGGCTGCCGA AACATCCGCA TGGCTGGGAG GACACGTTCA CCGCCGCAAA CCGTCGCGTT ACAGGGGCGT GGATGGCCGC GGGCCGGGCG GGCAGCGCGC CATTGTGGTG CACGCCGCAT ATGTGTCGGC ACAGCTTCGC GCTGAAATGG TTCTCGATCC TGTCGATCGT GTGGGACAAC CGGCTGACGG GTTTCACTGC CGAGGAGCTG AAGGACTATC GCGCCCAGTT CGGTGATATC TGGTATCAGC TTGCCGGTCT GCTCGGGCAC GCCGATCCGT CGACGACCCG TGATCATTAT CTGGAACCGT TCACCCGTTT GGACGTCGAC TATCTGATGG CGCTGCTGGA CGGCGAGGAA CAGACGGCCG TCGACACGCT GCTGCGCGCG GTGGCCGCTG ACAGCGGACA CGTGCTCACG GGGACGGACC TGAGCGGCAC GGGGAGCGCC GGGTGA
|
Protein sequence | MENSASWIVE FYDLSVNLPQ VEVEGIAGLG DVHERAVRNG ARHGTPLIMK TSGMVDSRVN LFFRTGPMAA ARPSTWRRYA YSLVVWLNFL EVFGRRWDAA TAGDVEAFKD WRVTDRSNDG RVAPASFEAD RAALNCFYSW AAVRYGVVNP VPSGRWVRAG RPGDDVDGRR GSRDPLRPAG ARRRQVKWLL RTAFEQWRDI GLRGYGFDGV RRPDWCGPYE DRDAAFVDGL YGTGLRLTEW ASILDVELPG EGNDRFLRAW LAAACAKGGR HGRVYRIPRR VLTAVRGYLD PQEGSRGERV RWAQRAGRYE QLTVRRIVTG YNPRSRILTL EGTAGPVRMS VDVLGPHERR HLFRRTDNGL EPLAVWLGRD GLPKHPHGWE DTFTAANRRV TGAWMAAGRA GSAPLWCTPH MCRHSFALKW FSILSIVWDN RLTGFTAEEL KDYRAQFGDI WYQLAGLLGH ADPSTTRDHY LEPFTRLDVD YLMALLDGEE QTAVDTLLRA VAADSGHVLT GTDLSGTGSA G
|
| |