Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6526 |
Symbol | |
ID | 5674841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7933863 |
End bp | 7935599 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245374 |
Product | hypothetical protein |
Protein accession | YP_001510769 |
Protein GI | 158318261 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02231] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.122318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTCCA CGGCGGACGG CACACCTGAC CTTCCTGACG CACCGGCGGG CGCACCGACG GGCGCGCCGC CGTCGGTGCT CCTCGACGCG CCGATCAGCG CCGTGGTCGT CTATCCCACC GCCGCGCGCG TGACCCGCCG TGGCCGGTTC GATGTCGCGA CGCTCGGCTC GCCGTCCGAG CGGGTCGAGG TGACTCTCAG CGGGCTGCCG CTCGAGCTCG ACGAGGACTC GGTGCGGGTC AGCGGCACCG GCTCCGCGCG GGTGCTCGGG GTGCGGGTCG TCTCCCAGGC CCGGGCTACC CCCGACGCCG GTGCCCTCGC CGACCTGCGC GCCCGCCAGG CCGAGCTGCG CGCGCACCTG GCCGAGATCG GCGACGAGGA CGAGACCGAG CGGGCCCGTC GCTCCTTCCT CGACGTGGCC GCCCGGGTGG GAGCCGGCGC CCTCGCGCGC GGCTGGGCAC ACGAAGGCAC CGACCGCGAG GGCGACGGGC CCGGGGACGC GGCAGCCCGG CTGGTCACTG TCGGCGACGC GTTCGCGGCG CAGATGTCGG CCGTCCACGC GCGCCGGCGG GCGCTCGCCG AGCGGCGGAA GGAGACGGAA CGGGAGCTGA CCGTGCTCGG CCGGGTCATC GAGGCCCGCC ACGCGCGGCC CGAGCCGGAC ACCCGCGCGA TCGTCGTCGA CCTCGAGCCG CCGACGGCCA CCGAAGGCCC GGCGGAGGGC GTGCTGGTCG ACAGGACAGC CTCGCTGGTG GACCTGGAGG TCTCCTACCT GGTCCGTTCC GCCTCCTGGA GATCCGGCTA CGACGCACGG CTGGACGGCG AGCGGGTCAC CCTGACCTGG TTCGCGATGA TCAGTCAGCG CACCGGCGAG GACTGGCCCG TGACGGACCT GCGGCTGTCC ACGGCCCGCC CGTCCAGCGG CGTCGACCTG CCCGAGCTGA GCCCGCAGTA TGTGGACATC GCCCGCCCGC GGGTCCTTCC CCGCGGGCGG GCCAAGGCGT CCGGCGAAGG CAGGGGCGGT GGCGGCGACG GCATGACGAC CTTCATGCCG GCGGCGATGG CGGCGCCGGC ACAGGCCCCG CCGCCGATGG CTGCCGCCGA GGCCACGCTG GAGTCGGCCG GCCCCGCGTC GACCTACCGC CCGCCGCGGC CGGTGGCGGT ACCCGCGGAC GGGGACCCGC ACCGCACCAC CGTGGCAGTG ATCGAGCTCG ACGCCGTTCT CGACCACGTC ACCGTGCCCA AGCTCGCCGC CGAGGCGCTG TTGCGTGCCG CGGTCGTGAA CACCTCGTCG CACACGTTGC TGCCAGGTAA GGCGTCGGTC TTCCACGGCC CGGAGTTCGT CGGCACCACG CGCCTCGAGC TCGTCCCGCC CGGCGGGGAG ATGGAGCTTC GGCTCGGGGT GGACGACCGC ATCAGGGTGG AGCGCGAGCT GGTCAGCCGG GTCACCGGCC GGCGGGTGGT CGGCAACACC CGGCGGACGG ACGTCGTCCA CCGCACCACC GTCACCAACC ACGCGCCGAT GCGGGCCCGG GTGACCGTGC GGGACCAGGT GCCGGTCTCC CGGCACGAGA ACATCCAGGT CAGGGAGGTC GTGGCCGCTC CCGCCGCCAC CGAGCACACC GATCTGGGGC TGCTTACCTG GGAGCTGGAG CTCGAGCCGG GCTCGAGCCG GGAGATCACG CTGTCCTACC GGCTCGAGCA CCCTCGCGGC GTGGAGATCA CCGGCTGGGG CGACTGA
|
Protein sequence | MMSTADGTPD LPDAPAGAPT GAPPSVLLDA PISAVVVYPT AARVTRRGRF DVATLGSPSE RVEVTLSGLP LELDEDSVRV SGTGSARVLG VRVVSQARAT PDAGALADLR ARQAELRAHL AEIGDEDETE RARRSFLDVA ARVGAGALAR GWAHEGTDRE GDGPGDAAAR LVTVGDAFAA QMSAVHARRR ALAERRKETE RELTVLGRVI EARHARPEPD TRAIVVDLEP PTATEGPAEG VLVDRTASLV DLEVSYLVRS ASWRSGYDAR LDGERVTLTW FAMISQRTGE DWPVTDLRLS TARPSSGVDL PELSPQYVDI ARPRVLPRGR AKASGEGRGG GGDGMTTFMP AAMAAPAQAP PPMAAAEATL ESAGPASTYR PPRPVAVPAD GDPHRTTVAV IELDAVLDHV TVPKLAAEAL LRAAVVNTSS HTLLPGKASV FHGPEFVGTT RLELVPPGGE MELRLGVDDR IRVERELVSR VTGRRVVGNT RRTDVVHRTT VTNHAPMRAR VTVRDQVPVS RHENIQVREV VAAPAATEHT DLGLLTWELE LEPGSSREIT LSYRLEHPRG VEITGWGD
|
| |