Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0008 |
Symbol | |
ID | 5668435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 12333 |
End bp | 14270 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641238936 |
Product | hypothetical protein |
Protein accession | YP_001504383 |
Protein GI | 158311875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACA GTCAGAATGA CCGGCCCGCC CGGTCGGCCG TCCACCGCGC ATCCCCGGCC CGTCCGGAGC AGCGCACCGC GGCCGTGGCC GGTGGCGCGC CTGTCGGTGG TACCGCCGCC GGGCGCGCCG CGGCTGGCGG TACTGCCGCC GGCGGTGCCT CTGTGGCTGG TTCCAGCCGG TCCGGCACGG CGAAGGCCGG GCCCGTGGGC GCGGCACCAG GCGTGGGTTC CGGTGCCGGT TCCGGAACTG GGATCGGCAA GGGAGTTCCC GGGACAACCG GCTCCGGAAC CGGCGTGAGC TCGGCGGCGG GGGGCGGTGT GGGTGGCCCC CGCTCGACAC CGGGGAAGGA CGACGGAGAG TCCAAGCGTG CGTTCCTCCC CGGCACCGGC CGTCCCGCCG ACGGGCGTCC CGGGACCGAT GGCCGTCCCG CTGACGGCCG CTCGGGGGAG GGCCGTCCGG CCGCGGGGGG CGGGGCTGGA TCGGGTTCAC GTGGTCCGTC CCGGCCCTCC TCCGGATCGG CGTCGCCCGG AAAGGGTGCC CCCGGATCGG CGTCGCCCGG CACCTCCGGA TCGGCGTCGC CCGCGCGTGG CGCGGGCGCA AGCGGTGCGG GCGCGGGCGG GGCGGCGAGC CCGTTCTTCC AGCGTCCCGG GCGGGACGAG AACGACCCGT CCGACCGGGA CGGTGCGGGT GGCCAGACCA GTCAGACGAC ACGCCTCGGC GTCGGCGGGA CGGCAGAGCC CAAGCTGCTG AGCGCCTCGG CGACCGTTCC GGGCTCGTCC TCGGCGGGCA CCGGCGGCGG GACGGACCGC GCCGCCCCCG CGCCGCGCGG CGGTGACTCC GACCCCGACG AGACATCACG CCAGCCAGCC GGCCGGCAGC CCGCCCGCGA CGCGGACGCG GCCAGGCCGG CGGGCGACGG CAGGAAGCCG ACCGAGGCGA CCCGCGGGGC CGGCGCGGGC ACCTCCAAGG ACGCCGCCCC GGCGGCTGGC AAGGGGGCAT CCCGAACGGG CACGCGAGCG GACGCCGCCC GACCGGACGC CGCTCGCCCG GATACATCCA GAGCCGAGAC GACGAGAAAG CCCGATGCGC AGACCACCGT CAGAAGCCCG GCGGCACGCC CCGGCACACC GGTCGACCCA CCCCGGGACA GCGACACGAT CTCGCTGGTG CGACCGAACC TGCCCAAGCG GGGCTCGAAG CCCCCGGCGG ATCGCGTCGG CGCGGACGTG AAGACCTCCC CGGATCGCGG CGCGGTCACC GACCGCATGC CCGCCGAGCG CCGCCCCACC CCTGAACCGG TGCGGGCAGC CACTCCGTCG ACCCGGCAGG CGCCGCTGGC GGGCCGCACA GCCCCCTACG ATCGCCCTGG GACGCCGCCA GGGCCACTGC CGACGTCCGG AGGTGTGGGC CCGAACGGTC TGTCCACCGA GCCGTTCGAC CGGGTCGACG ACGCCGACCA CGGCCGGCCC GGCGGTGCTC CTGGACCACA GGGTGGTCCC CCGCGGGGGC AGCAGCAGCC GGGCGGCCGT GAGCCGGGGC GGGACACCGC CGGCCAGGGC CCACGCCGGG GCCCCGCCGG TGGCCGGCGT GCCCGCCTGC GGGTCTCCAG GGTGGAACCG CTCTCGGTGA CCAGGCTCTC GTTCGCGTTC TCGCTGTGCG TCTTCCTGAT CATGATTGTC GCCGTGGCGG TGCTGTGGTT CGTGCTGAAC TCGATCGGGG TCTTCGACAG CGTCACCAAG GCCGCTGACA CCCTGACCGA CGGCACGAAC GCCAATGTCT CAGGCTGGCT GTCCTTCGGG CGGGCGATGC AGGTCACCCT GCTGGTCGGG GCGATCAACG TCGTCCTGAT GACGGCGCTG GCGACCCTGG GCGCACTGCT CTACAACCTC TGCGCGGACA TGATCGGCGG GCTCGAGGTC ACCTTGAGTG ACCAGTAG
|
Protein sequence | MSDSQNDRPA RSAVHRASPA RPEQRTAAVA GGAPVGGTAA GRAAAGGTAA GGASVAGSSR SGTAKAGPVG AAPGVGSGAG SGTGIGKGVP GTTGSGTGVS SAAGGGVGGP RSTPGKDDGE SKRAFLPGTG RPADGRPGTD GRPADGRSGE GRPAAGGGAG SGSRGPSRPS SGSASPGKGA PGSASPGTSG SASPARGAGA SGAGAGGAAS PFFQRPGRDE NDPSDRDGAG GQTSQTTRLG VGGTAEPKLL SASATVPGSS SAGTGGGTDR AAPAPRGGDS DPDETSRQPA GRQPARDADA ARPAGDGRKP TEATRGAGAG TSKDAAPAAG KGASRTGTRA DAARPDAARP DTSRAETTRK PDAQTTVRSP AARPGTPVDP PRDSDTISLV RPNLPKRGSK PPADRVGADV KTSPDRGAVT DRMPAERRPT PEPVRAATPS TRQAPLAGRT APYDRPGTPP GPLPTSGGVG PNGLSTEPFD RVDDADHGRP GGAPGPQGGP PRGQQQPGGR EPGRDTAGQG PRRGPAGGRR ARLRVSRVEP LSVTRLSFAF SLCVFLIMIV AVAVLWFVLN SIGVFDSVTK AADTLTDGTN ANVSGWLSFG RAMQVTLLVG AINVVLMTAL ATLGALLYNL CADMIGGLEV TLSDQ
|
| |