Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0745 |
Symbol | |
ID | 5669161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 870145 |
End bp | 871782 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239672 |
Product | carbohydrate-binding CenC domain-containing protein |
Protein accession | YP_001505109 |
Protein GI | 158312601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.556771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00012998 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATCAGA GCATCATCGA GGACGAGGCA AGGCCGGACG GCCCCGCACA GAATTCGCGT GGCGCGGGCC GCCGGGTGGG GCGGCGTGAC CGGCGCAGGC GTGACCGCGT GGTCCGAACC GCCAGACGCA AGATCCTGCT ATCCGCCCTC TGTCTCCTTG TCGTGGTCGG CGGGACGACC GGCGTGATCC TGTACTCCGC GCTCGGGGGC GACGATGCGG GTGGGAAGCC CATCCAGCTG GCCGACACCG AGCCGCGATA CCAGACCGGT GTGAACTTCC TTCCGAACCC TGGCTTCGAG AACACGGTCG CGGGCTGGTC GGCCGGGCCA TCCGAGAGCG CTCGCCTGTC CCCGACGGTG CCCGGCCGCA GCGGGTCCGG CGCGGCGATC GTCACGGCCA GGGCCCGGGT ACCCGCCATC ACCCTGACCG ACACCCCGGA CGCCGTGACC TGGACCGGCG GCAGGCTGAC CTATCTCGGC GCCGTCTGGG TGCGCACGAC CGCCCCGGGC ACCCGCGTCC AGCTGAAGCT GACCGAACTC GACGGTGCCC GCACGGCCGC ACGCGCCCTG ACCAACGTCA CCCTGAACGA TGCCACCTGG CGAAAGGTCG AGGTCTCCCT GCGGGCGACT GCCACGGGCC ACCGGCTGGA CTTCGCGGTC GTCGCCGACC GGCTTCCCGC CGGCCAGTTC CTCTACGCGG ACGATGCGTC CCTTCTCATC GGGAGGCCGT CATCGCCGGC CCCGACGGGG CCGGCGGCGA CCACGGGCGG CCAGCCGAGC GCGTCCCCGC CGGTGAGCGG CAGCCCGCGG CCTGGCACGA GCGCGACTCC CAGCGCGCCG ACGAGTGAGC GACCGCGCCC GAGCACGGCA CCGCCCGCGC CGGGTGGGTC GGGTATCACG CCGATCCCGG CCGGCATGCC CAACTCGAGC ACCACTGGCG TCCGGGCGGG GACGTCGCTG ACCCTCCTCT CCGGTGACCA GCGCATCACC CGCGCGGGAA CGGTGATCGA GAACAGGGAT GTGCGCGGGT GCATCCGAGT GGAGGCCGAC AACGTCGTCA TCCGAAACTC CCGGGTCTCC TGCCGGTCGT CGGGTTCCGC GGTGATCAAG AACCTCGGCC AGAACCTGCT GGTCGAGGAC GTCACCATCG ACGGGCAGGG CGCCTCGAAC TCCGGCATGT CGACCGCCGA CTTCACCGCC CGCCGGGTGA ACATCTCGAA CACGATCGAC GGCTTCTTCA TTAACGACAA CATCCTCATC GAGAACTCGT ACGTTCATCA CCTCGCCCAG ACCCCGTCCA GCCACAACGA CGCGGTACAG ACCACCGGCG GCAGCAACAT CGTGCTGCGT CACAACACCC TGATTCCCAT CAACGAGACG ACCGGCGAGT CGAGTAACGC CGGCTACATG TTCGGGCAGA ATATCGGGCC GATCTCGCGG GTTGTCTTCG ACGGCAACTT CATCAACGGC GGCGGCTTCA CCCTCAACGG TGCCGGTGAC GTGACCTTCG TGAACAACCG GTTCGGCCGC AACTGCCTGT ACGGGATCAA GGCGTTCAAG GGCAGCCAGG TGAAGTTCGA CCAGTCCAAC ATCTGGGCCG ACACCGGCCG TCCGGTGAAC GACGACCCCA AATGCTGA
|
Protein sequence | MHQSIIEDEA RPDGPAQNSR GAGRRVGRRD RRRRDRVVRT ARRKILLSAL CLLVVVGGTT GVILYSALGG DDAGGKPIQL ADTEPRYQTG VNFLPNPGFE NTVAGWSAGP SESARLSPTV PGRSGSGAAI VTARARVPAI TLTDTPDAVT WTGGRLTYLG AVWVRTTAPG TRVQLKLTEL DGARTAARAL TNVTLNDATW RKVEVSLRAT ATGHRLDFAV VADRLPAGQF LYADDASLLI GRPSSPAPTG PAATTGGQPS ASPPVSGSPR PGTSATPSAP TSERPRPSTA PPAPGGSGIT PIPAGMPNSS TTGVRAGTSL TLLSGDQRIT RAGTVIENRD VRGCIRVEAD NVVIRNSRVS CRSSGSAVIK NLGQNLLVED VTIDGQGASN SGMSTADFTA RRVNISNTID GFFINDNILI ENSYVHHLAQ TPSSHNDAVQ TTGGSNIVLR HNTLIPINET TGESSNAGYM FGQNIGPISR VVFDGNFING GGFTLNGAGD VTFVNNRFGR NCLYGIKAFK GSQVKFDQSN IWADTGRPVN DDPKC
|
| |