Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1271 |
Symbol | |
ID | 5669684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1528864 |
End bp | 1530831 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240203 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001505631 |
Protein GI | 158313123 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.424952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAGAC GTAGAGTAGA CGGACGACGT CACGGTCTGT GGACAACAGC GGTTGCGCGG CGCGTGTCGC GACTCAACCT CCAGCAACGG GTCGGTCTGC AGATCGTGCT GGACGTCTGC GCGCTCGCGC TCGGGTTCAT CGCCGCTCAG GTCGGCCGAC TCGACCTGGA CCCGGCCGCG CTCACCGATC CGGGCTTCTG GGTCATCGTC TTCCTCGCCG TGTGCCTGCT CCACTTCCTG GGCACGGCGC TGCACCTCTA CCTGGGCCGG TACCGGTTCG GCGGGTTCGA GGAGGTGTTC GGCATCCTGG TCGCGGTGGC TCTCACCGTC CTCGGGGTGC TCGTGGTGGT GCTGGCGGTC GGCGTGCCGC GGCCGGTACC GCTCAGCGTG CCGCCGCTGG GCGGGGCGGT CACCCTGGTG CTGATGCTCG GCATCCGCTA CCTGTGGCGG CTGGCCGAGG AGCGGCTGCG CCGGCCGGCC CCCGACGCCA CCGAGCCGCT GATCGTCTTC GGCGCGGGCG ACGGCGGGCA GCGGGTGCTC ACCGCGATGC TGCGCACGCC GAGCAGCCCC TACTACCCGG TCGCGCTGCT CGACGACGAC CCCCGGACCT GGAACCTGCA GCTGTCCGGG GTGCGGGTCC GCGGCGGCCG GGACGCGATC GCCGGCGTCG CCGCCTCCAC CGGCGCACGC ACCCTGCTCG TCGCGATCCC GAGCGCGGAC GCGGCGCTGC TGCGGGAGAT CAGCGCGCTG GCCGAACCGG CCGGTCTCGC CGTCAAGGTG CTCCCCCGCG TCGCAGACCT GGTGGACGGC ACCGTCGGCG TAGCGGACAT TCGCGATCTC GACCTCGCCG ACCTGCTCGG CCGCCGGCAG ATCCAGACCG ACATGACGGC CGCCGAGCGC TACCTCACCG GCCGCCGGGT CCTCGTGACC GGCGCCGGCG GGTCGATCGG ATCGGAGCTG TGCCGGCAGA TCCACGCCTT CGGGCCGGCC GAACTGATCA TGCTCGACCG GGACGAGTCG GCGCTGCGCG CCGTCCAGCT CTCGCTGCAC GGCCGGGCGA TGCTCGACGA CGACACGATC GTCCTCGGCG ACATCCGCGA CACCGAGCTC ATGGCCGCGC TGTTCGCCGC CCGCCGGCCC GAGGTCGTCT TCCACGCCGC GGCGCTCAAG CACCTCCCGC TGCTGGAGCG CTTCCCGGGC GAGTCGGTGA AGACGAACCT GTGGGGGACG CTGACCGTCC TGGAGGCCGC GGCCGCCTGC GGGGTGCGGC GCCTGGTGAA CATCTCGACC GACAAGGCCG CCAACCCGAG CAGCGTGCTC GGCCACTCCA AGCGGATCAC CGAGCGCCTC ACCGCGCACG TCGCGGGCCA GGCGCCGGGG GTGCTGGTCA GCGTCCGCTT CGGCAACGTG CTCGGCAGCA ACGGCTCGGT GCTGACCGTC TTCGCCGGCC AGCTCGCCGC GGGCGGGCCA CTGACGGTCA CCCACCCGGA GGTGACCCGC TACTTCATGA CCATCCAGGA GGCCGTCCAG CTCGTCCTGC AGGCCGGGGC GCTGGGCTCC GCCGGCGAGG CGCTGGTCCT CGACATGGGG GAACCGGTGC GCATCGCGGA CGTCGCCCGT CGCATCGCGG CCCGCGCGCC CGCGCCGGTG GACATCGTCT ACACCGGGCT CGGGGCCGGC GAGAAGCTGC ACGAGGAACT GCTGGGCGCC GGCGAGTGGG ACTCCCGGCC GCGGCACCCG CTGATCTCAC AGGTACCGGT ACCGCCGCTG GACCCGGCCG CCGTCCGGGA CATCGACCCG TACGCGGCAC CGGATCTGAT CCGGGCCACG CTGACCCGGC TGGCCGCCGA ACAGCCCATG CCGAACGTGC CGCGCCAGAC CAATCCAGGT CAGGACGAGC CGCGCCAGAC CGGGCCGCGT CAGGACGGGC CGCGTCAGGA CGGACCGACC GAGGCGCGGA CCGGGTGA
|
Protein sequence | MWRRRVDGRR HGLWTTAVAR RVSRLNLQQR VGLQIVLDVC ALALGFIAAQ VGRLDLDPAA LTDPGFWVIV FLAVCLLHFL GTALHLYLGR YRFGGFEEVF GILVAVALTV LGVLVVVLAV GVPRPVPLSV PPLGGAVTLV LMLGIRYLWR LAEERLRRPA PDATEPLIVF GAGDGGQRVL TAMLRTPSSP YYPVALLDDD PRTWNLQLSG VRVRGGRDAI AGVAASTGAR TLLVAIPSAD AALLREISAL AEPAGLAVKV LPRVADLVDG TVGVADIRDL DLADLLGRRQ IQTDMTAAER YLTGRRVLVT GAGGSIGSEL CRQIHAFGPA ELIMLDRDES ALRAVQLSLH GRAMLDDDTI VLGDIRDTEL MAALFAARRP EVVFHAAALK HLPLLERFPG ESVKTNLWGT LTVLEAAAAC GVRRLVNIST DKAANPSSVL GHSKRITERL TAHVAGQAPG VLVSVRFGNV LGSNGSVLTV FAGQLAAGGP LTVTHPEVTR YFMTIQEAVQ LVLQAGALGS AGEALVLDMG EPVRIADVAR RIAARAPAPV DIVYTGLGAG EKLHEELLGA GEWDSRPRHP LISQVPVPPL DPAAVRDIDP YAAPDLIRAT LTRLAAEQPM PNVPRQTNPG QDEPRQTGPR QDGPRQDGPT EARTG
|
| |