Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1580 |
Symbol | |
ID | 3903715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1894480 |
End bp | 1896276 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878917 |
Product | glycosyl transferase family protein |
Protein accession | YP_480685 |
Protein GI | 86740285 |
COG category | [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.47169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCC CGGCCGGCCC GCGGCTCTCG GTGGTCATCT GTGCCTATAC CGAACGGCGC CGGCACGATC TGCAGCGCGC GGTGACCAGC ATCGCCGAAC AGACCAGGAA GGCCGACCAG CTGATCCTCG TCATCGACCA CAATGACCGG TTGCGCCGGT GGGCCGAGGC TGCGTACCCC GGTGCCACCG TGATCCCCAA CACCGGCCGC CGTGGCCTGT CCGGTGCCCG CAACTGCGGC GTCGGGGCCG CCACCGGCGA CGTGGTCGCC TTCCTGGACG ACGACGCCCA CGCCGAGCCG GACTGGCTGG CCCAGTTGGC CGCCCACTAC ACCGACCCCC GGGTCGCCGG GGTCGGCGGT GCCGCCATGC CGGTGTGGCC GCACCGCCGA CCCCGCTGGT TCCCCCCCGA GTTCGACTGG GTGGTCGGAT GCAGCTACGT CGGGCTGCCG ACCGACACGG CTGCGGTCCG TAACCCGATC GGAGCGGGGA TGTCGTTCCG GCGGGCCGTG TTCGATCGCG TCGGCGGGTT CACCGAGGGG CTTGGGCGGG TCGGCACCAC ACCGCTGGGC TGCGAGGAAA CCGAGTTCGG TATCCGCCTT CAGGCGATCC TCCCGGACGC TGTCGTCCGC TACGAGCCAC GGGCACGGGT CTGGCACCAG GTAACCGGCG ACCGGGCCTC CCTGCGGTAC TTCCTGGCAC GCTGCCATGC CGAAGGCCTG TCCAAGGCCG CGGTAGCCGA CCGGGCCGGG GCGGACGCGG CGCTCGCCAC CGAACGGCAG TACCTGAGGC GGACGCTGCC CCGAGCACTG GCTCGTGACC GCCACAGCCT GGGCACCTGG CCGCGAGCCG GGGCCGTGCT GGTCGGCACC GGGTCCACGG CCATCGGCTA CGCACGCGGC CGGCTACGCC TCGCCGCCGG CCGGCGTGAC ACGCCACGTC AGCCGATCCC GTCGGAGGTA GCTGTGATCC CGATTCTGCT GTACCACTCG GTGACCGATT ATCCGGTGGC GAGCTATCGA CGTTGGACGG TCGATACCGC AACCTTTGTC CGCCATCTCA CCCTCATTGC CGGTTCCGGC CGGGTACCGC TGACGGTGTC CGAGTATGTC GAACGGAGGC GGCACCAGAC CCTGCCTCCG CGACCGGTCC TCATCACCTT CGACGACGGA TTCGCCGACA ACCTGGCCGC CGCTCGTGAG GTCGTCGCGC ACGGGCTCAC CGCGACCTGC TACGTCGTCA CCGACTGGAT CGGCCAGGTC GGCATGCTGC GCGGCGCCGA CCTGCGGACC CTGGCCGGCC TCGGGGTCGA GATCGGTGGT CACAGCCACA CCCATCCCCG GCTCGACGAG CTTCGCCCCG ACGAGGCGCG CCGGGAGATC AGCGACTGCA ACGCCCGGTT GACCGCTGCG ATCGGCGCAC CGGTGGGCTC CTTCGCCTAC CCCCACGGCA ATTACGACCA TGCCGTTCGG CGGCTCGTCG GGCAGGCCGG CTTCACGTCC GCCTGCGGGG TGCGCAACAT GATGTCGCAC GGCGCCGACG ACCCGTTCGC CCTTGCCCGA CTGACCGTGA CGGTCGACAC CCCGGACCGG CAGATCAGGG CGTGGCTGGA CGGGGCCGGC CGCGCGGCCC CCGCTCGGGA GTTGCTGCGT ACGCGGGGCA GCCGGCTCTC CCGTCGAACG AGGGCTCGGC TCCTCGGCCC GCGTCTACCC TACCGACCAA TCGTCACCGA TCTGCCCGTA CCAGCCGATC TGCCCGTACC AGCCGTCCCG GCGGCGTTCG GGGAGGTGCG GCCGTGA
|
Protein sequence | MTPPAGPRLS VVICAYTERR RHDLQRAVTS IAEQTRKADQ LILVIDHNDR LRRWAEAAYP GATVIPNTGR RGLSGARNCG VGAATGDVVA FLDDDAHAEP DWLAQLAAHY TDPRVAGVGG AAMPVWPHRR PRWFPPEFDW VVGCSYVGLP TDTAAVRNPI GAGMSFRRAV FDRVGGFTEG LGRVGTTPLG CEETEFGIRL QAILPDAVVR YEPRARVWHQ VTGDRASLRY FLARCHAEGL SKAAVADRAG ADAALATERQ YLRRTLPRAL ARDRHSLGTW PRAGAVLVGT GSTAIGYARG RLRLAAGRRD TPRQPIPSEV AVIPILLYHS VTDYPVASYR RWTVDTATFV RHLTLIAGSG RVPLTVSEYV ERRRHQTLPP RPVLITFDDG FADNLAAARE VVAHGLTATC YVVTDWIGQV GMLRGADLRT LAGLGVEIGG HSHTHPRLDE LRPDEARREI SDCNARLTAA IGAPVGSFAY PHGNYDHAVR RLVGQAGFTS ACGVRNMMSH GADDPFALAR LTVTVDTPDR QIRAWLDGAG RAAPARELLR TRGSRLSRRT RARLLGPRLP YRPIVTDLPV PADLPVPAVP AAFGEVRP
|
| |