Gene Information |
Locus tag | Francci3_0930 |
Symbol | |
ID | 3906094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1087983 |
End bp | 1089173 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878264 |
Product | glycosyl transferase family protein |
Protein accession | YP_480043 |
Protein GI | 86739643 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
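The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1087983, 1089173)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Both results match the Gene Length (1191 bp) and Protein Length (396 aa) fields listed in the record.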
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.72237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCG TGCTGTTCGT GGTGCCGCCC CTGACCGGGC ATGTCAACCC GGCGGTCGGG GTCGCCGCCG AGCTGGCCGC CCGTGGTCAC GAGGTGGCGC TCGCCGGGTA CGCGGGCGTC ATCGGATCGT TGATCCCGCC GGAGCTGGCC CTGCTGGCGT TACCCGAGGC GGGCCTGGGC GAGAAGTGGT CCCGGATCCA GGACGCGTCT CGGGCGCTGC GGGGGCCGGC GTCGCTGAAG TTCCTGTGGG AGGACTTCCT GCTCCCGCTC GGCTCGATGA TGGCCCCGGC GATCGACAGG ATCATCGACG ACTTCCAGCC CGACCTGCTC GTCATCGACC AGCAGGCGGT GGGCGCCGCG CTCGTCGCCC GTCGCCGGGG CCTGCGCTGG GCGACCCTCG CGGCGACCTC CGCCGAGTTC GACAATCCTT ACGGGGTGCT CGCGGGCCTG GGGCAGTGGG TCGTCGACCG CCTGCGGGAG TTCCAGACCG GGCACGGAGT GCCGCCCGAC GAGGCTGCCA TCGGGGACCT GCGCTTCTCC GAGGCGTTGA CCCTCGTCTT CTCCGTCCGG GAAATGCTGC ACAATCCCGG GATCCCGGAC TACGCGGTCT TCGTCGGCAG CGCGGTCGGG AAACGGGCCG GGGCGGGCGA GTTCCCCTGG GACTGGCTCG ACCCGGCCCG CCGGGCGGTG CTCGTCTCCC TCGGCACGGT GACCCGGGAG GCCGGCGGTC GGTTTCTGCG GGCGGCGGCC GAGGGGCTGC TCGGCCTGCC CGAGCGGGTA CAGGCGATCG TCGTCGCCAC ACCCGGGCTC GTTGACGACC TCGCCGCCGC GGCGCCCGAC GATCTGCTGG TCGCGCCGTT CGTGCCGCAG GTCGCGTTAC TTCCGCGGTT GTCGGCGGTG GTGTGCCACG CCGGCAACAA CACCGTCTGT GAGTCGCTGT CACACGGGGT TCCGCTGGTG GTCGCCCCGG TTCGCGACGA TCAGCCGATC ATCGGCGAGC AGGTCGTCCG TAACGGGGCC GGGGTGCGCG TCAAGTTCGG TCGGGCCGGA CCCACCGCCG TGCGTTCCGC GGTGACGGCC GTGCTGGACG ATCCGTCGTA CCGGGCCGCC GCCGCCCGGA TGCGGGCCGC CTTCGCCGCG GCCGGCGGGG TGGCCGCCGC CGCCGATCAC CTTGAGAAGC TGGCGGTCTG A
|
Protein sequence | MSRVLFVVPP LTGHVNPAVG VAAELAARGH EVALAGYAGV IGSLIPPELA LLALPEAGLG EKWSRIQDAS RALRGPASLK FLWEDFLLPL GSMMAPAIDR IIDDFQPDLL VIDQQAVGAA LVARRRGLRW ATLAATSAEF DNPYGVLAGL GQWVVDRLRE FQTGHGVPPD EAAIGDLRFS EALTLVFSVR EMLHNPGIPD YAVFVGSAVG KRAGAGEFPW DWLDPARRAV LVSLGTVTRE AGGRFLRAAA EGLLGLPERV QAIVVATPGL VDDLAAAAPD DLLVAPFVPQ VALLPRLSAV VCHAGNNTVC ESLSHGVPLV VAPVRDDQPI IGEQVVRNGA GVRVKFGRAG PTAVRSAVTA VLDDPSYRAA AARMRAAFAA AGGVAAAADH LEKLAV
|
| |
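The gene and protein sequences above can be cross-checked with a short sketch that strips the 10-base grouping, computes GC content, and translates codons. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ only in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); assumed valid for table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAGCCGCG TGCTGTTCGT GGTGCCGCCC"
print(translate(prefix))  # MSRVLFVVPP
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow comparison against the Protein sequence and GC content fields of the record.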