Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1051 |
Symbol | |
ID | 3905297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1248968 |
End bp | 1250305 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878385 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_480162 |
Protein GI | 86739762 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.876229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CGAGCAATCG GCGCGGCCCC GGCGCGACGA TCCCGCCGAC CCCCGCGGGC CTCGTCGTGC CGCCGGCCGT CCCGGTCCCG CCGGGCCTCT TGGCCGCGCA GGTGGGAGCG CCCGGGATCG AGGCCGCGCC GCACCGGCCC TCCCTCGCCG AGCTGGTCGA GACTTCGGGG CTGCGCCGCG TGCACATGCT GGCCTGGCGT GACCTCGACG ACCCCGAGTC GGGCGGGTCG GAGCTGCACG CCGACAAGGT CGCGGAGCTG TGGGCGCAGG CCGGCATCGA CGTGAGCCTG CGCACGGCCG TGGCGTCCGG CCATCCGGAA TCCGTCCACC GCAACGGCTA CACGGTGGTC CGCAAGGCGG GCCGGTACTC GGTCTTCCCG CGCACGGCGG TGTCCGGCGC GCTGGGCCGC GGCGGGCCCT GGGACGGCCT GGTCGAGATC TGGAACGGGA TGCCGTTCTT CTCTCCGGTC TGGGCCCGCT GCCCCCGGGT GGTGTTCCTG CACCACGTGC ATGCCGAGAT GTGGCGGATG GTGCTGTCCC CGAAGCTGGC CAGGATCGGC GAGACCGTCG AATTCAAGAT CGCACCACCG CTGTACCGGC GTACGCGCAT CCTGACGCTG TCCCCGTCGT CCCGGCACGA GATCATCGAC CTGCTCGGCC TGCCGCCGCG CAACATCTCG GTCGTGCCGC CGGGTATCGA CCCGTCCTTC TCCCCCGCCG GCGAGCGCTC GCCGCATCCC CTCGTCCTCG CTGTCGGGCG TCTGGTGCCG GTGAAACGGT TCGACGTCCT CATCGACGGG CTCGTCCACG CCCACGACGA ACATCCGACG ATGGAGGCGG TCATCGTCGG CGAGGGCTAC GAGCGGGTGG AGCTGGAGAA GCGGATCTCC GCTGCCGGAG CCGGCGGCTG GCTGCGCCTC GTCGGGCGGG TCGACGACGA CGCCCTGCTG ACGCTGTATC GGCGCGCCTG GGTGCTGGCC TCGGCCTCGG CCCGTGAAGG CTGGGGCATG ACGATCACCG AGGCCGCCGC CTGCGGCACG CCGTCGGTCG CGACGAAGAT CGCCGGACAC ACCGACGCGG TCGTCGACGG CGAGACCGGC GTGTTGGTGG AGGATCCGGC CGACCTGGGC AAGACACTGG CCGGCGTGCT GACCGATCAC GATCTGCGCG CCCGCCTGTC CGCCGGGGCG CTGGCGCATG CGGCGACCTT CACCTGGGCG CAGACGGCTC GCTCGACGTT CGCGGCGCTA GTCCGGGAGG CGGCCCGGCA TCAAGGCCGC CGCTCCAGCG CGGCCCGGGC CGCGGACCTG GTTGGGCCGC ACCGGTGA
|
Protein sequence | MSAASNRRGP GATIPPTPAG LVVPPAVPVP PGLLAAQVGA PGIEAAPHRP SLAELVETSG LRRVHMLAWR DLDDPESGGS ELHADKVAEL WAQAGIDVSL RTAVASGHPE SVHRNGYTVV RKAGRYSVFP RTAVSGALGR GGPWDGLVEI WNGMPFFSPV WARCPRVVFL HHVHAEMWRM VLSPKLARIG ETVEFKIAPP LYRRTRILTL SPSSRHEIID LLGLPPRNIS VVPPGIDPSF SPAGERSPHP LVLAVGRLVP VKRFDVLIDG LVHAHDEHPT MEAVIVGEGY ERVELEKRIS AAGAGGWLRL VGRVDDDALL TLYRRAWVLA SASAREGWGM TITEAAACGT PSVATKIAGH TDAVVDGETG VLVEDPADLG KTLAGVLTDH DLRARLSAGA LAHAATFTWA QTARSTFAAL VREAARHQGR RSSAARAADL VGPHR
|
| |