Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0829 |
Symbol | |
ID | 3905106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 967427 |
End bp | 969034 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637878162 |
Product | glycosyl transferase family protein |
Protein accession | YP_479942 |
Protein GI | 86739542 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03469] hopene-associated glycosyltransferase HpnB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.759846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGC GGCAGGAGAG TTTCGGGGGG GAGATCTCCA TACCGGCTCG GCCAGACCGG GACGGAAGCC TTGTAACCCC GCATTACCCG GGGCGCCCGC ACGCCCTTGA GGGGCCCGTG GCAGTATCAC TTTCTGTGAC AGTGCTGCTG TGGGTCGCCG TTGCCTCTCT GCTCGCCTGG CTCTACCTGA CCGTCGGCCA TGGGTTCTTC TGGCGGACCG ATCAGCGCCT GCCATCCCGG CAGGCGCCGA CCGCTTGGCC CAGTGTGGCG ATCGTCGTGC CGGCTCGGGA CGAGGCCGAC GTGCTTCCCG TGACGCTTCC GACGCTGCTT GCCCAGGACT ATCCCGGTCC TGTCCAGCTG ATCCTGGTGG ACGACGGTTC CACCGACGGG ACCACGGAGG TTGCCCGGGA CCTGGCGGAA CAGGCCGCCA CGCGTGGTGA GACCAACGTG ACGCCGGCCA TGACCACATC CACCGAACCC CCCACTGGCT GGACCGGGAA GCTGTGGGCG CTGCGGCGGG GCATCGAGTG CGCTGGCGAC GTCGACTTCC TGCTCCTCAC CGACGCGGAC ATCTCCCACC GCCCGGGATC GCTGACCGCG CTCGTCGAGT CGGCGACCTC CCAGAAGCTG GACATGGTGT CGCAGATGGC CGTGCTACGG GTGCAGACCG GCTGGGAGCG TCTGATCGTC CCGGCCTTCG TGCACTTCTT CGCGATGCTG TACCCGTTCC GGTGGTCGAA CCGGCCGGGT TCGCGGGTGG CCGCCGCGGC CGGCGGATGC TCCCTGATCC GCCGGGAGGC CCTTGCGGCG GCCGGTGGAC TGGCGGAGGT GCGCGGCGCC GTCATCGACG ACGTGGCGAT CGCGCGGATC ATCAAGCGCT CGGGCGGGCG GACGTGGCTC GGACTCGCCG AGCAGGTCCA CAGCCAGCGC CCGTATCCGC GGCTCGCGGA TCTGTGGAAG ATGGTGTCCC GCAGCGCCTA TGCGCAGTTG CGGCACTCCC CGTCGCTGCT GGTCGGGACC GTGTTGGGAC TGAGCCTGGT CTTCGTCATC CCAGTGGTCG CGACCATCGT GGGCATCGTG ACCGGCGACG TCGCCACGGC TCTCGTCGGT GGGATCGCTT GGTTGATCAT GACTGTCACC TATCTGCCGA TGACCCGCTA CTATTGTCAG CCGCTGCCGC TGGCCCTGCT GCTCCCCGGA GTGGCCGTGC TGTACCTCGC GATGACGGTG GACTCGGCGC GGCTCAAGCG GGCCGGGCGG GGAGCGGCCT GGAAGGGACG TACCTACCAG GATCACGGTG CCCCGGCCGC CGCCCCCGAG TATCCGAACG GCCCGGGTGA ACGGCGTGAG GGTTCGGTGG CCGGGCCGCC CGACTCCGGC GGCTCATCGG CCTCCGCCGG CGGCTCATCG GCTTTGCCAT CCGCCTCGTC CACGGTGCCC GCAACACCCC TGGTAATGGC GACATCCTCG GCGGTCCCCG CGCCGGCCGT GGCGCCGGCT TCCCCCACGC CTACGTCCAT ACCGACGCCC ACTCCCACGC GGTGGCCCGC ATCGTCATCG GCCTCGCCGC CGGTCCGGGG CGGATCGACC GATCAGCCCC GGACCTAG
|
Protein sequence | MVARQESFGG EISIPARPDR DGSLVTPHYP GRPHALEGPV AVSLSVTVLL WVAVASLLAW LYLTVGHGFF WRTDQRLPSR QAPTAWPSVA IVVPARDEAD VLPVTLPTLL AQDYPGPVQL ILVDDGSTDG TTEVARDLAE QAATRGETNV TPAMTTSTEP PTGWTGKLWA LRRGIECAGD VDFLLLTDAD ISHRPGSLTA LVESATSQKL DMVSQMAVLR VQTGWERLIV PAFVHFFAML YPFRWSNRPG SRVAAAAGGC SLIRREALAA AGGLAEVRGA VIDDVAIARI IKRSGGRTWL GLAEQVHSQR PYPRLADLWK MVSRSAYAQL RHSPSLLVGT VLGLSLVFVI PVVATIVGIV TGDVATALVG GIAWLIMTVT YLPMTRYYCQ PLPLALLLPG VAVLYLAMTV DSARLKRAGR GAAWKGRTYQ DHGAPAAAPE YPNGPGERRE GSVAGPPDSG GSSASAGGSS ALPSASSTVP ATPLVMATSS AVPAPAVAPA SPTPTSIPTP TPTRWPASSS ASPPVRGGST DQPRT
|
| |