Gene Information |
Locus tag | Francci3_3098 |
Symbol | |
ID | 3904224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3670095 |
End bp | 3671225 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880419 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_482184 |
Protein GI | 86741784 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
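The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3670095, 3671225)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields in the record.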
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.034289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.933214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTGC TGGCCGTCAC GAACGATTTC CCGCCCCGTC CCGGCGGGAT CCAGGCCTAC GTGCACAACT TCGCCTCGCG GCTGCCCGAG GGCGAGATCG TCGTCTACGC CCCGGCCTGG AAGAACGCCG CCGCCTTCGA CGCCGAACAG AACTTCCCGG TCGTGCGGCA CACCACGTCG CTAATGCTGC CGACACCGGA CGTCCTGCGC CGAGCCAGAG AGATCGCCCG GGCGGAGGGA TGCGACACGA TGTGGTTCGG CGCCGCGGCA CCGCTCGGGC TGCTTGGAGC CCGGCTGCGC CGCGACACGG CCATGCGCAG GATGGTCGCG AGCACCCATG GTCACGAGGT CGGCTGGGCG GCGCTGCCGG GGGCGCGTCA GGCGTTGCAC AGCATCGGCA CCGCGGCCGA CGTCATCACC TATCTGACCG ACTACACCCG GGCCCGGATC CGGCCCGCCT TCGGCGGCCA TCCCACCTTC GCCCGGCTGC CCAGCGGAGT CGACCCCTCG CTGTTCCATC CCGGTCACGG GCGCGAGGAG ATGCGCCGGC GCCACGGGCT GACGGGCCGC CGGGTGGTGG TGTGCGTAAG CCGGCTGGTC GCCCGCAAGG GCCAGGACAT GCTGATCAGG GCGCTGCCCA TGGTACGGCG CCGCGTACCG GACGCCGCGC TGCTGATCGT CGGCGGCGGT CCCCGGCGGG GTGACCTTGA ACGGCTCGCC CGGGAGAACG ACGTCGCCGA GCATGTGATC ATGACTGGTT CGGTGCCGTG GGAGGAACTG CCGGCGCACT ATGCGGCGGG CGATGTGTTC GCGATGCCCT GCCGCTCCCG CCTCGCCGGC CTGGAGGTCG AGGGGCTCGG CATCGTCTTC CTCGAGGCGT CGGCGACCGG CCTGCCGGTG GTGGCCGGCC GCAGTGGGGG TTCCCCCGAC GCCGTCCTGC ACCAGCACAC CGGCATCGTG ATCGACGGTA CCGATCTGGC GCAGGTCGTG ACGACCATCG GTGATCTTCT TGCCGACCCC GACCGGGCGG CGTCGATGGG TGCCGCGGGG CGGGCGTGGG TCGAGCTGCG CTGGCGGTGG GACGTCCTCG CGCAGGACCT GCGCACGCTG CTCGCCGGCC CGGACGGTTA G
|
Protein sequence | MRVLAVTNDF PPRPGGIQAY VHNFASRLPE GEIVVYAPAW KNAAAFDAEQ NFPVVRHTTS LMLPTPDVLR RAREIARAEG CDTMWFGAAA PLGLLGARLR RDTAMRRMVA STHGHEVGWA ALPGARQALH SIGTAADVIT YLTDYTRARI RPAFGGHPTF ARLPSGVDPS LFHPGHGREE MRRRHGLTGR RVVVCVSRLV ARKGQDMLIR ALPMVRRRVP DAALLIVGGG PRRGDLERLA RENDVAEHVI MTGSVPWEEL PAHYAAGDVF AMPCRSRLAG LEVEGLGIVF LEASATGLPV VAGRSGGSPD AVLHQHTGIV IDGTDLAQVV TTIGDLLADP DRAASMGAAG RAWVELRWRW DVLAQDLRTL LAGPDG
|
| |
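The relationship between the gene sequence, the translation table, and the protein sequence can be illustrated with a short sketch. It assumes the sequence is pasted as shown in the record (blocks of 10 bases separated by spaces) and uses the codon assignments of translation table 11, which match the standard code for elongation codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen only for this example.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGCGCGTGC TGGCCGTCAC GAACGATTTC"))  # MRVLAVTNDF
print(translate("GACGGTTAG"))                         # DG
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_content` on the same input can be compared with the GC content field.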