Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0722 |
Symbol | |
ID | 3903512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 830685 |
End bp | 832103 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878055 |
Product | glycosyl transferase family protein |
Protein accession | YP_479835 |
Protein GI | 86739435 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.693159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGG AACGCCCGAC CCGGCCGGAG GCGGGATCGG CCGTCGAGTC GAAGACCGTC GAGCCGCTGG CGACCATCGT GATCGTCAAC TGGAACGGCG CGCATCTCCT CCCGGCCTGC CTGGACGGCA TCGCCAAGCA GGAGGCGGAT TTCGCCTACC AGACCTGGGT GGTCGACAAC GCCTCGTCCG ACGGCTCGCG GGAGCTGCTC GCCGAGCGCT ACCCCTGGGT ACGCGTGGTG CCCGCGGAAT GCAACCTCGG TTTCGCCGGC GGCAACAACC TGGCGCTGCG CCAGGTCACG ACGCCGTTCG CGGTCCTGGT GAACAACGAC GCGATCCCCG AGCCCACCTG GCTCGCGGCG CTGCTGGCGC CGTTCGACGA GCCCGGCGGC CGGCATCTCG GCGCCACCAC CGGCAAGGTC GTCTTCCTCC CCCGGTTCCT GCGGATCAGG CTGCACACGC CTACCTTCGT CCCGGGTCCG CACGACTCCC GCGAGCTGGG GGTGCGGGTG TCCTCAGTGA CGGTGGACGG GCGCGAGGCG TTGCGCGAGG TGCTCTGGGA GAACCTCACC TATGGTGCCG AGGGACCGAG CGGCTCGCCG TACTTCTGGA CCCGAGGGGA CGGCGAACTG TGCGTGCCGA TCCCCGACGG CGGCCCGGTC ACCCTCGGGT TCACCTGGGC GGCGGAATCG CGCAAACAGG TCACCCTGAG CTGGGACGGC CCGACCGGCA CGTCGGCCGG AGGGCGAGCG GACCTCGCGG TGGGCCCCGA GTCGACGACG GTGTCGCTGA CCGTCGGCGG TGAGGCGCCG CGGGTGGACG TCATCAACAA CGTCGGCGGC ATCGTCCTGA CCGACGGTTA CGGTGCCGAC CGTGGTTACC AGCAGGTCGA CGTCGGCCAG TTCGACCAGC CGGAGGACGT GTTCACCGCC TGCGGGAACG GGATGGCGAT CCGTTCGTCG CTCGGCCACG AGCTCGGCTG GTTCGACGAC GCGTTCTTCC TCTACTACGA GGACACCGAC CTGTCCTGGC GGATCCGGGC CCGCGGGTAC GGCATCCGCT ACGTCCCGGG GGCGGTGCTG CGTCACATCC ACTCGGCGTC GAGCGTCGAA TGGTCACCGC TGTTCGTCTT CCACACCGAC CGCAACCGGC TGCTGATGCT CACCAAGGAC GCGACCGCGC CGATGGCGCT GGCCGCCGTG ATCCGCTACC CGCTGACCAC CGTCTCGATC GCGGTGCGCA CGCTGCGCCA GGCCTGGCGC GCGCGCAGCC AGCCCGCGAT CCGACCGACC CTGCTGCGGC TGCGGGTGTA TGGCTCCTAC CTGCGGCTGC TGCCCGACAT GCTGCGGGCC CGGCGCGAGA TCGGGCGCAG CGCCGCGCAA CGGCGGGCGA GCCTGCAGAG CTGGCTGGTG TCCCGATGA
|
Protein sequence | MSEERPTRPE AGSAVESKTV EPLATIVIVN WNGAHLLPAC LDGIAKQEAD FAYQTWVVDN ASSDGSRELL AERYPWVRVV PAECNLGFAG GNNLALRQVT TPFAVLVNND AIPEPTWLAA LLAPFDEPGG RHLGATTGKV VFLPRFLRIR LHTPTFVPGP HDSRELGVRV SSVTVDGREA LREVLWENLT YGAEGPSGSP YFWTRGDGEL CVPIPDGGPV TLGFTWAAES RKQVTLSWDG PTGTSAGGRA DLAVGPESTT VSLTVGGEAP RVDVINNVGG IVLTDGYGAD RGYQQVDVGQ FDQPEDVFTA CGNGMAIRSS LGHELGWFDD AFFLYYEDTD LSWRIRARGY GIRYVPGAVL RHIHSASSVE WSPLFVFHTD RNRLLMLTKD ATAPMALAAV IRYPLTTVSI AVRTLRQAWR ARSQPAIRPT LLRLRVYGSY LRLLPDMLRA RREIGRSAAQ RRASLQSWLV SR
|
| |