Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0073 |
Symbol | |
ID | 5668498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 90092 |
End bp | 91351 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239001 |
Product | glycosyl transferase family protein |
Protein accession | YP_001504446 |
Protein GI | 158311938 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.657142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTGTTG CAATCTTCGC GATGGGTACC CGGGGGGATG CCCAACCCGC GGCGATAATC GGCGCCGAAC TGGTCCGGCG CGGTCACGAG GTGGTGTTGG GCGTGCCCGG GGACCTTGCT GGTTTCGGTA TCAGAATGGG ACTCGACACG GCGTCGATCG GCGTCGACGC GCACGAGTTC ATGGGCTCCG AGGAGGTACG GGCCTGGCTG GCATCAGGTG ACCTGAGGAA AATCATGAAC GGGTTCGGTC GGTACAAACG CCAGCGGGCG GAACGTATCG CCGACGCCAT GGCGGACATC TCCACCGACG CAGATCTCAT CGTGTCCGGG GTGACCATCG AGGACGAGGC GGCCTGTATC GCGGAGTGGC GGGGGGTGCC GATGGCATGT CTGCACCACG CGCCGATGCG GGCCAACGGA GAGTTCCCCT TCTTCATCGC CAGCACCCGC CGGCTGCCGC GGGTCGTCAA CCGCCTGATG TATCCGGCTG TCGAGTTCGC CGGGTGGCGG GCCCTCGCCG CCGACGTCAA CCGGCTGCGT GCGAGGCTGG GCCTGCGGCC GGCCCGGGAA CCCACCCCAC GCCGGCTGGC GCGGGCCGGC TCGACGGAAA TCCAGGCCTA CAGCCGGTTC CTGGTGCCAG AACTCGCTGA CTGGGGTCAG CGTCGCCCGC TGGTGGGTTT CCTCACTCTG TCGCCCGAGC AGCGCCGGCT GCTCGGGGAG CACCAGCTCG ACCCCGCCGT CGACCAGTGG CTGGACGAGG GCGAGCCACC CGCATACTTC GGATTCGGGA GCATGCCGGT CCTGGATCCG CCCCGGATCC TCGAGTTGCT TAGCACGGTC GCCGACAGAC TGGGGCTGCG CGCGCTGGTG AGCGGGGCGT GGGCCACGAC CGGCGTCAGC GCCGACCGGC GGGTGTGCGT CGTCGGAGAC CTCGACCACG ACACGGTGCT CCCGCGTTGC CGCATCGCCG TGCACCACGG CGGCGCCGGC ACCACAGCGG CCTCCGTCGC AGCCGGACTG CCGACCGTCG TGTGCTCGGT CATCGGCGAC CAGCCCTTCT GGGGCGCCCG GCTCGAACGC CTCGGTATCG GCGCATCCCT TCGCTTTTCC GAGATGAGCG AGCGGGCCCT CGTCGCTGCC GCGGTCCCCC TGCTGGCCCA CGAACCACGG GAACGTGCAG CGCGGCTGGC CAGCCGGCTG AAGACAGAGA ACGCGGCATG CCGTACCGCC GACGTTCTCG AGGAGATCCA CAAGTCCTGA
|
Protein sequence | MRVAIFAMGT RGDAQPAAII GAELVRRGHE VVLGVPGDLA GFGIRMGLDT ASIGVDAHEF MGSEEVRAWL ASGDLRKIMN GFGRYKRQRA ERIADAMADI STDADLIVSG VTIEDEAACI AEWRGVPMAC LHHAPMRANG EFPFFIASTR RLPRVVNRLM YPAVEFAGWR ALAADVNRLR ARLGLRPARE PTPRRLARAG STEIQAYSRF LVPELADWGQ RRPLVGFLTL SPEQRRLLGE HQLDPAVDQW LDEGEPPAYF GFGSMPVLDP PRILELLSTV ADRLGLRALV SGAWATTGVS ADRRVCVVGD LDHDTVLPRC RIAVHHGGAG TTAASVAAGL PTVVCSVIGD QPFWGARLER LGIGASLRFS EMSERALVAA AVPLLAHEPR ERAARLASRL KTENAACRTA DVLEEIHKS
|
| |