Gene Information |
Locus tag | Franean1_1673 |
Symbol | |
ID | 5670075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2001345 |
End bp | 2002580 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641240591 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001506017 |
Protein GI | 158313509 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.272276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATCG CTTTCCTATG CGAGCAGTAC CCACCGATCA TCTGGGATGG CGCTGGCGTC TACACGCACG ACATCGCTCA CGCACTGGTT CGGCGTGGCC ATGAGGTTCA TGTGCTGTGC ACCCAGGGCC GCCGTATCCG CGACGACGTG TTCGACGGCG TTCACGTTCA CCGCCGCCCG CTGTTGCGTG CGCCTGTCAC TCGATACCTT GGGCCGGCAG CAAAGCTGAT CAACGGCCGG GATCATCCGC GTGACTCCTT GTCGCTGCGG GCCTCACTGG CGGTGTCGTA CGGCTTCTGG CTTCGGCAGA GCGGGATCAA CCCGGACGTC ATCGAGACAC AGGACGGCGA GACACGGGGC CTGTTCACGG CTCTCGGTCG GCGTACGCCG CTGGTGATCC ACCTGCACAC GCCGACGATG ATGGACGTCC GTCTGCGGGA TCCCGAACTG AGCCGCAAGG GGGAGCTGGC CGACCGCATC GACCGCTTCT CGGCGCTGCG AGCGGACGCG CGGACTTCCC CGTCCCAGCT CCTCGTCGAC ACGCTGCACG AGCTCGACTG GCTGCGACCG GACACCGACG TCGACGTCAT CCCGTACCCG TTCGACAACG TGCCGTTCGC GAGCGTCCCC ACGGCAGAGC ACACCGGACC GAACCTGGTC GTGGTCGGCC GGCTGGAGTG GCGCAAAGGG CTGGACGTGC TCGTCGAGGC CGCGTCCCGG CTACTGGCGC GGGGAGTCGA GGCCAAGCTG ATCTTCGTCG GGCAGTCGTC CGGTCGGATC GAAGGCGTCG AGACCGGGGC ATGGCTCGAG CGGAAGGCCG CTGAGCTGGG CGTTCCGGTC CGGTTCGAGG GGCATGTCTC CCGCACGGAG CTTCCGGCGC TCTACGGTGA GGGCCGGGCG GTCGTTGTGC CGAGCCGGTT CGAGAGCTTC TCCATCGCGG GCCTTGAGGG GATGGCCGCC GCTCGCCCGG TGGTCGCCAC AGCGACGACC GGCGTCTCGA CCTGGGTCGA CCGCTGGAAG GGCGGCGCCG TCGTGCCGCC GGAGGACCCG GAGGCGATGG CGGACGCTCT CGAGCCGTTC CTGACCGACC AGGACCACGC GGCCGTCGTC GGCCTGCGTG GTCGGATGGG CACCGCTGAG CTGGATCCGG CGCGCATCGC CGAGCGCCGC GAGGAGGTCT ACCTCAAGGC GATCGCGCGT CATGAGGTCC GCCGGCCCGA GAGGCAGCGG GGATAG
|
Protein sequence | MRIAFLCEQY PPIIWDGAGV YTHDIAHALV RRGHEVHVLC TQGRRIRDDV FDGVHVHRRP LLRAPVTRYL GPAAKLINGR DHPRDSLSLR ASLAVSYGFW LRQSGINPDV IETQDGETRG LFTALGRRTP LVIHLHTPTM MDVRLRDPEL SRKGELADRI DRFSALRADA RTSPSQLLVD TLHELDWLRP DTDVDVIPYP FDNVPFASVP TAEHTGPNLV VVGRLEWRKG LDVLVEAASR LLARGVEAKL IFVGQSSGRI EGVETGAWLE RKAAELGVPV RFEGHVSRTE LPALYGEGRA VVVPSRFESF SIAGLEGMAA ARPVVATATT GVSTWVDRWK GGAVVPPEDP EAMADALEPF LTDQDHAAVV GLRGRMGTAE LDPARIAERR EEVYLKAIAR HEVRRPERQR G
|
| |
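The length fields in the record above can be cross-checked against the coordinates. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the terminal stop codon (the gene sequence ends in TAG). Both assumptions are consistent with the listed values of 1236 bp and 411 aa. The function names are hypothetical helpers. `gc_percent` shows one way to compute a GC percentage from a sequence printed in space-separated blocks of 10 bases.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record: Start bp, End bp.
length = gene_length(2001345, 2002580)
print(length)                  # 1236, matches Gene Length
print(protein_length(length))  # 411, matches Protein Length

# First two 10-base blocks of the gene sequence only (not the full gene).
print(gc_percent("ATGCGGATCG CTTTCCTATG"))  # 50.0
```

Passing the full Gene sequence string to `gc_percent` should give a value comparable to the GC content field, assuming that field is computed over the gene itself.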