Gene Information |
Locus tag | Franean1_3857 |
Symbol | |
ID | 5672220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4584655 |
End bp | 4585926 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242735 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001508155 |
Protein GI | 158315647 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
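The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4584655, 4585926)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields of this record.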
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.352881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGTG TGCTGGTGGT GGCGGACAAG TTCCCGCCGA CGATCGGCGG GATCCAGACG TTCGCCTGCC GGCTGACGGC CGGCCTACCC CCGGACCGGG CCGTCGTCCT CGCCCCGGCC CAGCCCGGTG ACGCCGAGTT CGACCGCACC CTCGGCTTTC CGGTGATCCG CACCGAGCAC GGCATGATCA CCTCGCCGCG TGGCCGGCGG GAGCTGCGGG CCGCCGTCCG GGCGCACGGG TGCGAGGTGG CGTGGTTCCC CACCGCTGCC CCGCTCGGCG TGCTCGCGCC GGTGCTGCGC GAGGCCGGCG TCGAGCGGGT GGTGGCCTCC AGCCACGGGC ACGAGGTCGC CTGGTCCCGG CTGCCGTTCG GACGGCTGCT GGTCTCCACC GTCGGCGCGC GGGTCGACGT GCTGACCTAC CTCACCGAGT TCACCCGCCG CCGGCTCGCG GCGGTCACCC CGCCGGGGAC CGAGCTGGCC CGGCTCACCG GCGGCGTCGA CACCGAGCGG TTCCAGCCGG GCACCGGCGG GGACGAGATC CGCCGGGGCC TGGGCTGGTC GGACGAGCCG GTCGTGATCT GCGTGGCCCG CCTCGTCACC CGTAAGGGCC AGGACACGCT CATCCGGGGC TGGCACGACG TCCGACGCCG GCACCCGCAC GCGCGGCTGC TGCTCGTGGG CGGCGGGCCC GCCGAGGACC GCCTGCGCCG CTTGGCCGCG CGGGCCGGCG TCTCCGACGG TGTGCACTTC GCCGGTCCCG TGCCCGACGA GCTCCTCCCC GCCTACCTCG ACGCGGCGGA CGTCTTCGCG ATGCCGTCAC GCACCCGGCT GTGCGGGCTC GACCTGGAGG GCCTCGGGCT CTCCGCGCTC GAGGGGGCGG CCAGCGGCCT GCCGGTGATC ACCGGCGCCC AGGGCGGCGC ACCGGACGTC GTCATCCCCG GCCGCACCGG CGTGGCCGTC AACGGGCACG ACCGCACGGC CGTGGCCGCC GCCGTCATCG ACCTGCTCGA CGACCCGCGG CAGGCGGAGC GCATGGGCGC GGCCGGCCGC GCGTGGATGC GGGCGGCGTG GAGCTGGGAG ACGCTCAGCC TGCGCCTCGC CGGCATCCTC AGCGGGCAGG CCCCGACCGC CATCGGCGCC GGCGCCGATG CGGCATGGAC CGCGGCAGAT GCCGAGGATG GGGCAGATGC CGTGGCCGGG GCCCGGGCGG CTGTCGCGGT GGGCGGCGCC CGGGTGGGCA TGACGGTGCC GGAGCGCGGC GATGGTCGTT GA
Protein sequence | MPRVLVVADK FPPTIGGIQT FACRLTAGLP PDRAVVLAPA QPGDAEFDRT LGFPVIRTEH GMITSPRGRR ELRAAVRAHG CEVAWFPTAA PLGVLAPVLR EAGVERVVAS SHGHEVAWSR LPFGRLLVST VGARVDVLTY LTEFTRRRLA AVTPPGTELA RLTGGVDTER FQPGTGGDEI RRGLGWSDEP VVICVARLVT RKGQDTLIRG WHDVRRRHPH ARLLLVGGGP AEDRLRRLAA RAGVSDGVHF AGPVPDELLP AYLDAADVFA MPSRTRLCGL DLEGLGLSAL EGAASGLPVI TGAQGGAPDV VIPGRTGVAV NGHDRTAVAA AVIDLLDDPR QAERMGAAGR AWMRAAWSWE TLSLRLAGIL SGQAPTAIGA GADAAWTAAD AEDGADAVAG ARAAVAVGGA RVGMTVPERG DGR
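The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is listed in the coding (5' to 3') orientation, which is consistent with it starting at ATG and ending at TGA even though the gene lies on the minus strand. The codon table is written out by hand rather than taken from a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so sequences grouped in blocks of ten can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence in this record.
print(translate("ATGCCGCGTG TGCTGGTGGT GGCGGACAAG"))  # MPRVLVVADK
print(translate("GATGGTCGTT GA"))                      # DGR
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed in this record.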