Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5878 |
Symbol | |
ID | 5674201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7135435 |
End bp | 7136565 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244728 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001510130 |
Protein GI | 158317622 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0821211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.729107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACCCC GGGTGCTCGT TGATGCGACC TCGGTTCCGG CCGACCGCGG TGGTGTCGGG CGGTATGTCG ACGGGCTCGT CGCTGCTCTG GGCGCGGCCG GCGCCGACAT GGCGCTGGTG TGCCAGCGAT CGGACGAGGA ACGTTACAGC CGGATGGCGC CGCGGGCGAC CGTCCTGTCG GGCCCGGCGG CCATCGCGCA CCGGCCGGCT CGGCTGGCGT GGGAGCAGAC AGGTCTTCCG CTCGTCGCCG AACAGGTCAA TGCGGACGTC ATCCACTCGC CGCACTACAC GATGCCACTG CGCGCGCAGC GGCCGGTATG CGTGACGATC CATGACGTCA CCTTCTTCAC CGAGCCGGAG ATGCACACGG CGGTGAAGGG CACGTTCTTC CGGTCGGCGA TGCGGACGGC GGTGCGCCGG GCGAGCCGCA TCATCGTCCC GTCGAAGGCC ACGCGCGACG AGCTCGTCCG CGTCCTCGAG GGCGAGTCGA CGACGACCGA CGTCGCCTAT CACGGGGTGG ACACGACCAC GTTCCACCCG CCGACGGAGG AGGACCGGCG CCGGGTGCGG CTGCGCCTCG GCCTCGGTGA CACCCGTTAC GTGGCCTTCC TCGGAATGCT CGAGCCGCGC AAGAACGTCC CGAACCTGAT TCGCGGCTGG GCGGAGGCGG TGCACTGGCG GGACGAGCCC CCGGCGCTCG TGCTGGCCGG TGGTTCCGGC TGGGATGACG ACGTCGACGC GGCCGTCGCC TCGGTGCCGA GCCATCTGCG GGTGATCCGG CCCGGCTACC TGCGCTTCTC CGACCTCCCG GGCTACCTGG GCGGTTCGGA GCTGGTCGCC TATCCGTCGC ACGGTGAGGG CTTCGGCCTA CCGGTGCTGG AGGCGATGGC CTGCGGCGCC CCCGTGCTGA CGACCCCGCG CCTCTCGCTG CCCGAGGTGG GCGGCGACGC GGTCGCCTAC ACCCAGCCCG ACCCGGACTC GATCGCCCGC GAGATGAGCG CGCTGCTCGA CGACGCCGAG CGTCGCGCCC AGCTCGCCGC GGCCGGGCTC GCCCGGTCCC ACGAGTTCAC CTGGGCGGCC TCCGCGGAGG CCCACCTGGC GAGCTACGCC CGCGCGGTGG CCGACGCCTG A
|
Protein sequence | MGPRVLVDAT SVPADRGGVG RYVDGLVAAL GAAGADMALV CQRSDEERYS RMAPRATVLS GPAAIAHRPA RLAWEQTGLP LVAEQVNADV IHSPHYTMPL RAQRPVCVTI HDVTFFTEPE MHTAVKGTFF RSAMRTAVRR ASRIIVPSKA TRDELVRVLE GESTTTDVAY HGVDTTTFHP PTEEDRRRVR LRLGLGDTRY VAFLGMLEPR KNVPNLIRGW AEAVHWRDEP PALVLAGGSG WDDDVDAAVA SVPSHLRVIR PGYLRFSDLP GYLGGSELVA YPSHGEGFGL PVLEAMACGA PVLTTPRLSL PEVGGDAVAY TQPDPDSIAR EMSALLDDAE RRAQLAAAGL ARSHEFTWAA SAEAHLASYA RAVADA
|
| |