Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5892 |
Symbol | |
ID | 5674214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7154705 |
End bp | 7156135 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244741 |
Product | glycosyl transferase family protein |
Protein accession | YP_001510143 |
Protein GI | 158317635 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.861107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACC GGGCGACGAG CAACGGAGCG ATGACCACCA TGGCCATGAA CGTGGCCATG ACGGGCCAGG CGGCACCCGA GCCGCTCGCC ACGATCGTCA TCGTGAACTG GAACGGTGCC CACCTGCTCC CGCCCTGCCT GGACGCCGTC GCCAAGCAGG ACGCGCCGTT CACCTTCGAG ACGCACGTGG TCGACAACGC CTCGGCCGAC GACTCGCGCG AGGTGCTGGC ACAGCGCTAC CCGTGGGCGA AGCTCGTCCC CTCGGACCGC AACCTCGGCT TCGCCGGCGG CAACAACCTC GCCCTGCGAG GTGTCACGAC CCCGTACGCG GTACTGCTGA ACAACGACGC GATCCCCGAG CCGGACTGGC TGGCCCGGCT GCTGGCCCCG TTCGCCGAGC CGGGCGGCAA CCGGCTCGGC GCCGTCACCG GCAAGGTCGT GTTCCTCCCC CGCTTCCTGC GGCTGACCCT GTCGACGCCC ACCTTCTCGC CGGGGCCGCA CGACCCGCGC GAGCTGGGCG TCCGGGTGAG CTCGGTGACG GTGAACGGCC GCGAGGCGCT GCGCGAGGTG CTGTGGGAGA AGCTCACTTT CGGCGCCGAG GGCCCGGCGG ACGCGCCGTT CTTCTGGACC CGCGGCGAGG GTGAGCTGTG TGTGCCGGTG CCCGAGGGCG GCCCGGTCAC CATCGGCCTC ACCTGGGCCG CGGACACCGC CAAGCAGGTC ACGCTGGGCT GGGCCGGCGC CGGTGACACC ACCGCGACGC GCACGCTGCC GGTGGGCACC GAGCCGTCCG CCGTGTCGTT CACCGTGGAC GAGGGCGCCC CGCGCGTCGA CGTGATCAAC AACGTCGGCG GGATCGTGCT CACCGACGGC TACGGCGCCG ACCGCGGCTA CCAGCAGATC GACACCGGCC AGTTCGACAA CCCCGAGGAG GTCTTCACCG CCTGCGGCAA CGGCATGGCG ATGCGGACCG AGCTCGGCCA GGCGCTCGGC TGGTTCGACG ACGACTTCTT CCTCTACTAC GAGGACACCG ACCTCTCCTG GCGCATCCGG GCCCGCGGGT ACCAGATCCG CTACGTCCCG GGCGCGGTGC TGCGGCACGT CCACTCGGCG TCGAGCGTCG AGTGGTCCCC CCTGTTCGTG TTCCACACCG ACCGCAACCG GCTGCTGATG CTGACCAAGG ACGCGACCGT GCGCACGGCC GTCTCGGCGG TCACGCGCTA CCCGCTGACC ACCGCGTCGA TCGCCGTGCG GACCTGGCGA CAGGCACTGC GCTCGCGCAG CCGCCCGGCG GTGCGGCCCA CCGTGCTGCG GGTTCGGGTC TTCGCCTCCT ACCTGCGGCT GCTGCCGGCG ATGCTGCGCC GCCGCCGGGA GATCGGCGCG ACCGCCACCG AACGCCGGGT CAGCCTGCAG AGCTGGCTGG TGGAGCGATG A
|
Protein sequence | MTDRATSNGA MTTMAMNVAM TGQAAPEPLA TIVIVNWNGA HLLPPCLDAV AKQDAPFTFE THVVDNASAD DSREVLAQRY PWAKLVPSDR NLGFAGGNNL ALRGVTTPYA VLLNNDAIPE PDWLARLLAP FAEPGGNRLG AVTGKVVFLP RFLRLTLSTP TFSPGPHDPR ELGVRVSSVT VNGREALREV LWEKLTFGAE GPADAPFFWT RGEGELCVPV PEGGPVTIGL TWAADTAKQV TLGWAGAGDT TATRTLPVGT EPSAVSFTVD EGAPRVDVIN NVGGIVLTDG YGADRGYQQI DTGQFDNPEE VFTACGNGMA MRTELGQALG WFDDDFFLYY EDTDLSWRIR ARGYQIRYVP GAVLRHVHSA SSVEWSPLFV FHTDRNRLLM LTKDATVRTA VSAVTRYPLT TASIAVRTWR QALRSRSRPA VRPTVLRVRV FASYLRLLPA MLRRRREIGA TATERRVSLQ SWLVER
|
| |