Gene Information |
Locus tag | Franean1_5703 |
Symbol | |
ID | 5674029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6920571 |
End bp | 6921935 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244556 |
Product | glycosyl transferase family protein |
Protein accession | YP_001509959 |
Protein GI | 158317451 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03469] hopene-associated glycosyltransferase HpnB |
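
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(6920571, 6921935)
print(length)                  # 1365, matching "Gene Length"
print(protein_length(length))  # 454, matching "Protein Length"
```

Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.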
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0459973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTCCAC TGCTGGTGAT TGCGCTGATC TCGCTCGCGT CCTGGATTTT CCTGGCGCTG TTCCGAGGAT TCTTCTGGCG GACCGATCAA AGACTGCCCG CCGGCGACGG ACCGCTACCG GAACAGTGGC CGGCGGTAGT CGCCGTGGTC CCCGCGCGTG ACGAGGCGGA CGTCCTTCCC GACACGCTTC CCTCACTGCT CGCGCAGGAC TATCCCGGGC GCCTGAGCAT CATTCTGGTG GATGACGCCA GTACCGACGG GACGGGCGAG CTGGCCCGGG AGCTGGCGGC GCGGGCGGCG GCGGCCCGTC CCGAGGCGGC GGTGGCGCTC ACCGTCATCG GGTCGAGCGA GCCGCCGGCC GGCTGGACGG GCAAGCTCTG GGCGTTGCGG CACGGCATCG CCGCGGCCGG CGCGCCCGAG TTCCTGCTGC TGACCGACGC CGACATCGCG CACGATCCGA GCTCGGTGCG CGAGCTCGTC CGGGCGGCGA CGGCCCGCCG GCTTGATCTC GTCTCGCAGA TGGCGCGGCT GCGGGTCAAC ACCGGATGGG AACGCCTCAT CGTGCCGGCC TTCGTCTACT TCTTCGCGAT GCTCTACCCG TTCCGGTGGT CCAACGACCC GGACTCGCGG ATCGCCGCCG CCGCCGGGGG ATGCTCGCTC GTCCGCCGCC GGGCGCTCGC CGACGCCGGG GGGCTGGACG CCATCCGGGA CGCGGTGATC GACGACGTCG CACTGGCCCG CGTGATCAAG AAGTCCGGCG GGCGGACCTG GCTCGGGCTC GCCGACCACG TCTCCAGTCG CCGGCCGTAC CCGCGGCTGG CGGACCTGTG GCACATGGTG GCGCGCACCG CCTACGCGCA GTTGTTCTGG TCGCCGCTGC TACTCGTGGG CACGGTTCTC GGGCTCGGTT TGGTCTTCGT CGCCCCGGTC GTCGCGACCA TCGCCGGCAT CGCCGCCGGA AATGTGGCAG TGGCCGCCGC CGGGCTGCTC GCCTGGTCGG TCATGATCAC GACGTTCGGA CCGATGCTGC GGTACTACGA CCAGCCCGTG CTCTCCTCGC TCGCGCTGCC GTTCACCGCG GCCCTCTACC TGGCCATGAC CATGGACTCG GCCCGGCGGC ACCGTGCCGG CCGGGGCGCG GCCTGGAAGG GGCGCACCTA CTCAGCTCCC GACGGGAAGC GGGTCGGCCA GGGAGCCGGT CAGGACGCCG GCCAGGGAGT CGAGACGGCG GCGCAGGGCG AGGGCGTCGT CCGCCAGCGC GCAGACGAGG ACCCCCGGTC CGGCCAGGGG CATCAGGGCG ACACTGTCCC CTACCGCGGC GGTGACCGGC TCGAGACCGG TCGCCGGCCC GGCGACCAGG ACTGA
|
Protein sequence | MLPLLVIALI SLASWIFLAL FRGFFWRTDQ RLPAGDGPLP EQWPAVVAVV PARDEADVLP DTLPSLLAQD YPGRLSIILV DDASTDGTGE LARELAARAA AARPEAAVAL TVIGSSEPPA GWTGKLWALR HGIAAAGAPE FLLLTDADIA HDPSSVRELV RAATARRLDL VSQMARLRVN TGWERLIVPA FVYFFAMLYP FRWSNDPDSR IAAAAGGCSL VRRRALADAG GLDAIRDAVI DDVALARVIK KSGGRTWLGL ADHVSSRRPY PRLADLWHMV ARTAYAQLFW SPLLLVGTVL GLGLVFVAPV VATIAGIAAG NVAVAAAGLL AWSVMITTFG PMLRYYDQPV LSSLALPFTA ALYLAMTMDS ARRHRAGRGA AWKGRTYSAP DGKRVGQGAG QDAGQGVETA AQGEGVVRQR ADEDPRSGQG HQGDTVPYRG GDRLETGRRP GDQD
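
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It assumes the gene sequence is given 5' to 3' on the coding strand, which the GTG start and TGA stop suggest, and the helper names are hypothetical. It is applied here to the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna):
    """Translate a coding-strand sequence, reading a valid start codon as M."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
fragment = "GTGCTTCCAC TGCTGGTGAT TGCGCTGATC"
print(translate(fragment))  # MLPLLVIALI, the first ten residues of the protein
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.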