Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3677 |
Symbol | |
ID | 5672043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4353396 |
End bp | 4355135 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242560 |
Product | glycosyl transferase family protein |
Protein accession | YP_001507980 |
Protein GI | 158315472 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.846949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTACC AGGCAGCGCC CGCCGACGCG GCCGGCCCGG CTTCACCCGA GTTCGCGCCA CCGTCAGCAC CGCCCAGCCT GGCTCCGCGT GAGCCGTCCG CGGTCGCGGC CCCGGCCGTC CCGGCGATCC CGGCGCAGGC CGTGGCCGGC GACATCCGCG ACGCCGCCCG GGCCGGGATC CCGCCGCTGG ACGGCGTCGA CACCTGGGTC CGGGTGCCCG TCGCGTGGTG GAGGGCCGTC GCCGGACGGG TCGTCGGGGT CGCGATCACG CTGATCGGCG CGGTGTACGC GGTGTGGCGC GCCGGGACGC TGGACGGCAC CGGCGTGGCC GGCCACCTGT TCTACGCGGC CGAGATCGTC AGCTACCTCA CGATCGTGTG GACGGCGGTG ATGACCGGCC GGATGCGCAC CGGCCACGTC CGGCGCGCCC CGGCGCCAGC CGGCACCCTC GACGTCTTCG TCACCGTCTG CGGCGAGCCG GTCGAGATGG TCGAGGCGAC GCTGCGCGCG GCGCTGGCGA TCGACTACCC GCACCGCACC TACGTCCTCA ACGACGGGCG GATCGCCGGC CGGCCCAACT GGCGCGACAT CGACGCCCTC GCCGCTCGGC TCGGGATCAT CTGCTTCACC CGCACCGACG GGCCTCGCGG CAAGGCCGCG AACCTCAACC ACGGCCTGGC CCGCACCGAC GGCGACGCGA TCATGACGCT GGACGCCGAC CACATCGCGG TGCCCGATCT CGGCGAGCTG GTCCTCGGCT ACCTGCGGGA CCCGAAGGTC GGGTTCGTCT GCACCGAGCA GCGCTTCGAC GTCGGCCGCC ATGACGTGCT CAACAACGCC GAACCGATGC TGTACAAGGC GGTGCAGCCG GCGAAGGACC GCGACAGCGC CGCGTCGTCC TGCGGGAACG GCACCCTGTA CCGGCGGACG GCGGTTGAGT CCGTCGGCGG CTTCAGTGAG TGGAACATCG TCGAGGACCT GCACACCTCC TACCAGCTGC ACGCCGCCGG CTGGCAGAGC GTCTACCACC ACGGCCCGGT GTCCGTCGGG ATCGCGCCGG CCACCGCGGC GGAGTACGCC AAGCAGCGCA GCCGGTGGGC GATGGACGGC CTGCGCCTGC TGCTGTTCGA CAACCCGCTG CGCAAGCCCG GCCTGACCGG CTGGCAGCGG GCGCACTACC TGCATACCGG GATCGGCTAC CTGGTGGCGT GCGCGCAGAT GATGTTCCTG CTGGGGCCGC CGCTGAGCGT GCTGGCCGGG GTCCAGATCG CGGCCGGGGT GTCGCTGACC GCCTACGTGC TGCACGCCCT GCCGTACCTG GTCGGCTCGC TGCTGTTCAT CGTCGCCTAC ACCGGGCCGC GCGGAGCCCA GCGGACGGTG GCCAGCACCC TGTTCAACGC TCCGCTGTAC GCGCTGTCGT TCGTGCGGGT CGTGCTCTCC GGCCGACCCG ACTCCGGCGC GACCGCGAAG ACCGCGCTGC CGCGGATGTC GCTCCTGCTG CTGCCCCAGG TGCTTTTCGC CGCCAGTCTG GTGGTCACCA TTCTCGTCGT CGGCGTCAGC CCGGACGTGG CCGACCTGTC CGCGCTGGTG TGGGCCGGGG TGCTGCTGTC GATGGTGGCC GGGCCGCTGT CGGCGCTCTC GGAACGCCAG GACCGGGTGG AGCGGGCCCA GCTGCCGATC CGGGCCGTCA TCCTCGGACT GGTCCTGAGC TTCGCGGTCG TCACCCTCCT GGAGGGCTGA
|
Protein sequence | MAYQAAPADA AGPASPEFAP PSAPPSLAPR EPSAVAAPAV PAIPAQAVAG DIRDAARAGI PPLDGVDTWV RVPVAWWRAV AGRVVGVAIT LIGAVYAVWR AGTLDGTGVA GHLFYAAEIV SYLTIVWTAV MTGRMRTGHV RRAPAPAGTL DVFVTVCGEP VEMVEATLRA ALAIDYPHRT YVLNDGRIAG RPNWRDIDAL AARLGIICFT RTDGPRGKAA NLNHGLARTD GDAIMTLDAD HIAVPDLGEL VLGYLRDPKV GFVCTEQRFD VGRHDVLNNA EPMLYKAVQP AKDRDSAASS CGNGTLYRRT AVESVGGFSE WNIVEDLHTS YQLHAAGWQS VYHHGPVSVG IAPATAAEYA KQRSRWAMDG LRLLLFDNPL RKPGLTGWQR AHYLHTGIGY LVACAQMMFL LGPPLSVLAG VQIAAGVSLT AYVLHALPYL VGSLLFIVAY TGPRGAQRTV ASTLFNAPLY ALSFVRVVLS GRPDSGATAK TALPRMSLLL LPQVLFAASL VVTILVVGVS PDVADLSALV WAGVLLSMVA GPLSALSERQ DRVERAQLPI RAVILGLVLS FAVVTLLEG
|
| |