Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2166 |
Symbol | |
ID | 5670566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2600408 |
End bp | 2601814 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241087 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001506508 |
Protein GI | 158314000 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.485753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCAGG AACCCGCACC GCGAGGTGTG GGAATCCCCG GTCTTCAGAC CGGGGAGGAG GTCAACACGG AGTCTCGGAT CCGTCGGAGT GCCGACGCGG CGGCGTGCGC CCGTTCCCCG GCGGACAAGA GCGCGGGACC CACTGCGATC GACAGACCCG CCGCGGTCGA CGGGACCACC GGTCACGGCG AGCGGCGCGG CTCGGGAAAC GACCGCGAGC GCACGTCGGT CGGGAGGCGG GTGGACCCCG CGCCCGCCCG CCCTGACCGC GACAGTCCTG TGACGGTGCT GGAGGTGCTC CCGAGGATGG ACCGGGCGGG CGAGGTGATC CGCGCGGTCA ACCTCCTGCG GCGCCTGGAC CCGCAGGAGT ACCGGCTGCT GTTCTGCGTC ACCTCGGGTG CCCCCGGATC GCTGGACGAC GAGATCCGGG CACTGGGTGG CGAGGTCTAT TACTGCCGCG CCGACCTGAG GTTCCCGCTC GCCTTCTACC GGCTGCTGCG TTCGGTCCGG CCGGACATCG TCCACTCGGG TGTGGCGACC TTCTCGGGCG TGGTGCTCGC GGTGGCCCGG GTGGCCGGTG TGTCGCGCCG CGTCGCGCAC TTCTTCAGCA GCGCGGACCA GAGCGGCGAC AGCCTCCGCG GCCGCCTCCA GCGGATGGTG GGGCGGGTGT TGCTGGACGC GTTCGCCACC GACCTGCTCG CGGTCAGCGA GGCGGCGATG CGCGGACGGT GGCGGGAGAC CTGGCGGCTC GACCCCCGGT GCCGGGTCAT CTACAACGGG GTCGAGCTCG AGCCCTTCGG AGTGGCCATC GCGGGCCAGC GGCCCATGCC GGACCTCCCC GAACTCGACG AGTTCGGGGA GGCCATGGCA CCGCAGCTGA CCGTCCTGCA CGTCGCCCGC CCGGACCCGG TCAAGAACCG GGCCCGGGCC ATCGAGATCG TCGCGGCGAT GTGCGCGCGG GGGCTCGACG TCCGCCTGCG GATCGTCGGG CGCCAGACCG AGGAGGAGAC CGAGCGGCTG ATGACCCTGG CCCGGGGTCT GGGTGTGTCC GACCGGGTCG AGTTCATCGG CGAGCGGCTC GACATCCCGA AGCTGTTGGT GACCTCGTCG CTGCTGCTGG TGACTTCGCT GCGCGAGGGG CTGCCGAGTG TGGTGCTCGA GGCCTGCGCG GTCGGGACCC CGGTGCTGTC GTCCGACCTG CCGGGAGTGG GGGAGATCGC CCGGGTGCTG CCCGGGATCA CCATGCTGCC GCTGGGCACC CCCAACGAGA TCTGGGCCAA CACCGCGGCT GATCTGGCGG TCGTCCCGCC CACGATGGAC GAGCGCCGTG AGGCGATGCG GCGGCTGCGG CGGTCCCCGT TCACGATGGA GAACTGGCAG CGCGACATCA CGGCCGTCTG GTCGTAG
|
Protein sequence | MKQEPAPRGV GIPGLQTGEE VNTESRIRRS ADAAACARSP ADKSAGPTAI DRPAAVDGTT GHGERRGSGN DRERTSVGRR VDPAPARPDR DSPVTVLEVL PRMDRAGEVI RAVNLLRRLD PQEYRLLFCV TSGAPGSLDD EIRALGGEVY YCRADLRFPL AFYRLLRSVR PDIVHSGVAT FSGVVLAVAR VAGVSRRVAH FFSSADQSGD SLRGRLQRMV GRVLLDAFAT DLLAVSEAAM RGRWRETWRL DPRCRVIYNG VELEPFGVAI AGQRPMPDLP ELDEFGEAMA PQLTVLHVAR PDPVKNRARA IEIVAAMCAR GLDVRLRIVG RQTEEETERL MTLARGLGVS DRVEFIGERL DIPKLLVTSS LLLVTSLREG LPSVVLEACA VGTPVLSSDL PGVGEIARVL PGITMLPLGT PNEIWANTAA DLAVVPPTMD ERREAMRRLR RSPFTMENWQ RDITAVWS
|
| |