Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1856 |
Symbol | |
ID | 5670258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2228430 |
End bp | 2230109 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240777 |
Product | glycosyl transferase family protein |
Protein accession | YP_001506200 |
Protein GI | 158313692 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0355886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0028209 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAACGACT CGCGCACCAC CGGCGGTCCG CTGCCGGAGA CCGCCGGCCC GCTCGCCGCC GGCGGCCTCC CCGGCGCCCG CGCCGATGCC ACCGTCGCCG ATGTAGCCGT CGCCGGCACC GCTGTCACCG GCACTACCGT CGCCGGCACT GCCGTCGCCG ACTCCGCTGA TGACGGTGAT CCCGCCGACC GCCTCGACGT CTCCGTCGTC ATGCCGTGCC TGAACGAGGC CGAGTCGGTC GGCGTCTGCG TCCGCAAGGC GCTGGCCGGG CTGGCCGCCG CCGGAGTCGC GGGCGAGGTC GTCGTCGTCG ACAACGGCTC GACCGACGGC TCCGCGGCGG TGGCGACCGC GGCCGGCGCG CGCGTCGTCG CCGAGTCACG GCGTGGCTAC GGCAACGCCT ATCTCGCCGG CTTCGCCGCC GCGCACGGCC GGTTCCTGGT CATGGGCGAC TCCGACGACA CCTACGACTT CGCCGACCTC GGCGCGCTGC TCGCCCCGCT GCGCGCCGGG CGCGCCGACT ACGTGCTGGG TTCCCGGTTC GCCGGTGAGA TCCTGCCCGG CGCCATGCCC TGGCTGCACC GATACGTCGG CAACCCGCTC CTCACCGGCA TCCTCAACCG CCTGTTCGAC GTCCGCTCGT CCGACGCCCA CTCCGGGATG CGGGCCTTCA CCAGGGACGC CTACCGGCGG ATGCGGCTGC GCTGCGAGGG CATGGAGCTC GCCTCCGAGC TCGTCATCGC CGCCCGTCGA GCCGAGCTGC GGATCGAGGA GGTGCCGATC ACCTACCACC CGCGGGTCGG GGCGTCGAAG CTCCACTCAC TGCGGGACGG CTGGCGCCAC CTGCGGTTCA TGCTGCTGCT GGCGCCCAGG CACCTGTTCG TCCTGCCGGG TCTGGTCCTG TTCGGGCTGG GCACGGCCGG CCAGCTGGCG CTCCTGCCCG GTTCGCTCGA TGTCGGGTTC CACCGGCTCG ACCTGCACTT CTCCGTGCTG TTCGCACTGA TCGCGATCCT CGGTTGGCAG TTGGTGCTTC TCGGTGTCTT CGCCGACGTC CACAACCATG CCGCGGGGTG GCAGGAGCGC CGCCGCTGGC CGCTGACGTC GATCCACCGG CGTTTCACGC TCGAGCGGGG CCTGGCGGCC GGCGGGATCC TGTTCACCGT CGGCTTCGCG ATCGACTGCG TCATACTCGC CCGATGGCTG GCGAACTCGA TGGGGCCGCT CAACGAGCTG CGCCCCGCCC TGCTCGCCAT GTCGCTGATG GTGCTCGGCG CGCAGACCGC CTTCGGGTCG TTCTTCCTGC GGCTGGTGAC GGCCGGGCCG AGCGGCGGCC ACCGCCGGGC CGGCTGGGCG CCCGCGACCG GACTCGCCGT CTCAGCCGCG GCCACCGGCG CGTCGCCATC CGCAGTCTCG CCGTCCTCAG TGTCGCCGGC CGATCCCCCG CCGGCCGCGG CCCCGCCGGC CGCGAGGCCG GTGGCGGCCG AGCCGGGCGA CGACGCGGCG GCTGGTGACC GCGCCCCCGG GCCGGTCGGG GCGCCGGGCC CGGTCGGGAC GCCGACGCCA GACGGCGGAG CCGGTCCGGC TACCCGCACG GGCCCGGAAA GTCATGAGAA CGACGAGGAT GCGGCGTTCC TCGCCGCGCC CGCACCCGTC CTCGGTGGAA TTCCCGCCCA CCAGCGCTGA
|
Protein sequence | MNDSRTTGGP LPETAGPLAA GGLPGARADA TVADVAVAGT AVTGTTVAGT AVADSADDGD PADRLDVSVV MPCLNEAESV GVCVRKALAG LAAAGVAGEV VVVDNGSTDG SAAVATAAGA RVVAESRRGY GNAYLAGFAA AHGRFLVMGD SDDTYDFADL GALLAPLRAG RADYVLGSRF AGEILPGAMP WLHRYVGNPL LTGILNRLFD VRSSDAHSGM RAFTRDAYRR MRLRCEGMEL ASELVIAARR AELRIEEVPI TYHPRVGASK LHSLRDGWRH LRFMLLLAPR HLFVLPGLVL FGLGTAGQLA LLPGSLDVGF HRLDLHFSVL FALIAILGWQ LVLLGVFADV HNHAAGWQER RRWPLTSIHR RFTLERGLAA GGILFTVGFA IDCVILARWL ANSMGPLNEL RPALLAMSLM VLGAQTAFGS FFLRLVTAGP SGGHRRAGWA PATGLAVSAA ATGASPSAVS PSSVSPADPP PAAAPPAARP VAAEPGDDAA AGDRAPGPVG APGPVGTPTP DGGAGPATRT GPESHENDED AAFLAAPAPV LGGIPAHQR
|
| |