Gene Information |
Locus tag | Franean1_2150 |
Symbol | |
ID | 5670550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2579408 |
End bp | 2580721 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641241071 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001506492 |
Protein GI | 158313984 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGCGGC ATGTACTCAG CACAAAGTCA GGGGTAGTAA TGGGCTGGAT GGTGTCACAG ATCGGCGCGC GAGAGCACTA CGCGACGGCT GTCGGGCTTG AGGTTTACTC GACGCTAGAC CAACTGTACA CCGATGCATG GTGCCGTTTG CCGCTCAGCT CTATTCCGAA ACTTCCTGGC CCGATGGTTC GTTACGCCGG ACGTGCAGAT CGGCGGATCC GATCGGTAAA GAGTTATGAA CTACAGCTGG GCCCAAGTCA CATTCGGAGT GCTGTCGCGC ACCGTTTCTC CCGAAATTAC GCGCACGAGC TTCGATGCAT AGAAATCGGC AGAAGGTTCG ACACGTTGGT GCGTAAGGAC CTGCGTAGGC GCCGCTTCGA TCCGAATAAG GACGCATTCT TTGGCTACTT CGGTGGCGCG TTGGAAAGCC TTCGATATCT GGGTGACCAG GGAGTCCCCA CGATACTCGA CCAGACTGGA TGTGGACGGT CCTACTACGA AGAGGTTGCA GCAGAAAGGC TCCTTTGGCC GGAATGGGAA GGGCGCCCTC CGACGATACA CGAGGCGTAT TTCGACCGAG CCCACGATGA ATGGAAAGCC GCCTCCGCGG TGGTGGTTAA CTCCAACTGG GCCCGAAAAT CTGCGCTGAA AGAAGGGTGC TCGCCCGACA AGATATTCGT GCTTCCTCTG GCCTACGATG CGCCGAGTCT GAAGGTTGGT GCGCGGCCAC CACGTCCACA CGGTCGACTC AAGGTAATGT GGCTCGGTCG CGTTATTTTG TCCAAGGGTA TCCAATATCT ACTGCTGGCA GCACAGCTAC TTCCCGAGGT CGACTTTATC GTTGCGGGAC AGATCGGGGT AGATGCCAAC GTCCTACGAA AGGCAACGCC GAGTAACGTG AAGTTCCTCG GCCCGATCCC GCGCTCCCAT GCAGCGGAGT TTCTGACCTC GGGTGACCTG TTCGTTCTTC CAACTCTATC CGACAGCTTC GCGTTGACTC AGCTGGAAGC CATGTCGGCA GGCCTTCCCG TTATTACCAC CGATCGTTGT GGAGATGTCG TGACCGATGG TCAGAACGGT TATATAGTGC CAGTTCGTGA TCCTTATGCT ATTGCGAATG CTGTGGCACG GCTCGATTGC GACCGAAATA TGCTCAAGGA ATTTTCTCGT CTTGCGGTGA TCCGCGCCAG GCAACTGTCC CTGACGAAGT ACGTGGAGAA CCTGGAGACT ATTCGGCGAG GCATCTGCCC CGCGCCCGCG ATGCGCGACG GCAGTCGCGG ATCAGATCCA TTCCAGGCTG CCACCGTCTG GTAG
|
Protein sequence | MPRHVLSTKS GVVMGWMVSQ IGAREHYATA VGLEVYSTLD QLYTDAWCRL PLSSIPKLPG PMVRYAGRAD RRIRSVKSYE LQLGPSHIRS AVAHRFSRNY AHELRCIEIG RRFDTLVRKD LRRRRFDPNK DAFFGYFGGA LESLRYLGDQ GVPTILDQTG CGRSYYEEVA AERLLWPEWE GRPPTIHEAY FDRAHDEWKA ASAVVVNSNW ARKSALKEGC SPDKIFVLPL AYDAPSLKVG ARPPRPHGRL KVMWLGRVIL SKGIQYLLLA AQLLPEVDFI VAGQIGVDAN VLRKATPSNV KFLGPIPRSH AAEFLTSGDL FVLPTLSDSF ALTQLEAMSA GLPVITTDRC GDVVTDGQNG YIVPVRDPYA IANAVARLDC DRNMLKEFSR LAVIRARQLS LTKYVENLET IRRGICPAPA MRDGSRGSDP FQAATVW
|
| |
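The lengths in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the listed gene sequence is the coding strand. The function names (`span_length`, `translate_prefix`) are hypothetical, and the codon table holds only the codons needed for the first ten residues. The first codon, TTG, is an allowed initiation codon in translation table 11 and is therefore read as M.

```python
# Values copied from the record above.
START_BP = 2579408
END_BP = 2580721
GENE_LENGTH_BP = 1314
PROTEIN_LENGTH_AA = 437

# First 30 bases of the listed gene sequence, with the spaces removed.
FIRST_30 = "TTGCCGCGGCATGTACTCAGCACAAAGTCA"

# Partial codon table: only the codons that occur in FIRST_30 after the start codon.
CODONS = {
    "CCG": "P", "CGG": "R", "CAT": "H", "GTA": "V", "CTC": "L",
    "AGC": "S", "ACA": "T", "AAG": "K", "TCA": "S",
}


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate_prefix(dna):
    """Translate a CDS prefix; the initiation codon is always read as M."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "M" + "".join(CODONS[c] for c in codons[1:])


span = span_length(START_BP, END_BP)
protein_len = span // 3 - 1  # the final codon is the stop and is not translated
prefix = translate_prefix(FIRST_30)

print(span, protein_len, prefix)
```

Under these assumptions the computed span matches the listed gene length of 1314 bp, the codon count minus the stop codon matches the listed protein length of 437 aa, and the translated prefix matches the first ten residues of the protein sequence.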