Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3722 |
Symbol | |
ID | 5672087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4406517 |
End bp | 4407698 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242603 |
Product | glycosyl transferase family protein |
Protein accession | YP_001508023 |
Protein GI | 158315515 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.671186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGTGG TACCACCGCT GGCCGGGCAT GTGAACCCGG CCACCGCGGT AGCGGGGGAA CTGGCCGCGC GCGGTCACCA GGTGGCGATT GCCGGTCATG CCGACGTCAT CGCGCCGATC GTACCGGCGA AGGTCGACCT GCTCGCGCTG TCCGGGAGAC CGGCCGACGC CGCGGAGCGG ACCAGAATAG AGGCCTCGTC CCAGCGGCTT CGTGGCGTAG CCGCGCTGAA GTTCCTCTGG CAGGACTTTC TCCTCCCCCT CGGCGCGGCG ATGATCCCGG AGATCGACGC CATGGCCAGC GAGTTCCGTC CCGACGTGGT GGTCGCCGAC CAGCAGGCTG TCGGCGCGTC GGTCGTGGCA CGCCGACGAG GCACACGGCT AGCCGTCCTC GCCACCACGC CCGCCGAGTT CGACGACCCC TACGCCGGGC TCGACCGCGT CGGCGCCTGG ATCGCCGGAC TGCTCCAGGA CTTCCAACTC GCCCACGGCA TCCCCCCGGA ACAGGCCGCG GCCACGGACC CCCGGTTCTC CGACCAGCTC ACCCTCATCT GCTCCGTGCC AGGCCTACTC AAAGCGGGCC GTTTCGCGGA ATCAGTGGTC TTCGTCGGCT GCGCGGCCGC TCGGCGTCGT GCCGACCCGG ACTTCCCCTG GACATGGCTG GACGAAACCC GGGCCACCGT CCTGATCTCA CTCGGCACGG TCACCCGCGA GGCCGGCCGC CGTTTCCTGC GCGCCGCCGC CGAGGCGATG CTGTCGATGG CCACCGACGT CCAGGCCGTC GTCGTCGCGC CACCGGGGAC CGCCACCGAC CTGGCCCTGG CGGCGCCCGC CGATCTCCTC GTCACACCAC GCGTACCACA GCTCGCGTTG CTGCCACACC TGGCCGCGGT GATCTGCCAC GCCGGCAACA ACACCGTGTG CGAATCGCTC GCACACGGAG TCCCACTCGT CGTCGCACCC GTCCGCGACG ACCAGCCCAT CATCGCGGAA CAGGTAGAAC GAGCCGGAGC AGGCACCCGG ATCCGATTCG GCCGTGCCGG CGCGGCAACG ATCGCCGATG CCCTGCGAAA CGTGCTCGAC GATCCGACCT ACCGAGCGAC CGCCGGGCGA CTGCGACAGC AGTTCACCGC CGCCGGCGGC ACGGCCACAG CAGCGGCCCA CATCGAGCAA CTCGCCAGCT AG
|
Protein sequence | MFVVPPLAGH VNPATAVAGE LAARGHQVAI AGHADVIAPI VPAKVDLLAL SGRPADAAER TRIEASSQRL RGVAALKFLW QDFLLPLGAA MIPEIDAMAS EFRPDVVVAD QQAVGASVVA RRRGTRLAVL ATTPAEFDDP YAGLDRVGAW IAGLLQDFQL AHGIPPEQAA ATDPRFSDQL TLICSVPGLL KAGRFAESVV FVGCAAARRR ADPDFPWTWL DETRATVLIS LGTVTREAGR RFLRAAAEAM LSMATDVQAV VVAPPGTATD LALAAPADLL VTPRVPQLAL LPHLAAVICH AGNNTVCESL AHGVPLVVAP VRDDQPIIAE QVERAGAGTR IRFGRAGAAT IADALRNVLD DPTYRATAGR LRQQFTAAGG TATAAAHIEQ LAS
|
| |