Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1050 |
Symbol | |
ID | 5669464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1232081 |
End bp | 1233469 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239979 |
Product | hypothetical protein |
Protein accession | YP_001505412 |
Protein GI | 158312904 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.729107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACC ACCACTCCGA GCTCACGGCC CTGCTCTCGG ACTGGCTGCC CAGGCAGCGC TGGTTCGCGG GCAAGGGCCG GCCGGGCGGG AGGCTGCGCG TCGGGCAGGA CGTCCGCCTC AGCTTCGACG CGGCGGTGAA GGCGGCGATG CACCTGCTGG TGGTGGAGGT CCGCTACGAC GACGGCGGCC TCTCCGACCA CTACCAGGTC CCGGTGGTGA TCCGGCCGGA CGCCCCCTTC GGCCACGAGG GGTTCCTCAT CGGCGAGTCG TCGGTGGGCC TGGTCTACGA CGGCCTGCAC GACTCCGACG GAAGCGCCGC CCTGCTGGAC TACCTGCGCC GGGGCGCGAG CCGCGAGGGG CTGACCGCGA CGGCGGTCGA GTCGTTGGAC GACCTGCCCG CGCACGCGGT CGGCGCCGAG CAGTCGAACA CCTCGATCGT CTACGGCGAC GCCTACATCC TGAAGGTCTT CCGCCGCCTG TGGCCCGGGA CGAATCCCGA TCTGGAGATC ACCCGGGTTC TGGCGCGCGC CGGCAGCGAG CACGTGGCCC GGCCGGTGGC CTGGCTGAGC GGGCAGCTCT CCGGCGTCCC GACGACCTTC GCGTTCATGC AGGACTTCCT GCGCACCGGG GCCGAGGGCT GGCTGCTGGC CCTGGCCAGC GTCCGCGACC TCTACGCCGA GGGCGACCTG CACGCCGACG AGGTGGGCGG CGACTTCGCC GCCGAGGCCG AGCGGCTGGG CGCCGCGACC GCCCAGGTGC ACCGCGACCT GGCCGCCGCG CTGCCGACCC GGCCCGCCGA CGCCGCCGCG CTCGGCGAGG TCGCCGACTA CCTGCACAGC CGGCTCGACG CCGCGCTGGC GGCCGTGGCG GAGCTCGCGC CGTTCGAGGC CGCACTGCGG ACCGCCTACG ACGAGGTCCG CCGCGCCGAC CACGCGGCGC CGTTCCAGCG CATCCACGGC GACCTGCATC TCGGCCAGGT GCTGCGGGTG GAGTCGGGCT GGGTGCTGTT CGACTTCGAG GGTGAGCCGG CGCGGCCGGT GCCCGAGCGG ACCCTGCTCG AATCCCCGCT GCGCGACATC GCCGGCATGC TCCGGTCGTT CGACTACGCG GCCCAGTCGA TGCTGCTCGA GCGCTCCGAC GAGCCGTCGC TGGCCTACCG GGCGCTGGAG TGGGCCGACC GCAACCGGGA CGCCTTCTGC CGCGGCTACG GCGCGGTGTC CGGCGCGGAT CCCCGGGACG GCGGCGCCGT CCTGCGTGGT CTCGAGCTCG ACAAGGCTGT GTACGAAGTG CTCTACGAGG CGCGCCACCG GCCGGGCTGG ATCAGCATCC CGCTGCGTTC GGTCGAACGG TTGACCGGCG GGCGACCCAC TGAGCTCCCC GCGCCCTGA
|
Protein sequence | MTDHHSELTA LLSDWLPRQR WFAGKGRPGG RLRVGQDVRL SFDAAVKAAM HLLVVEVRYD DGGLSDHYQV PVVIRPDAPF GHEGFLIGES SVGLVYDGLH DSDGSAALLD YLRRGASREG LTATAVESLD DLPAHAVGAE QSNTSIVYGD AYILKVFRRL WPGTNPDLEI TRVLARAGSE HVARPVAWLS GQLSGVPTTF AFMQDFLRTG AEGWLLALAS VRDLYAEGDL HADEVGGDFA AEAERLGAAT AQVHRDLAAA LPTRPADAAA LGEVADYLHS RLDAALAAVA ELAPFEAALR TAYDEVRRAD HAAPFQRIHG DLHLGQVLRV ESGWVLFDFE GEPARPVPER TLLESPLRDI AGMLRSFDYA AQSMLLERSD EPSLAYRALE WADRNRDAFC RGYGAVSGAD PRDGGAVLRG LELDKAVYEV LYEARHRPGW ISIPLRSVER LTGGRPTELP AP
|
| |