Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4682 |
Symbol | |
ID | 5673024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5592512 |
End bp | 5593786 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243539 |
Product | galactokinase |
Protein accession | YP_001508955 |
Protein GI | 158316447 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC GTGGTGAGGT TTCCGCGACG GCCGGGGGCG TCGGCGCAAC CCCCGCCCCG GGCAGCCCGG GCACCCCGGG CGCGGCGGAG CGGGCCGTCG CGGCTTTCGT CCGGTGCCAC GGGGCCCGGC CGGCCTACCT GGTCCGCTCG CCTGGTCGGG TGAATCTGAT CGGCGAGCAC ACCGACTACA ACGGTGGCCT CTGCCTGCCG GTGGCGATCG ACCTCGAGCT GTGTCTCGCG TTCTCCCCGT CCCCGCGCGC GACTGCGGGC CACGGGTTTG TCGAGGTGCT TTCCGAGCAT CGCGCGGCAC CGGCCCGGAT CGACCTGCCG CCGCCTCCGC CGCCTGAGCC CGGGTCACCG GGCCGGGCCG GGGCCGCGCA ATCCGGATGG GCCGGGTACG TCGAAGGTGT CGTCGTGATG GCGGCGTCGG TCGCCGGCGC CGCCGCGTCG CGAGGCTGGT ACGGGACGCT GGCCAGCGAC CTGCCGCTCG GCGCGGGGCT GTCGTCGTCG GCCGCGCTGG AGCTTGCCGT GGCCCGGGCC TGCGCGGTGG TGTGGAAGAC CGGCTGGGAT CCGATCGAGG CGGCCCGGCT CGCCCAGCGG GCCGAGAACG GCTGGGTGGG CGCCGCCACC GGCCTGCTTG ACCAGATCGC CTGCGCCGCG GCGACGGCCG GCCACGCTCT GGAGATCGAC TTTCGTGATC TGACGGTGAC CGCGGTCGCC GTCCCGGATT CGGTCGTCGT CGCGGTCGTG GACACCGGCA CCCGCCGGGA GCTGGTCACC AGCGCCTACG CCCAGCGCCG GGCCGAATGT GAGCGGGCGG CGGCCGGGTT GGGCGTCGCG AGCCTGCGCG ACCTCGGCGA CCGGTTGCCC GCCGACGCGG CCCGCCGGCT CGACCCGGTG GCGTTGCGCC GGGCGCGCCA CATCGTCACC GAGAACACGC GGGTACGGGA GCTCGCCGAC GCACTGCGCC GCGTGGACCT GCCGCGGGCC GGCACCGTGC TCCTCGACGG CCACCGGTCG ATCCGGGATG ACTTCGAGGT GTCCGGTCCC GAACTGGAAG CTGCGGTCGA GGCCTGCCTG AACGCTCCCG GCTGCCACGG TGCGCGGATG ACCGGCGGCG GTTTCGCCGG CTGCGTCGTC GCCCTGGTCG ACGCGGACTC GGCGGCTGCC TTCGCCGCGG CGGTGGTCGG TGCCTACCGC GCGAGAACTG GGAACGAGGC GGTGGTCCAT CTGTGCGCGC CGGTCGACGG CACCTCGCTG GTGGACGCGC CCTGA
|
Protein sequence | MSQRGEVSAT AGGVGATPAP GSPGTPGAAE RAVAAFVRCH GARPAYLVRS PGRVNLIGEH TDYNGGLCLP VAIDLELCLA FSPSPRATAG HGFVEVLSEH RAAPARIDLP PPPPPEPGSP GRAGAAQSGW AGYVEGVVVM AASVAGAAAS RGWYGTLASD LPLGAGLSSS AALELAVARA CAVVWKTGWD PIEAARLAQR AENGWVGAAT GLLDQIACAA ATAGHALEID FRDLTVTAVA VPDSVVVAVV DTGTRRELVT SAYAQRRAEC ERAAAGLGVA SLRDLGDRLP ADAARRLDPV ALRRARHIVT ENTRVRELAD ALRRVDLPRA GTVLLDGHRS IRDDFEVSGP ELEAAVEACL NAPGCHGARM TGGGFAGCVV ALVDADSAAA FAAAVVGAYR ARTGNEAVVH LCAPVDGTSL VDAP
|
| |