Gene Franean1_4682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4682 
Symbol 
ID5673024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5592512 
End bp5593786 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content75% 
IMG OID641243539 
Productgalactokinase 
Protein accessionYP_001508955 
Protein GI158316447 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC GTGGTGAGGT TTCCGCGACG GCCGGGGGCG TCGGCGCAAC CCCCGCCCCG 
GGCAGCCCGG GCACCCCGGG CGCGGCGGAG CGGGCCGTCG CGGCTTTCGT CCGGTGCCAC
GGGGCCCGGC CGGCCTACCT GGTCCGCTCG CCTGGTCGGG TGAATCTGAT CGGCGAGCAC
ACCGACTACA ACGGTGGCCT CTGCCTGCCG GTGGCGATCG ACCTCGAGCT GTGTCTCGCG
TTCTCCCCGT CCCCGCGCGC GACTGCGGGC CACGGGTTTG TCGAGGTGCT TTCCGAGCAT
CGCGCGGCAC CGGCCCGGAT CGACCTGCCG CCGCCTCCGC CGCCTGAGCC CGGGTCACCG
GGCCGGGCCG GGGCCGCGCA ATCCGGATGG GCCGGGTACG TCGAAGGTGT CGTCGTGATG
GCGGCGTCGG TCGCCGGCGC CGCCGCGTCG CGAGGCTGGT ACGGGACGCT GGCCAGCGAC
CTGCCGCTCG GCGCGGGGCT GTCGTCGTCG GCCGCGCTGG AGCTTGCCGT GGCCCGGGCC
TGCGCGGTGG TGTGGAAGAC CGGCTGGGAT CCGATCGAGG CGGCCCGGCT CGCCCAGCGG
GCCGAGAACG GCTGGGTGGG CGCCGCCACC GGCCTGCTTG ACCAGATCGC CTGCGCCGCG
GCGACGGCCG GCCACGCTCT GGAGATCGAC TTTCGTGATC TGACGGTGAC CGCGGTCGCC
GTCCCGGATT CGGTCGTCGT CGCGGTCGTG GACACCGGCA CCCGCCGGGA GCTGGTCACC
AGCGCCTACG CCCAGCGCCG GGCCGAATGT GAGCGGGCGG CGGCCGGGTT GGGCGTCGCG
AGCCTGCGCG ACCTCGGCGA CCGGTTGCCC GCCGACGCGG CCCGCCGGCT CGACCCGGTG
GCGTTGCGCC GGGCGCGCCA CATCGTCACC GAGAACACGC GGGTACGGGA GCTCGCCGAC
GCACTGCGCC GCGTGGACCT GCCGCGGGCC GGCACCGTGC TCCTCGACGG CCACCGGTCG
ATCCGGGATG ACTTCGAGGT GTCCGGTCCC GAACTGGAAG CTGCGGTCGA GGCCTGCCTG
AACGCTCCCG GCTGCCACGG TGCGCGGATG ACCGGCGGCG GTTTCGCCGG CTGCGTCGTC
GCCCTGGTCG ACGCGGACTC GGCGGCTGCC TTCGCCGCGG CGGTGGTCGG TGCCTACCGC
GCGAGAACTG GGAACGAGGC GGTGGTCCAT CTGTGCGCGC CGGTCGACGG CACCTCGCTG
GTGGACGCGC CCTGA
 
Protein sequence
MSQRGEVSAT AGGVGATPAP GSPGTPGAAE RAVAAFVRCH GARPAYLVRS PGRVNLIGEH 
TDYNGGLCLP VAIDLELCLA FSPSPRATAG HGFVEVLSEH RAAPARIDLP PPPPPEPGSP
GRAGAAQSGW AGYVEGVVVM AASVAGAAAS RGWYGTLASD LPLGAGLSSS AALELAVARA
CAVVWKTGWD PIEAARLAQR AENGWVGAAT GLLDQIACAA ATAGHALEID FRDLTVTAVA
VPDSVVVAVV DTGTRRELVT SAYAQRRAEC ERAAAGLGVA SLRDLGDRLP ADAARRLDPV
ALRRARHIVT ENTRVRELAD ALRRVDLPRA GTVLLDGHRS IRDDFEVSGP ELEAAVEACL
NAPGCHGARM TGGGFAGCVV ALVDADSAAA FAAAVVGAYR ARTGNEAVVH LCAPVDGTSL
VDAP