Gene Francci3_3451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3451 
Symbol 
ID3905691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4110716 
End bp4112038 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content76% 
IMG OID637880774 
Productgalactokinase 
Protein accessionYP_482534 
Protein GI86742134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.457714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCG CCGAGTATCC GGCCACCGAG CATCCGGCCA CCGAGCGTGC GGTGCGCGCG 
TTCGTCGAGA CCTACGGAGA GCGGCCCACC CACCTCGTCC GCGCGCCCGC CCGGGTGAAC
CTCATCGGCG AGCACACCGA CTACAACGAC GGCTTCTGTC TTCCGGTGGC CATCGACCGG
GAGCTGTGCA TCGCCCTGCG CCGTAACGAG GCACCCGAGC TCCGGCTGGT GTCCGAGCAG
GACGCGGTGC CCGCCGTGAT CCCGCTGCCG CCGCCGGGCT TCGACACGCC GGTGTCCGCC
CGGGCACCCG GATGGGTCCG GTATGTCGAA GGCATCGCGG TGATGCTGGC CGCCGAGTCG
GCCTCCCGTG CCGCCGCCGG ATCGGGGGGC TCCCGTGCCG CCGCCGGGTC AGGCTTCCCG
GCCGGAGGGG GCCCGGTGCC CTGGCGTGGC ACGCTGGCCA GCGACATCCC CGTCGGCGCG
GGACTGTCCT CCTCGGCCGC GCTGGAGCTA GCCGTCGCCC TGGCCGGTGC GCATCTCGCC
GGGTTGACGC CCGCGCCGAC GGAGCTCGCG CTGCTCGCCC AGCGGGCGGA GAACCTCTGG
GTGGGGGCGG CGACGGGCCT GCTCGACCAG CTCGCCTGCG CCGCGGGCGT GGCCGGCCAC
GCGCTGCGGA TCGACTGCCG CACGCTGACG ACCGAGCCGG TGCCGCTGCC GGGCGGGCTG
GCCGTGGTGG TCATCGACAC CGGCTCGCGC CGACAGGTGG TGACCAGCGA GTACGCCACC
CGACGGGCGG AGTGTGAGCG GGCGGCGCGG GCGCTGGGCG TCGCCGCGCT GCGGGACCTC
ACGCCCCGGT CGATGGACGA GGCGGTGGCC CGGCTCGGCA GGTCCGCCCG GTCGGATGGG
ACAACCCGGC CGGGCGGAGC GGCCGGCGGG ACCGGTCTGG ACCCGGTCGC GCTGCGCCGG
GCCCGGTTCG TCGTCGCCGA GAACGCCCGG GTCGACGCCG TCGCCGCGGC ACTGCGCGCG
GGCGACGCCC CAACGGCCGG GCGGCTGTTG CTGGCGGGGC ATCGCGGGAT CCGGGAGGAC
TTCGAGGTGT CGGGTCCGGA ACTCGATGCG GCGGTCCAGG CGGCTTCGGC GGCTCCGGGG
TGCTTCGGCG CCCGGATGAC CGGCGGTGGT TTCGCGGGCT GTGCGGTGGC CCTCGTCGAC
CGGTCCCGGC TGACCGCCTT CACCGAGGCG TTCGAACCGG CGTATGCCGC ACTCACCGGC
CGGTGGGCCG TGCTGCACGT CTGCTCGGCG GTCGCCGGCA CGTCCGTCCT GGACCTCGGG
TAG
 
Protein sequence
MRAAEYPATE HPATERAVRA FVETYGERPT HLVRAPARVN LIGEHTDYND GFCLPVAIDR 
ELCIALRRNE APELRLVSEQ DAVPAVIPLP PPGFDTPVSA RAPGWVRYVE GIAVMLAAES
ASRAAAGSGG SRAAAGSGFP AGGGPVPWRG TLASDIPVGA GLSSSAALEL AVALAGAHLA
GLTPAPTELA LLAQRAENLW VGAATGLLDQ LACAAGVAGH ALRIDCRTLT TEPVPLPGGL
AVVVIDTGSR RQVVTSEYAT RRAECERAAR ALGVAALRDL TPRSMDEAVA RLGRSARSDG
TTRPGGAAGG TGLDPVALRR ARFVVAENAR VDAVAAALRA GDAPTAGRLL LAGHRGIRED
FEVSGPELDA AVQAASAAPG CFGARMTGGG FAGCAVALVD RSRLTAFTEA FEPAYAALTG
RWAVLHVCSA VAGTSVLDLG