Gene Franean1_6012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6012 
Symbol 
ID5674333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7332346 
End bp7333881 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content78% 
IMG OID641244860 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001510262 
Protein GI158317754 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0524001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.189514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAG CACATACTGT TGCCGAGGTC CGGGACGCCG AGGCGCCCCT GCTCGCGTCC 
CTGCCCTCCG GCGGGCTCAT GCAGCGCGCC GTGTCCGGTC TCGTCTCGCA CGCGGTCCGC
CGCGTCGACC GGGTGTACGG CGCCCGGATC GTCGTCCTCG CCGGTTCCGG TGACAACGGC
GGTGACGCGC TGTGGGCCGG TGCCCGGCTG GCAGCTCGCG GCGCCCGCGT CCACGCCCTC
GCTCCCGGCC GGACCCATCC CGAGGGCACG GCCGCCCTCC TCGCGGCCGG CGGGCGGCTG
CACCGCACCG GCCCGGTCGA CCCGCCCGCA CCGGAGGGGA TGGGCGCCGA CGCGGCCGCC
GACCTCCTCG ACTCCGCCGA CCTCGTCCTG GACGGGCTGC TCGGCATCGG CGGCCGCGGC
GAGCTGCGCG AGCCGTACGC CCAGCTCACC ACGCTCGCGC CGGCCCGGCG GACGGTGGCC
GTGGACGTGC CCAGCGGGGT CGACGCGGAC ACCGGCGCGG TGGCGGAGGG GGCCGTGCGC
GCGGCCGGCA CCGTCACCTT CGGCACGTAC AAGCGCGGCC TGCTGGTCGG ACCCGGAGCT
GTCCATGCCG GGCGGGTCGA GCTGGTCGAC ATCGGGCTGA CCCTGCCCGA GCCCGACCTA
CGTGCCCTGC AGGACGTCGA CGTGGCCCGG CTGTTGCCCG TCCCGGTCGC GGCCGACTCG
AAGTACTCCC GCGGCGTGCT CGGGCTCGTC GGCGGCAGCG ACCGGTACCC GGGAGCCGCG
GTGCTGGCGG TCGGCGGCGC GCTGCGCGGC GGCGCGGGCT ACCTGCGGGT GGTCGCCGAG
GCCGGCGCCG CCGAGTACGT CCGCCGGGCC CACCCGGAAT CCGTGCTGAC GGTGATCGAG
GCGGGGGACG CGGAGGCGAT GCTCGGGGTG GGCCGGGTCC AGGCGTGGGC GATCGGCCCC
GGCCTCGTAC CGGACGAGGC CACCCGGCGC CTCGTCGACG CGCTGCTCGA GCAGACCGAG
AGCGGCCTGC TCGTGGACGC CGGCGCGCTG GACACACTCG CGGCCGCCGT CGCCGCCCGC
CCGGCGGTGC TGCGGGACCG CGCGGGCGCC GTCGTCCTGA CCCCGCACGA GGGCGAGTTC
ATCCGGCTGA CCGGCACGGC ACTCGGCTGG GACCAGGCCG GCACACCTGA GCGCCTGCGG
GCCGACCGTC TCGGCACCGT CCGCCAGGCG GCGCGGGACC TCGGCGCCGT CATCCTGCTG
AAGGGCAACC GGACGATCAT CGCCGCCCCC GGCGGGGAGG CCCTCGTCAA CCTCACCGGG
ACGCCATGGC TGGGAACGGC CGGATCCGGT GACGTTCTCA CCGGACTCGC CGGTTCCCTG
CTGGCCGCGG GCCTGCCGGC GCCGCACGCG GCCGCCGTGG GCGCGTTCCT GCATGGCCGG
GCGGGGGAGC GCGGGCCCGT GCCGCTCGCC GCCGCAGACC TGCCCGCGCT CCTGCCCGGG
GTCGTCGAGG ACCTGCTTGC TAGCGTCGAG GGGTGA
 
Protein sequence
MLEAHTVAEV RDAEAPLLAS LPSGGLMQRA VSGLVSHAVR RVDRVYGARI VVLAGSGDNG 
GDALWAGARL AARGARVHAL APGRTHPEGT AALLAAGGRL HRTGPVDPPA PEGMGADAAA
DLLDSADLVL DGLLGIGGRG ELREPYAQLT TLAPARRTVA VDVPSGVDAD TGAVAEGAVR
AAGTVTFGTY KRGLLVGPGA VHAGRVELVD IGLTLPEPDL RALQDVDVAR LLPVPVAADS
KYSRGVLGLV GGSDRYPGAA VLAVGGALRG GAGYLRVVAE AGAAEYVRRA HPESVLTVIE
AGDAEAMLGV GRVQAWAIGP GLVPDEATRR LVDALLEQTE SGLLVDAGAL DTLAAAVAAR
PAVLRDRAGA VVLTPHEGEF IRLTGTALGW DQAGTPERLR ADRLGTVRQA ARDLGAVILL
KGNRTIIAAP GGEALVNLTG TPWLGTAGSG DVLTGLAGSL LAAGLPAPHA AAVGAFLHGR
AGERGPVPLA AADLPALLPG VVEDLLASVE G