Gene Francci3_0713 details


Gene Information

Locus tag: Francci3_0713
Symbol:
ID: 3903503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 817471
End bp: 818547
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 66%
IMG OID: 637878046
Product: glucose-1-phosphate thymidyltransferase
Protein accession: YP_479826
Protein GI: 86739426
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.974379
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCCC TCGTCCTTGC TGGTGGTTCG GGAACCCGTC TGCGGCCGAT CACCCACACC 
TCGGCCAAAC AGCTTGTTCC AGTGGCGAAC AAGCCGGTGC TTTTCTACGG CCTGGAGGCC
ATCCGTGACG CCGGTATCAC CGATGTCGGG ATCATCGTGG GGGAGACGGC CGGCGAGATC
CAGGCGGCGG TAGGTGACGG CTCGGCATTC GGGATCCAGG TCACCTACAT CCGCCAGGAC
GCGCCGCTCG GACTGGCACA TGCCGTGCTC ATCGCCCGCG ACTTCCTCGT CGATGAACCG
TTCGTCATGT ACCTCGGCGA CAACCTGATT ATCGGTGGGA TCTCCAGCCT GGTCGAGGAA
TTCCGACGGA CCACTCCGGA CGCCCTGATC CTGCTGACCA GGGTCGACAA CCCCTCCGCC
TTCGGTGTCG CCGAGCTCGG TGCGGATCGG CAGATCATTC GACTGGTCGA GAAGCCGCTC
GTTCCGCCGA GTGACCTGGC GCTCGTCGGC GTCTACATGT TCGGCACCCC CATCCACGAC
GCCGTGGGGG CGATCAAACC GTCGGCCCGC GGCGAGCTGG AGATCACCGA GGCGATCCAG
TGGCTCGTCG ACGGCGGATA CGAGGTCGCC TCCCACCTGG TCGAGGGTTA CTGGAAGGAC
ACCGGCCGAC TCGACGACAT GCTCGAGACG AACCGCCATC TCCTCGAGTC GATCGAGCCG
GCGATCCGTG GCAGCGTTGA CGAGCACAGC ACGATCGTTG GCCGTGTGGT GATCGAGGAG
GGCGCCTCGC TGGTCCGGTC CACCGTGCGC GGGCCGGCCA TCATCGGCCG GGACACCACC
CTCGTCGATA CCTATGTCGG CCCGTTCACC TCGATCTTTC ACTCCTGCGT CATCGAACGG
ACCGAGATCG AGTACTCGAT TGTGCTGGAG CGGGCGACGA TCCGTGGGAT CGGGCGGATC
GAGGACTCGC TGATCGGCCG GGATGTCGAG GTCGTCCCCT CCGCGGCCCT GCCCAGGGCG
CATCGGCTCA TGCTGGGCGA CCACTCCCGC GTCTCGGTGG CTACGACGGG GACCTGA
 
Protein sequence
MKALVLAGGS GTRLRPITHT SAKQLVPVAN KPVLFYGLEA IRDAGITDVG IIVGETAGEI 
QAAVGDGSAF GIQVTYIRQD APLGLAHAVL IARDFLVDEP FVMYLGDNLI IGGISSLVEE
FRRTTPDALI LLTRVDNPSA FGVAELGADR QIIRLVEKPL VPPSDLALVG VYMFGTPIHD
AVGAIKPSAR GELEITEAIQ WLVDGGYEVA SHLVEGYWKD TGRLDDMLET NRHLLESIEP
AIRGSVDEHS TIVGRVVIEE GASLVRSTVR GPAIIGRDTT LVDTYVGPFT SIFHSCVIER
TEIEYSIVLE RATIRGIGRI EDSLIGRDVE VVPSAALPRA HRLMLGDHSR VSVATTGT
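
The length, GC content, and protein fields in this record can be cross-checked against the sequences. The following is a minimal Python sketch of such a check, not the database's own method. The function names (clean, gene_length, gc_fraction, translate) are hypothetical, and the sketch assumes that the coordinates are 1-based and inclusive and that the gene sequence is given in coding orientation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so the standard assignments are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(817471, 818547))
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAAGGCCC TCGTCCTTGC TGGTGGTTCG"))
```

With the coordinates from this record, gene_length(817471, 818547) gives 1077, and 1077 / 3 = 359 codons, one of which is the stop codon, which agrees with the listed protein length of 358 aa. Pasting the full gene sequence into gc_fraction and translate allows the GC content and protein sequence fields to be compared in the same way.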