Gene Francci3_3092 details

Gene Information

Locus tag: Francci3_3092
Symbol:
ID: 3904218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3663788
End bp: 3664846
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 72%
IMG OID: 637880413
Product: glucokinase
Protein accession: YP_482178
Protein GI: 86741778
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
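
The start and end coordinates, gene length, and protein length in this record are related by simple arithmetic, which the following Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3663788, 3664846)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.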


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGCATCG CTACTCCTGA TAGTTCGAAG AACCTGGATA TTCCGGGCAG CTTGACGGTT 
CCGCGCGCCG AGCGTGCCGG CGCGGCCCCC GGCCCGTTGC CGGCGGAGAA CCGGATGGAG
GGCCTGACGA TCGGCATCGA CGTTGGCGGG ACGAAGGTCG CCGCCGGCGT CGTGGACGGT
GCGGGGACGA TCATCACTTC CCTGCGTCGG CCCACCCCGG GCCATTCGGC CGCCGAGGTC
GCGGACACCA TCGCCAGCGT CGTCGCGGAG CTCAGTGCCG ACCACGCCGT GCGCGCGGTC
GGCATCGGCG CGGCCGGGTG GGTCGACTCG GACCGGTCCC GCGTCCTGTT CGCACCGAAC
CTCGCCTGGC GCGACGAACC CCTGCGCGAC GAGGTCGGGG GGCGCATCGG CCTGCCCGTC
GTCGTGGAGA ACGACGCCAA CGCGATGGCC TGGGCGGAGT ACCGTTTCGG GGCCGGCCGT
GGCCGGCGTG ACCTCGTCTG CCTGACGGTG GGAACCGGCA TCGGCAGCGG CATCGTCCTG
GGCGGTGAGC TCTACCGGGG CGCGTCCGGT ATCGGCGCCG AGATGGGTCA CATGCGGGTG
GTACCCGACG GGTATCCGTG CGGTTGTGGT AACAGAGGGT GTTGGGAACA GTATGCGAGC
GGGCGAGCGC TGGTCCGGCT GGCGAAGAAC ATCGCCACCG TGGATCCGAG TGCGGCCGTG
CCCATGCTGG AGCATTGCGG CGGTGGCGTC GACGCGCTGA CCGGCCCGGA CGTCACCGAG
GCGGCGCGCA AGGGGGACCC GGCGGCGATC AGGTGCTTCA CCGAGATCGG CCACTGGCTC
GGCGAGGGCA TGGCGATGCT GGTCGCCGCG CTCGACCCGA ACCGCTTCGT CATCGGCGGC
GGCGTCTCCG ACGCCGGCGA GCTGCTGCTC GGCCCGGCCC GGCAGAGCCT CCTGGCCGCT
ATGCCCGGGC GGGATTACCG TTCCGAGCCG GACATCGTCA TCGCCGAGCT CGGATCCCAA
GCGGGCCTCG TAGGCGCGGC CGACCTCGCC CGGTTCTGA
 
Protein sequence
MSIATPDSSK NLDIPGSLTV PRAERAGAAP GPLPAENRME GLTIGIDVGG TKVAAGVVDG 
AGTIITSLRR PTPGHSAAEV ADTIASVVAE LSADHAVRAV GIGAAGWVDS DRSRVLFAPN
LAWRDEPLRD EVGGRIGLPV VVENDANAMA WAEYRFGAGR GRRDLVCLTV GTGIGSGIVL
GGELYRGASG IGAEMGHMRV VPDGYPCGCG NRGCWEQYAS GRALVRLAKN IATVDPSAAV
PMLEHCGGGV DALTGPDVTE AARKGDPAAI RCFTEIGHWL GEGMAMLVAA LDPNRFVIGG
GVSDAGELLL GPARQSLLAA MPGRDYRSEP DIVIAELGSQ AGLVGAADLA RF
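
As a cross-check on the two sequences above, the sketch below translates a DNA string and computes its GC fraction in plain Python. It assumes that translation table 11 uses the standard codon assignments and differs only in which codons may act as starts, so that a start codon such as GTG is read as M at the first position. The start-codon set shown is a commonly cited subset, the function names are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used as example input.

```python
BASES = "TCAG"
# Standard codon assignments in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the alternative start codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start as M and stopping at '*'."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate_cds("GTGAGCATCG CTACTCCTGA TAGTTCGAAG"))  # MSIATPDSSK
print(translate_cds("CGGTTCTGA"))                          # RF
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would let the complete protein sequence and the GC content field be checked in the same way.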