Gene Francci3_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1647 
Symbol 
ID3905926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1980051 
End bp1981583 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content70% 
IMG OID637878985 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_480752 
Protein GI86740352 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0364] Glucose-6-phosphate 1-dehydrogenase 
TIGRFAM ID[TIGR00871] glucose-6-phosphate 1-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.336303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCCCA CGGTGAGTAC CAACCCGCTG CGGGACCCAC GTGACCGCAG ACTGCCGCGC 
CTGCCGGACG CGAGCGCCCT GGTGGTGTTC GGCGCTACCG GCGACCTGGC CCGCAAGAAG
CTGATCCCCG CCGTCTACGA CCTGGCCCAC CGCGGCCTGC TGCCGCCGGG TTTCGTCCTG
CTCGGGTTCG CCCGCCGGGA CTGGCCCGAC GAGGACTTCG CCGAGTTCGC CCGGCAGGCC
GCCGAGAAGG GCGCCCGCAC CCCCTTCCGT GCAGAGGTCT GGGACCGGCT GGCGGGCTCC
GTGCGGTTCC TGCCAGGGTC CTTCGACGAC GACGCCGCGT TCGACCGGCT CGCCCGCACG
CTGGAGAGCC TGGAGCACTC CCACGGTATC CGTGGCAACG CGGCGTTCTA CCTGTCGATC
CCGCCGTCGG CCTTCCCGGT CGTGCTCAAG CAGATGCAGC GCACCGGGCT CTCCTCGGCC
GCGGGTTCGG GCGGCTGGCG CCGGGTCGTC GTCGAGAAAC CGTTCGGTCA CGACCTGGAG
TCGGCCCGGC AGCTCAACGC GCTCGTCGAC GACGTCTTCA CCCCGTCCGG GGTGTTCCGC
ATCGACCACT ACCTGGGCAA GGAGACGGTC CAGAACCTCT TCGCGCTGCG CTTCGCCAAC
ACGCTGTTCG AACCGATCTG GAACTCCCAG TTCGTCGATT CGGTGCAGAT CACCATGGCC
GAGGACGTCG GGATCGGCAC CCGGGCCGGC TTCTACGACG AGACGGGCGC CGCTCGGGAC
GTGCTCCAGA ACCACCTGTT GCAGCTGCTC GCCCTGACCG CGATGGAGGA GCCGGTCAGC
TTCGGCGCGG AGACCATCCG CACCGAGAAG CTGAAGGTGC TCCGCGCGGT GTCGCTGCCC
ATGGACCTCA CTCGCTACGC GGTAAGGGGG CAGTACGAGC AGGGCTGGCT CGCCGGGGAG
CGGGTCCCGG GCTACCTCGA CGAACAGGAC ATCCCTGCGC AGTCGCGGAC GGAGACCTTC
TCGGCGGTGC GCCTCGGCAT CGAGACGCGC CGGTGGGCCG GGGTGCCGTT CTACCTGCGG
ACCGGCAAGC GGCTGCCACG GCGGGTCACC GAGGTCGCCA TCTTCTTCAA GAAGGCGCCG
CACCTGCCAT TCGACGAGAC CGCCACCACC GAGCTCGGCA ACAACCAGCT GGTCATCCGG
GTGCAGCCCG ACGAGGGGGT CACGCTCAAG TTCGGCTCCA AGGTCCCCGG CTCGGCGATG
GAGGTCCGGG ACGTCGCGAT GGACTTCCTG TTCGGTGAGG CGTTCACCGA GGCGCTGCCG
GAGGCCTACG AACGGCTGAT CCTCGACGTG CTGCTCGGCG ACGCGACGCT GTTCCCGAAC
AACGCGGAGG TCGAGGAGTC CTGGCGGATC GTCGATCCGC TGGAGCGGCA CTGGGCGGGC
ACCACCCCGC ACCGCTACCG GGCCGGCACC TGGGGTCCGG CTGCCGCCGA CGAGATGCTC
GCCCACGACG GTCGCCGGTG GCGGCGGCCA TGA
 
Protein sequence
MAPTVSTNPL RDPRDRRLPR LPDASALVVF GATGDLARKK LIPAVYDLAH RGLLPPGFVL 
LGFARRDWPD EDFAEFARQA AEKGARTPFR AEVWDRLAGS VRFLPGSFDD DAAFDRLART
LESLEHSHGI RGNAAFYLSI PPSAFPVVLK QMQRTGLSSA AGSGGWRRVV VEKPFGHDLE
SARQLNALVD DVFTPSGVFR IDHYLGKETV QNLFALRFAN TLFEPIWNSQ FVDSVQITMA
EDVGIGTRAG FYDETGAARD VLQNHLLQLL ALTAMEEPVS FGAETIRTEK LKVLRAVSLP
MDLTRYAVRG QYEQGWLAGE RVPGYLDEQD IPAQSRTETF SAVRLGIETR RWAGVPFYLR
TGKRLPRRVT EVAIFFKKAP HLPFDETATT ELGNNQLVIR VQPDEGVTLK FGSKVPGSAM
EVRDVAMDFL FGEAFTEALP EAYERLILDV LLGDATLFPN NAEVEESWRI VDPLERHWAG
TTPHRYRAGT WGPAAADEML AHDGRRWRRP