Gene Franean1_2073 details


Gene Information

Locus tag: Franean1_2073
Symbol:
ID: 5670474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2496783
End bp: 2498315
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 70%
IMG OID: 641240995
Product: glucose-6-phosphate 1-dehydrogenase
Protein accession: YP_001506416
Protein GI: 158313908
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0364] Glucose-6-phosphate 1-dehydrogenase
TIGRFAM ID: [TIGR00871] glucose-6-phosphate 1-dehydrogenase
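
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2496783, 2498315)
print(length, protein_length(length))  # 1533 510
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.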


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.398009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0869756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGTCCA CCGCTAGTGG CAACCCACTG CGGGATCCAC GGGACCGGAG GCTGCCGCGG 
CTGCCGGACA ACAGCGCGCT GGTGGTTTTC GGCGCGACCG GTGACCTGGC CCGCAAGAAG
CTCATACCGG CCGTCTACGA TCTCGCGAAC CGGGGGCTGC TCCCGCCGGG CTTCGTCCTG
GTCGGCTTCG CCCGCCGGGA CTGGAGCGAC GACGAGTTCG CCGCCTTCGC CCGCGAGTCG
GCGGAACGCG GCGCCCGGAC ACCGTTCCGC GAGGACACCT GGGAGCGGCT CGCCGGCTCC
CTGCGCTTCT GCCAGGGCTC GTTCTCCGAC GACGCCGCGT TCGACGGCCT CGCCGCGACG
CTGACCGACC TGGAGCACAG CCACGGGATC CGGGGCAACG CGGCCTTCTA CCTGTCGATC
CCGCCGACGG CTTTTCCCGT CGTCCTCAAG CAGATGCAGC GGACGGGCCT GGCCTCCAAC
GAGGCCGCGG GCGGCTGGCG CCGGGTCGTC ATCGAGAAGC CTTTCGGGCA CGACCTGGAG
TCCGCGCGCG AACTGAACTC GCTCGTCGAC GACGTCTTCA CCCCCGACGA CGTCTTCCGC
ATCGACCACT ACCTGGGCAA GGAGACGGTT CAGAACCTCT TCGCCCTGCG GTTCGCCAAC
ACCCTTTTCG AACCGATCTG GAACTCCCAC TTCGTCGACT CGGTGCAGAT CACCATGGCC
GAGGACGTCG GCATCGGCAC CCGGGCCGGC TTCTACGACG AGACCGGCGC GGCCCGCGAC
GTCCTGCAGA ACCACCTGCT GCAACTGCTG GCGCTCACCG CGATGGAGGA GCCGGTCAGC
TTCGGCGCGG ACACCATCCG CACCGAGAAG CTGAAGGTCC TGCGGGCGGT GTCGCTGCCC
GACGACTACG CCACGTTCGC CGTGCGCGGG CAGTACTCGC AGGGTTGGCT CGCCGGCGAG
CGGGTCCGCG GCTACCTTGA GGAAGCCGAC ATCCCGGCGG ACTCGACGAC GGAGACCTTC
GTCGCGGTGA AGCTCGGCGT TGAGACGCGA CGCTGGGCCG GCGTGCCGTT CTACCTGCGG
ACGGGCAAGC GCCTGCCCCG GCGGGTCACC GAGATCGCGA TCACCTTCAC CAAGGCGCCG
CACCTGCCGT TCGACGAGAC CGACACCGCC GAGCTGGGCA ACAACCAGCT GGTGATCCGG
GTGCAGCCGG ACGAGGGCGT CACGCTGCGC TTCGGCTCGA AGGTCCCCGG CTCGGCCATG
GAGGTGCGCG ACGTCGCCAT GGACTTCCTG TTCGGCGAGG CGTTCACCGA GGCGCTGCCC
GAGGCCTACG AGCGCCTGAT CCTGGACGTC CTGCTCGGCG ACGCGACGCT GTTCCCGAAC
AACGCCGAGG TCGAGGAGTC GTGGCGGATC ATCGACCCGC TCGAGGAGTT CTGGCGGAGC
ACGAAGCCGC ACACGTACCG GGCCGGAAGC TGGGGCCCGG CGGCGGCCGA CGACATGCTC
GCCCGCGACG GCCGCCGGTG GCGCCGGCCA TGA
 
Protein sequence
MASTASGNPL RDPRDRRLPR LPDNSALVVF GATGDLARKK LIPAVYDLAN RGLLPPGFVL 
VGFARRDWSD DEFAAFARES AERGARTPFR EDTWERLAGS LRFCQGSFSD DAAFDGLAAT
LTDLEHSHGI RGNAAFYLSI PPTAFPVVLK QMQRTGLASN EAAGGWRRVV IEKPFGHDLE
SARELNSLVD DVFTPDDVFR IDHYLGKETV QNLFALRFAN TLFEPIWNSH FVDSVQITMA
EDVGIGTRAG FYDETGAARD VLQNHLLQLL ALTAMEEPVS FGADTIRTEK LKVLRAVSLP
DDYATFAVRG QYSQGWLAGE RVRGYLEEAD IPADSTTETF VAVKLGVETR RWAGVPFYLR
TGKRLPRRVT EIAITFTKAP HLPFDETDTA ELGNNQLVIR VQPDEGVTLR FGSKVPGSAM
EVRDVAMDFL FGEAFTEALP EAYERLILDV LLGDATLFPN NAEVEESWRI IDPLEEFWRS
TKPHTYRAGS WGPAAADDML ARDGRRWRRP
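
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is an illustration rather than the database's own method: it assumes translation table 11 shares the standard codon-to-amino-acid assignments, and that an initiating GTG or TTG is read as methionine (the `START_CODONS` set is an assumed, deliberately short list). It translates only the first and last rows of the gene sequence shown above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: common bacterial start codons, translated as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


first_row = "GTGGCGTCCA CCGCTAGTGG CAACCCACTG CGGGATCCAC GGGACCGGAG GCTGCCGCGG"
last_row = "GCCCGCGACG GCCGCCGGTG GCGCCGGCCA TGA"

print(translate(first_row))                 # MASTASGNPLRDPRDRRLPR
print(translate(last_row, is_start=False))  # ARDGRRWRRP
```

The two outputs match the first 20 and last 10 residues of the protein sequence above. The last row can be translated on its own because the 1500 bp that precede it are a multiple of three, so it starts in frame.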