Gene Franean1_7068 details

Gene Information

Locus tag: Franean1_7068
Symbol:
ID: 5675378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8625120
End bp: 8626598
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 60%
IMG OID: 641245913
Product: glucose-6-phosphate 1-dehydrogenase
Protein accession: YP_001511304
Protein GI: 158318796
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0364] Glucose-6-phosphate 1-dehydrogenase
TIGRFAM ID: [TIGR00871] glucose-6-phosphate 1-dehydrogenase
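
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Neither convention is stated in this record, and the variable names are made up for the example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 8625120
end_bp = 8626598

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1479 492
```

Under those assumptions the result agrees with the listed gene length of 1479 bp and protein length of 492 aa.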


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCAGC TTATGTCGGT TGCCCCCTGC GAGATAGTAG TCTTCGGCGG CACAGGGGAT 
CTGGCGATGC GCAAACTGAT GCCAGCGCTG TACCACCGCG ACCGAGACGG GCAACTCACC
CCCGACTCGC GGGTCATTGC CGTGTCCCGG GCCGGACTCG ATGATGCCGG ATACCGCGAC
AAAGTTGATT CTGAACTGCG CCGGTTCGTC CCGGAGCTAA TACACGAGCC GGATGTCCTA
GCTCGCTTTA TCCAACGCCT GCACCACATC ACGGTCGACG TGGCGGAGCA CAGCACCTGG
GATGAACTGC GGACACTTCT GGCGGACGGA AAAGACCATG TGCGCGTCTT CTACCTTGCC
TGTGCCCCAC AACTGTTCGG GCCAACCTGT GTAGGCTTGC AAACCAACGG ATTAGTGACC
GATAACTCCA GGGTGGTCCT CGAGAAGCCA CTTGGGCATG ATCTGGTCTC CGCGCGCCGG
ATTAACGATG AAGTCGGTGC GGTTTTTGCT GAGGAACAGA TATTCCGTAT CGACCACTAC
CTCGGTAAAG AAACCGTCCA GAACCTCCTT GTGCTACGCT TCGCCAACTC CCTTCTTGAG
CCGCTGTGGA ATTCTGGTGC GATCGACCAT GTACAGATTA CGGTCGCAGA AACAATTGGC
ATCGGAGGGC GCGGCGACTA TTACGATGGC TCGGGTGCCA TACGTGACAT GGTACAGAAT
CACCTTCTGC AACTGCTCTG TCTGGTGGCG ATGGAACCAC CCAGCCGACT TGACCGCGAG
GCGGTCCGCG ACGAGAAGCT AAAGATTCTC CAGGCCCTAT CCCCACTGAC ACTCGGTGAC
GTGGAACGGT GCGTCGTCCG AGGACAGTAC ACGGCCGGAC TCGTAGAGGG AGCTCCAGTT
CCTTCTTACC AGGACGAAGT TGATGGGAGC ACGAGTACCA CCGAGACGTT TGTGGCGCTG
AAGGTCGAGG TCCAGAACTG GCGCTGGTCC GGGGTACCCT TCTACCTGCG TACCGGCAAG
AGGCTAGATC GTCATGCATC GGAGATCGTG GTTCAGTTCC GACCGGTACC ACATTCAATC
TTCCCGGGTA TCAAGGATGC GATTTCTCCT AACGCTCTTG TGCTGCGGCT GCAACCTGAC
GAGGGCGTAC GCCTCCACTT GATGGCCAAA GAACCGGGCC CGGGCGGCGT GCGACTGCGT
CCAGTCCACC TCAACCTCAG CTTCGCCGAG ACTTTCAAGT CACGCCTGCC CGATGCCTAC
GAGCGGCTAC TCATGGACGT CGTTCGTGGC AACCCGACGT TGTTCATGCG ACGCGACGAG
GTCGAGGCCG CATGGGCGTG GGTGGAGCCC ATCCTGGCCG CTCTGGCGGC GTCAATTGAC
TCACCGAGGC GCTACGCAGC TGGCACTGGT GGACCTGCTG CAGCGATCGC ATTGATCGAA
CGTGACGGCC GCACGTGGCA TGAGGAGGTA GCAGAATGA
 
Protein sequence
MPQLMSVAPC EIVVFGGTGD LAMRKLMPAL YHRDRDGQLT PDSRVIAVSR AGLDDAGYRD 
KVDSELRRFV PELIHEPDVL ARFIQRLHHI TVDVAEHSTW DELRTLLADG KDHVRVFYLA
CAPQLFGPTC VGLQTNGLVT DNSRVVLEKP LGHDLVSARR INDEVGAVFA EEQIFRIDHY
LGKETVQNLL VLRFANSLLE PLWNSGAIDH VQITVAETIG IGGRGDYYDG SGAIRDMVQN
HLLQLLCLVA MEPPSRLDRE AVRDEKLKIL QALSPLTLGD VERCVVRGQY TAGLVEGAPV
PSYQDEVDGS TSTTETFVAL KVEVQNWRWS GVPFYLRTGK RLDRHASEIV VQFRPVPHSI
FPGIKDAISP NALVLRLQPD EGVRLHLMAK EPGPGGVRLR PVHLNLSFAE TFKSRLPDAY
ERLLMDVVRG NPTLFMRRDE VEAAWAWVEP ILAALAASID SPRRYAAGTG GPAAAIALIE
RDGRTWHEEV AE
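
As a rough consistency check between the two sequences, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own procedure: `translate` is a hypothetical helper, and the codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in amino acids).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


# First 21 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCCGCAGC TTATGTCGGT T"))  # MPQLMSV
print(translate("GCAGAATGA"))                # AE*
```

The output matches the start (MPQLMSV) and end (AE, followed by the stop codon) of the protein sequence listed above.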