Gene Franean1_4672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4672 
Symbol 
ID5673014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5579596 
End bp5581635 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content74% 
IMG OID641243529 
Productshort chain dehydrogenase 
Protein accessionYP_001508945 
Protein GI158316437 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174921 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.873986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGA ACCAGACCGT CGCCGAGCTC CTCGCCCGGT CGAACCGGCT CGGCGCCGAC 
CCGCGCAACA CCAACTACGC GGGCGGCAAC ACCTCCGCCA AGGGCGTCGA GATCGATCCC
GTCACGGGCC GCGACGTCGA GCTGCTGTGG GTCAAGGGGT CGGGCGGCGA CCTCGGCACG
CTGACCGAGC CCGGCCTCGC CGTCCTCCGG CTGGACCGGC TGCGCGCGCT CGTGGACGTC
TACCCCGGCG AGGGCCGCGA GGACGAGATG GTCGCCGCCT TCGACTTCTG CCTGCACGGC
CGGGGCGGTG CCGCGCCCTC GATCGACACC GCCATGCACG GGCTCGTCGA CGCCTCGCAC
GTCGACCACC TGCACCCGGA CAGCGGCATC GCCCTCGCGA CCGCCGCCGA CGGCGAACAG
CTCACGAAGG ACGTCTTCGG CACGAAGGTC GCGTGGGTGC CGTGGCGGCG GCCGGGCTTC
CAGCTCGGCC TCGACATCGC GGCCGTGCGG CGGGACAACC CCGGCGCCGT CGGCTGCATC
CTCGGCGGGC ACGGCATCAC CGCGTGGGGC GACACGAGCG AGGAGTCCGA GCGCAACTCC
ACCTGGATCA TCGAGCAGGC GCGGCTCCAC CTCGAGCGGC ACGGGCGGCC GCGGCCCTTC
GGCGCCGTGG CCGACGGCCG GCGCGCGCTG CCAGCCGGGC AACGGCGGGC CAGGGCCGCC
GTGCTCGCGC CCCACCTGCG TGCGGTCGCG TCCCGCGACC ACCGCATGGT CGGCCACTTC
ACCGACAGCG ACGCCGTCCT GGAGTTCCTG GCCAGCGAGA AGCTGTTCGC CCTGGCCCGG
CTCGGGACGT CCTGCCCCGA CCACTTCCTG CGGACGAAGG TCCGCCCGCT GGTGCTCGAC
CTGCCGGCCG CGGCGTCACC CGAGGAGTGC GTCCGGCGGC TCGCGGAGCT GCACGAGGCC
TACCGCGCCG ACTACCGCGC GTACTACGAG CGGCATGCCA CCGCCGACTC GCCGCCGATG
CGCGGCGCGG ACCCGGCGAT CATCCTCGTG CCGGGCGTGG GCATGTTCTC CTACGGCCGG
GACAAACAGA CGGCACGGGT GGCCGGCGAG TTCTACGTCA ACGCGATCAA CGTGATGCGC
GGTGCCGAGG CGGTCTCGTC CTACGCGCCG ATCCCGGAGC GCGAGAAGTT CCGGATCGAG
TACTGGGCGT TGGAGGAGGC GAAGCTGCGC CGCCTACCGC CGCCCAAGCG GCACGCGGGC
CGGGTCGCGC TGGTCACCGG CGCGGCCAGC GGCATCGGCC GGGCCACCGC GTCCCGGCTC
GCCGCCGACG GCGCCTGTGT GGTCGTCGCC GACCTCGACG CGACCAGCGC CGTGTCGGCG
GCGGGCGAGC TCGGCGGCGC CGACGTCGCG GTCGGCGTCG GCGCCGACGT GACGAAGGAG
GCCGAGGTCG CGGCCGCCGT CGCGGCGGCA CTGCTCGCCT TCGGCGGGAT CGACCTCGTC
GTCAACAACG CGGGCCTGTC CATCTCCAAA CCGCTGCTGG AGACCACCGA GCGGGACTGG
GACCTGCAGC ACGACGTCAT GGCGAAGGGA AGCTTCCTCG TCGCGCGGGC CGCGGCGCGA
GCCATGATCG ACCAACGGCT CGGCGGTGAC ATCGTCTACA TCGTCTCCAA GAACGCGCTG
TTCGCCGGGC CGAACAACGT CGCCTACGGC GCCGCGAAGG CCGACCAGGC GCACCAGGTC
AGGCTGCTGG CGGCCGAGCT CGGCGAGCAC GGGATCCGGG TCAACGGCGT CAACCCGGAT
GGCGTCGTGC GTGGTAGCGG CATCTTCGCC GGCGGGTGGG GAGCCCAGCG TGCGGCGGTC
TACGGCATCC CCGAGGAGGA GCTCGGCGCC TTCTACGCCC GGCGGACGCT GCTCGGCCGG
GAGGTGCTGC CCGAGCACGT GGCCAACGCG GTCGCGGCCG TGTGCTCGAC CGAGCTCAGC
CACACGACCG GCCTGCTCGT CCCCGTCGAC GCCGGCGTCG CGGCCGCCTT CCTGCGCTGA
 
Protein sequence
MSGNQTVAEL LARSNRLGAD PRNTNYAGGN TSAKGVEIDP VTGRDVELLW VKGSGGDLGT 
LTEPGLAVLR LDRLRALVDV YPGEGREDEM VAAFDFCLHG RGGAAPSIDT AMHGLVDASH
VDHLHPDSGI ALATAADGEQ LTKDVFGTKV AWVPWRRPGF QLGLDIAAVR RDNPGAVGCI
LGGHGITAWG DTSEESERNS TWIIEQARLH LERHGRPRPF GAVADGRRAL PAGQRRARAA
VLAPHLRAVA SRDHRMVGHF TDSDAVLEFL ASEKLFALAR LGTSCPDHFL RTKVRPLVLD
LPAAASPEEC VRRLAELHEA YRADYRAYYE RHATADSPPM RGADPAIILV PGVGMFSYGR
DKQTARVAGE FYVNAINVMR GAEAVSSYAP IPEREKFRIE YWALEEAKLR RLPPPKRHAG
RVALVTGAAS GIGRATASRL AADGACVVVA DLDATSAVSA AGELGGADVA VGVGADVTKE
AEVAAAVAAA LLAFGGIDLV VNNAGLSISK PLLETTERDW DLQHDVMAKG SFLVARAAAR
AMIDQRLGGD IVYIVSKNAL FAGPNNVAYG AAKADQAHQV RLLAAELGEH GIRVNGVNPD
GVVRGSGIFA GGWGAQRAAV YGIPEEELGA FYARRTLLGR EVLPEHVANA VAAVCSTELS
HTTGLLVPVD AGVAAAFLR