Gene Franean1_0088 details


Gene Information

Locus tag: Franean1_0088
Symbol:
ID: 5668513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 104451
End bp: 105926
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 69%
IMG OID: 641239016
Product: aldo/keto reductase
Protein accession: YP_001504461
Protein GI: 158311953
COG category: [C] Energy production and conversion; [K] Transcription
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases); [COG0789] Predicted transcriptional regulators
TIGRFAM ID:
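
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS ends with a stop codon, which encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(104451, 105926)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```

Both values agree with the Gene Length and Protein Length fields in this record.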


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGG TCGACGAGCT CAGCGAGGAC GGTTCTCGGG CGCCGGATGG GCCTACGTCG 
GGCCGCCGGA AATCCGGGGC GAGACGGCCC TCTACCGGCG TCGCCCGGTC GGCGGACAGC
GGCCTCACCA TCGCTGAGGC GGCGACGGCG AGCGGCGTGA GCGCGCACAC TCTGCGGTAC
TACGAGCGGG TGGGCCTCAT GCTCGAACGG GTGGCGCGGG CTCCGTCGAG CCATCGCCGC
TACGACGACG AGGACCTGCG CTGGATCGCG ACGCTCACCG CGCTGCGCCA TACCGGCATG
CCGATCCGCC AGATCGCGCG GTACGCGGCG CTTGTACGGG CCGGCCACGG CAACGAGACC
GAACGGCTCG AGCTGCTCAC CGCCCACAAG GAGCGCGTCA CCGAGCGTCT CGATGAGGTT
CGCCGCCATC TCACAGCCAT CGACACCAAG ATTGATATCT ATCGGGAGAG GACCAGACGA
CCGATGCCCC ACGACACATT GCGCCAGGTA CCGCTCGGTT CCCAGGGACT CCGGGTCAGC
GTGCAGGGCC TCGGCTGCAT GGGCATGTCC GACTTCTACG GCGCGACCGA CGACAACGAA
TCGATCGCCA CCATTCAGCG GGCGCTCGGC CTGGGCGTGA CCTTCCTCGA CACCGCCGAC
ATGTACGGGC CGTTCACGAA CGAGCGGCTG GTCGGGCGGG CCATCTCCGG CCGGCGCGGC
GAGGTGACGC TCGCGACCAA GTTCGGGATC GTCCGCGACC CGGACAACCC CCAGGCCAGG
AACATCAACG GCCGGCCGGA GTACGTCCGC TCAGCCTGCG ACGCGTCCCT GTCCCGCCTC
GGGGTCGACC ACATCGATCT CTACTACCAG CATCGGGTCG ACCCGACTGT GCCGATCGAG
GACACCGTCG GCGCCATGGC CGAGCTGGTC ACCGCAGGGA AGGTGCGCTA CCTCGGCCTC
TCCGAGGCGT CGCCGGCCAC GATCCGCCGG GCGCACGCCG TGCATCCCAT CTCCGCGCTG
CAGACCGAGT ACTCGATCTG GTCACGTCAC CCGGAGGAGG AGATCCTCCC GACGCTGCGC
GAACTCGGCA TCGGCTTCGT TGCCTACAGC CCGCTGGGGC GGGGGTTCCT GACCGGAACC
TTCCGCACCC CGAACGACTT CGAGGCCGGC GACTTCCGCG CCAGCATGCC CAGGATGAAC
TCCGAGAACC TGGACGCCAA CCTCTCGGTC GTCGCCCAGA TCGAGGAGAT CGCGGCGGCG
CGAAACGCGA CACCCGCGCA GGTGGCACTC GCCTGGGTGC ACCACCAGGG CGACGACATC
GTCCCGATCC CGGGGACGAA GCGACGCCAC TACCTGGAGC AGAACGTCGC CGCCGTCGGC
CTCGCCCTGA CGCCCGACGA GGTGGAAATC CTGACGAAGG CTGGCGAGAC CGTGCGGGGC
GCGCGCTATC CGGACATGTC CAACGTCAAC CTTTGA
 
Protein sequence
MTAVDELSED GSRAPDGPTS GRRKSGARRP STGVARSADS GLTIAEAATA SGVSAHTLRY 
YERVGLMLER VARAPSSHRR YDDEDLRWIA TLTALRHTGM PIRQIARYAA LVRAGHGNET
ERLELLTAHK ERVTERLDEV RRHLTAIDTK IDIYRERTRR PMPHDTLRQV PLGSQGLRVS
VQGLGCMGMS DFYGATDDNE SIATIQRALG LGVTFLDTAD MYGPFTNERL VGRAISGRRG
EVTLATKFGI VRDPDNPQAR NINGRPEYVR SACDASLSRL GVDHIDLYYQ HRVDPTVPIE
DTVGAMAELV TAGKVRYLGL SEASPATIRR AHAVHPISAL QTEYSIWSRH PEEEILPTLR
ELGIGFVAYS PLGRGFLTGT FRTPNDFEAG DFRASMPRMN SENLDANLSV VAQIEEIAAA
RNATPAQVAL AWVHHQGDDI VPIPGTKRRH YLEQNVAAVG LALTPDEVEI LTKAGETVRG
ARYPDMSNVN L