Gene Franean1_1556 details

Gene Information

Locus tag: Franean1_1556
Symbol:
ID: 5669959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1863560
End bp: 1864609
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 70%
IMG OID: 641240475
Product: aldo/keto reductase
Protein accession: YP_001505901
Protein GI: 158313393
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
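
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1863560
end_bp = 1864609

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1050
print(protein_length)   # 349
```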


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.206898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.568157
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTACC GCAAGTTCGG GCGGACGGGC ATCGAGGTCA GCCGGCAGTG CCTGGGATCC 
ATGATGTTCG GTGTGATCGG CAACCCCGAC CACGCCGCGT GCGAGCGCAT CATCGCCCGG
GCGCTGGACG CCGGGATCAA CTTCATCGAC ACCGCCGACA TCTACTCGCG GGGTGAGAGC
GAGCAGATCG TCGGGAAGGC CATCAAGGGC CGCCGCGACG ACATCGTCCT GGCCACCAAG
TGCTTCAACC CGATGGGCGG GGACCGCAAC CGCCGCGGCG CCTCCCGCCG GTGGATCACG
CGCGCCGTCG AGGACAGCCT CCGCCGCCTC GACACCGACT ACATCGACCT CTTCCAGATC
CACCGGCATG ACTGGAACAC CGACCTGGAA GAGACCCTCG GCGCGCTGAA CGACCTCGTG
CACCAGGGCA AGATCCGTTA CCTGGGCTCG TCGACGTTTC CCGCGGACTG GATCGTCGAG
GCGCAGTGGG CCGCCCGCCG CCGCAACACC GAGCGTTTCG TCTGCGAGCA GCCCCAGTAC
TCGATCTTCG CCCGCTCGGT CGAGCAGGCC GTCCTGCCCG CCTGCCGGCG GCACGACATC
GCGGTGATTC CCTGGAGCCC GCTCTCCGGC GGCTGGCTGA CCGGGAAGTA CCGGCGCGGG
CAGGCGGTGC CGGCCGACGC CCGGTACGCG GCCGGCAATG TCATGGCACA GGGGCGGGCC
GTCGGGGAGA GCCCCGAGTC GCAGGCCCGG TTCGACGCGG TCGAGCAGCT GTCCGCGGTG
GCGGCGGAGG CCGGCCTCTC CCTCACCCAT CTTGCGCTCG GGTTCGTGGA GAGCCATCCG
GCGATCACTT CGACGATCAT CGGCCCGCGC ACGATGGAGC AGCTCGAGGA CGTCCTCAGC
GGGGCCGACG TCGTGCTCGA CGCGGCGACG CTCGACGCGA TCGACAAGAT CGTGGAGCCC
GGGACCGATT TCGTCGGCGT CCGGCACATG ACCGGTGACC CGTCCCTGCT GCCCGAGACG
CGCCGGCGGC TGGCAACGCA GTTCGGCTGA
 
Protein sequence
MRYRKFGRTG IEVSRQCLGS MMFGVIGNPD HAACERIIAR ALDAGINFID TADIYSRGES 
EQIVGKAIKG RRDDIVLATK CFNPMGGDRN RRGASRRWIT RAVEDSLRRL DTDYIDLFQI
HRHDWNTDLE ETLGALNDLV HQGKIRYLGS STFPADWIVE AQWAARRRNT ERFVCEQPQY
SIFARSVEQA VLPACRRHDI AVIPWSPLSG GWLTGKYRRG QAVPADARYA AGNVMAQGRA
VGESPESQAR FDAVEQLSAV AAEAGLSLTH LALGFVESHP AITSTIIGPR TMEQLEDVLS
GADVVLDAAT LDAIDKIVEP GTDFVGVRHM TGDPSLLPET RRRLATQFG
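
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 30 bases of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The function name `translate` and the constants are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last 30 bases of the gene sequence, copied from the record.
first_bases = "ATGCGGTACC GCAAGTTCGG GCGGACGGGC"
last_bases = "CGCCGGCGGC TGGCAACGCA GTTCGGCTGA"

print(translate(first_bases))  # MRYRKFGRTG
print(translate(last_bases))   # RRRLATQFG*
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final nine residues followed by the stop codon.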