Gene Franean1_6841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6841 
Symbol 
ID5675154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8341658 
End bp8342734 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content72% 
IMG OID641245690 
Productalcohol dehydrogenase 
Protein accessionYP_001511081 
Protein GI158318573 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGTCG CTGTCGCGTA CGAGGCCGAG AAGCCGCTGG TCGTCGAGGA TCTCGACCAG 
CCCGCCGTGG GCCCGCGGGA TGTGCTCGTG CGCATCGCCG CGAGCGGTAT CTGCCATACC
GACCTCCACG TCATCAACGG GCACTCGCCG CTGCCGCTGC CGATCGTTCC CGGTCACGAG
GCGTGCGGTG TGGTCGAGGA GGTGGGCTCC GAGGTCCGCC GGGTGAAGGT GGGCCAGCGG
GTGCTCGCCG CGGTGTCGCC GGCCTGCGGC ACCTGCTGGT GGTGCGTCAA CGGCATGTCC
AACCACTGCG AGCTCGGCGG GCCGGTCAAG GCGGCGCCGC GGTTCACGCT GGCCGACGGC
CGCACCGCGG CCGCCGTGTG CGGCTGCGGC ACGTTCGCCG AGGCGATGGT GGTCGACGAG
GCCTCGGTCG TGCCGACCAA CACCGACCTG GCCGACGAGG AGCTGGCGCT GCTCGGCTGC
GGGGTGACCA CCGGGCTCGG CGCGGCGCTC ATCACCGCCG GCGTGACCCC CGGCTCGTCC
GTCGCGGTGA TCGGCTGTGG CGGGGTCGGG CAGTCGGTGA TCCAGGGCGC GCGTATCTCC
GGCGCGGCCA CCATCATCGG CATCGACCTC GTGCCCGCCC GCCGCGAGGC CAGCCTGCGG
GTCGGCGCGA CCCACGTGGT CGACCCGGCG GAGGCCGACC CGGTCGAGCA GGTGCGCGCG
CTGACCGAGG GGCGCGGTGT GGACTTCAGC TTCGAGGTCG TCGGCCTGCC CAACCTGATG
GTGCAGGCCT TCGACATGGC ACGTAAACAG GGAGCGGTCA CGCTGGTCGG CATGCCGACC
ACGACCGCGA CCCTGACCCT GCCGGCCATC TCGGCGATCT TCTCCGGCAA GCGCCTCGCC
GGTTCCGTGG TGGGCGGCGC CCAGATCCTG CGTGACATGC CCAGGTTCAT CCGGCTGGCC
GAGACCGGCC AGCTCGACCT CGGCGGCATG GTGTCCAACC GGATCCGGCT GGACGACATC
AACGAGGGCA TCGCGCTGCT CGACCGCGCC GAGGGCACCC GGACCGTGAT CATCTAA
 
Protein sequence
MRVAVAYEAE KPLVVEDLDQ PAVGPRDVLV RIAASGICHT DLHVINGHSP LPLPIVPGHE 
ACGVVEEVGS EVRRVKVGQR VLAAVSPACG TCWWCVNGMS NHCELGGPVK AAPRFTLADG
RTAAAVCGCG TFAEAMVVDE ASVVPTNTDL ADEELALLGC GVTTGLGAAL ITAGVTPGSS
VAVIGCGGVG QSVIQGARIS GAATIIGIDL VPARREASLR VGATHVVDPA EADPVEQVRA
LTEGRGVDFS FEVVGLPNLM VQAFDMARKQ GAVTLVGMPT TTATLTLPAI SAIFSGKRLA
GSVVGGAQIL RDMPRFIRLA ETGQLDLGGM VSNRIRLDDI NEGIALLDRA EGTRTVII