Gene Franean1_6571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6571 
Symbol 
ID5674886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7995180 
End bp7996256 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content76% 
IMG OID641245422 
Productalcohol dehydrogenase zinc-binding type 2 
Protein accessionYP_001510814 
Protein GI158318306 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR02822] zinc-binding alcohol dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGCCT GGCAGGTCAC TCGCCCCGCG CCGGTGGCCA CGGCGCCACT GCGCGCGGTC 
GAGCTGCCCA TCCCCGAGCC CGGTCCCGGT CAGGTCCGTC TGAAGGTCGC CGCCTGCGGC
GTCTGCCGGA CGGACCTGCA CCTGGCCGAG GGCGACCTCC CGCCGCACCG GCCGCTCACC
GTGCCCGGTC ACGAGGTCGT CGGGTACGTC GACGCGCTCG GTCCCGGGGT CCACGGGGTT
TCCGGGCCAG CCGGGCCAGC CGGTGCGGCC GGCTCCCGGC GACCCGATCC GGCCCCGGCG
GCCCCGATCC GGCTGGGTGA CCGGCTCGGC ATCGCCTGGC TCGCCGGAAC GGATCAGACG
TGCGCCTACT GCCGGCGCGG CGCCGAGAAC CTCTGCCCCG CGTCGCTCTA CACAGGCTGG
GACGCCGACG GCGGGTACGC CCAGTACGCC GTCGTCGACG CGGACTACGC CTACCGCCTG
CCCGCCGGCT ACAGCGACGG CGAGCTGGCC CCGTTGCTGT GCGCCGGGAT CGTCGGCTAC
CGGGCGCTGC TGCGCGCCGA GCTTCCGCCC GGCGGCCGGC TGGGCGTCTA CGGGTTCGGC
GCGTCCGCGC ATCTCGCCGC GCAGGTGGCG ATCGCCCAGG GCGCGACGGT GCACGTCATG
ACCAGGTCCG CCCGGGCCCG CCGCCTCGCC CTCGAGCTCG GCGCGGCGTC CGCGACCGGC
GCCTACGACT TCCCACCCGA GCCGCTCGAC GGGGCGGTCC TGTTCGCACC GGTCGGCGAT
CTGGTCCCGG TCGCGCTCGC CGCGCTGGAC AGGGGCGGCA CCCTCTCGAT CGCCGGGATC
CACCTCACCG ACGTCCCGGT CCTGAACTAC CGTCGGCACC TGTTCCAGGA GCGCTCGGTG
CGCAGCACGA CCGCGAACAC CCGCGCCGAC GGCCGCGAGT TCCTGGAGAT CGCCGGGCGC
CACCGGCTCG CGGTGACAAC CACCCCGTAC CCGCTGACGG CCGCCGACCA GGCGCTCGAG
GACCTCGCAC GCGACCGGGT GGACGGCGCC GCCGTGCTGT TCCCGGACGG CGTCTGA
 
Protein sequence
MLAWQVTRPA PVATAPLRAV ELPIPEPGPG QVRLKVAACG VCRTDLHLAE GDLPPHRPLT 
VPGHEVVGYV DALGPGVHGV SGPAGPAGAA GSRRPDPAPA APIRLGDRLG IAWLAGTDQT
CAYCRRGAEN LCPASLYTGW DADGGYAQYA VVDADYAYRL PAGYSDGELA PLLCAGIVGY
RALLRAELPP GGRLGVYGFG ASAHLAAQVA IAQGATVHVM TRSARARRLA LELGAASATG
AYDFPPEPLD GAVLFAPVGD LVPVALAALD RGGTLSIAGI HLTDVPVLNY RRHLFQERSV
RSTTANTRAD GREFLEIAGR HRLAVTTTPY PLTAADQALE DLARDRVDGA AVLFPDGV