Gene Franean1_4527 details


Gene Information

Locus tag: Franean1_4527
Symbol:
ID: 5672876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5400663
End bp: 5401703
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 72%
IMG OID: 641243392
Product: alcohol dehydrogenase
Protein accession: YP_001508808
Protein GI: 158316300
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
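
The length fields above follow from the coordinates. A minimal Python sketch of that arithmetic, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length (the function names are illustrative and not part of the database):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5400663, 5401703)
print(length, protein_length_aa(length))  # prints: 1041 346
```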


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCACCG CAAAGGCCCG GGCCATGAGC GGCCCGACGG CACCGTTCAG CACGATCACC 
GTCGAGCGCC GCGACGTCGG CCCGCGCGAC GTGCTGATCG ACATCGCCTA CGCCGGGGTC
TGCCACACCG ACGTCCACCA CGCCCGCGCG GAGTTCGGGC ACACCCGCTA TCCGATCGTG
CCCGGTCACG AGATCGCCGG CATCGTCCGC GAGGTGGGCG CGGAGGTCGC CGGGCTGACC
GCCGGCGACC ACGTGGGCGT CGGCTGCCTG GTCGACTCGT GCCGGGACTG CCCCGCCTGC
CGCGCGGGGC AGGAGTCCTA CTGCCGCCGC GGCAAGGTGC TGACCTACAA CGGGGTGGGT
CGTGACGGGG CGACCACGCT GGGCGGGTAC AGCGAACTCG TGGTCGTCGA CCAGCGGTTC
GTCGCCCGCA TCCCCGATGC CCTACCGCTG GACGCCGCGG CCCCCCTGCT CTGCGCGGGC
ATCACGATGT ACCAGCCGCT GCAGCGCTGG GGCGCGGGCC CCGGCAGGCG GGTCGGCATC
CTGGGGTTCG GCGGGCTCGG GCACATCGGC GTCCAGATCT CCCACGCGCT CGGCGCGCGC
ACGACGGTCC TGGAACTCAC CGAGGACCGC CGCGCCGACG CCGAGCGCCT CGGGGCGGAC
GACTACCGGA CGACCGGCGA CCTGGGCGCG CTGCGGGACT CGTTCGACCT GATCGTGTCG
ACGGTCCCGA CGAACTACGA TCTGTCCTCC CACCTCGACC TGCTCGACCT GGACGGCACG
TTCGTCAACC TCGGCGTGCC CGACGAGCCG CTGCGCGTCG ACCCCTACAC GCTGCTGACG
AACCGGCGCG TGCTGGCCGG TTCGATGAGC GGCGGCATGC CGCAGACGCA GGAGATGCTC
GACTTCTGCG CCGAGAACGG CATCAGGGCC GAGGTGGAGG TCGTCGCGGC GAAGGAGCTC
GACCAGGTCT ACGACCGCCT CAGTGCCGGC GACGTCCGGT ACCGGTTCGT GCTCGACGTC
GCGACCATCG CCGAGTCCTG A
 
Protein sequence
MITAKARAMS GPTAPFSTIT VERRDVGPRD VLIDIAYAGV CHTDVHHARA EFGHTRYPIV 
PGHEIAGIVR EVGAEVAGLT AGDHVGVGCL VDSCRDCPAC RAGQESYCRR GKVLTYNGVG
RDGATTLGGY SELVVVDQRF VARIPDALPL DAAAPLLCAG ITMYQPLQRW GAGPGRRVGI
LGFGGLGHIG VQISHALGAR TTVLELTEDR RADAERLGAD DYRTTGDLGA LRDSFDLIVS
TVPTNYDLSS HLDLLDLDGT FVNLGVPDEP LRVDPYTLLT NRRVLAGSMS GGMPQTQEML
DFCAENGIRA EVEVVAAKEL DQVYDRLSAG DVRYRFVLDV ATIAES
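
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It assumes the input is an unspliced CDS in reading frame, that the first codon is an initiator (so GTG is read as M), and that a trailing stop codon is dropped. The helper names `clean`, `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; assumes the first codon is an initiator."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    protein[0] = "M"  # assumption: initiator codon (e.g. GTG) is read as Met
    if protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line fragment of the gene sequence above (30 bases).
print(translate_cds("GTGATCACCG CAAAGGCCCG GGCCATGAGC"))  # prints: MITAKARAMS
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives values to compare with the protein sequence and the GC content field of this record.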