Gene Franean1_4537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4537 
Symbol 
ID5672886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5413092 
End bp5414258 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content73% 
IMG OID641243402 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001508818 
Protein GI158316310 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC CGTCCGTAAG ACACGTGACA CCGTCCGTCC GCGTCTTCGC CGGCGAGGGC 
GCGCTCACCG CGCTGCCTCG CGAGTTCGAC CGCGCGGGGA TCCGGCGGGC GGTCGTGTTC
TGCGGGGCCT CGATACGCCG TCACACCGAG GCGGTGGCAA GGGTCGAGTC GGCGCTTGGC
GACCGGCTCG CCGGATGGTT CGACGGAGTC CGCGAGCACA GCCCGCTTCC CGCGGTCGAG
CACGCGCGCG AGGTCCTGGA GGCGACCGGC GCCGACGCCG TGGTGGCGCT GGGTGGCGGG
TCCGCCATCG TGACCGGCCG CGCCGCCAGC ATCCTGCTGG CGGAGAAGGC CGACGTTCGC
GATGTCTGCA CCCGCCGCGT CGACGGCCGG CTGGTCAGCC CGAAGCTCGA CACGCCGAAG
ATCCCCCAGT GGATCATTCC GAGTACTCCG ACCACGGCCT ACGCCAAGGC GGGAAGCGCG
GTGCGGGACC CGGAGACCGG GGAGCGGCTG GCCCTGTTCG ACCCCAAGAC CCGCGCCGCC
GGCGTGTTCA TGGATCCCGT GATCGCCGCC ACCGCGCCGG TGCCGCTGGT GCGGTCATCT
GCCCTGAACG CCTTCGCCAT GGCCGTCGAC GGACTGCAGT CGGACACGGA CGATCCGCTC
GCCGACGCGC TGCTGGCGTA TGCGCTCCGC CTGTCGAGGG AGTGGCTGCC GCGCCTCGAC
GTCGCCTCGG ACGGGGAGCC GCGCCTGCGC CTCATGCTCG CCGCGCTCCT CGCGGGCCAG
GGCAGCGACC ACACGGGCAC CGGCCTGGCT CAGGCGCTCT CGCACGCGGT CGGCCCGCGC
TCCACGGTGG CGAACGGGAC AGTCGAGGCG ATGCTCCTGC CGCCCACGAT GCGCTTCAAC
ACCCAGGTGA CGAAGCGGCG CCTCGTCCAG GTCGCGGAGG TCCTCAGCGG CGGACACCGG
CCTGACGATG GTGCGGCCGA GGCGATCGAC GCCGTCGAGC ACCTGCTCGC GGCCGTCGGT
GTGCCACGCC GCCTGCGCGA CGTCGGGGTT GATCGCGCGG CCCTGCCGGA GATCATCGAG
CACGCCATGG ACGACTGGGC CATCACCCGT GTCCCGCGCC CGGCGACCCG GGAGGATCTC
GAGGCGCTTC TTGACCGCGT CTGGTGA
 
Protein sequence
MSFPSVRHVT PSVRVFAGEG ALTALPREFD RAGIRRAVVF CGASIRRHTE AVARVESALG 
DRLAGWFDGV REHSPLPAVE HAREVLEATG ADAVVALGGG SAIVTGRAAS ILLAEKADVR
DVCTRRVDGR LVSPKLDTPK IPQWIIPSTP TTAYAKAGSA VRDPETGERL ALFDPKTRAA
GVFMDPVIAA TAPVPLVRSS ALNAFAMAVD GLQSDTDDPL ADALLAYALR LSREWLPRLD
VASDGEPRLR LMLAALLAGQ GSDHTGTGLA QALSHAVGPR STVANGTVEA MLLPPTMRFN
TQVTKRRLVQ VAEVLSGGHR PDDGAAEAID AVEHLLAAVG VPRRLRDVGV DRAALPEIIE
HAMDDWAITR VPRPATREDL EALLDRVW