Gene Franean1_1497 details

Gene Information

Locus tag: Franean1_1497
Symbol:
ID: 5669901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1798672
End bp: 1799706
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 70%
IMG OID: 641240417
Product: alcohol dehydrogenase
Protein accession: YP_001505843
Protein GI: 158313335
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
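
The coordinate and length fields in this record are consistent with each other, and the check is simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields.
START_BP = 1798672
END_BP = 1799706


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1035 344"
```

Both results match the Gene Length (1035 bp) and Protein Length (344 aa) fields.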


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0703519
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCACGTG CGATCGTGTT CAAGGGTGAC GAGACCTGGG AGCTTCGAGA GGTCCCGAAG 
CCGGCGTTAA GGCCGGGATG CGCGGTGCTG CGGGTCGAGG CGGTCGGCAT CTGTCACAGC
GATGTCGACC AGTTCAGGGG GCACGCGCCG GTGCCGTCGG GTGGGGTGTT CCCCGCCGTT
CCCGGTCACG AGATCGTCGG TCGGATCGAA GAGATCACCC CGGAGGCGCA GGAAGAGTTC
GGCGTCCGGG AAGGCGACCG GGTCGGGGTC CGTTCAGTGG TGCGCGGGCC TACCGGGAGC
AGGGTGTACG GGTTCGACTT TCCTCTCGAT GAGGGCTCCG GGCTCTTCGG CGGCTATGCC
GACTACATGG AGCTCGTGCC CGGCTCGGAG GTCCACCGGC TTCGCGACGA CCTGCCGGCG
ACCGAGTTGA CGGTCTACGA GTGCCTGACC AACGCGATCA CGTGGATCCG GCCGGTGCAG
CCCGGCCAGA CTCTGGTGGT GGAGGGCCCT GGCCACATGG GGCTGGCCGC GATCGTCGGC
GCCCGTGCGG CCGGCGCTGG AACGATCATC GTGACCGGGC TGGCCGGCGA CCGGCTCCGA
CTCGACACCG CGCTCAAGGT CGGCGCGGAT CACGCCATCG ATGTGGAGAA CGAGGACGTG
GTCGCCCGCG TCGCCGAGAT CACCGGCGGA GCGATGGCGG ACGTCGTCCT CGACGCCGCG
TCAGGCAACC CGGTGACGCT GAAGACCGCC ATGCGGATCG CCCGGACCGG CGCCACGATC
GTCGCGGCCG GGATGAAGGA CCGACTTCTC GACGGATTCG ACGTCAGTCA GATCCCGCTG
CGTCATCTGA CCATCGCGCC CGGCGGCGGC CTCGACCTGG CCGGCGCCTG CACAATGATC
AACGAGGGGA CGGTGCCCAC CGGCGTGCTC CACGGCGCGT CGTTCCCGCT GGAACAGTTC
GAAGACGCGC TGGCGCTGGC CGATCGACGC GTGCCCGGCC AGGACGCGGT ACGCGTTTCC
CTCAAGGTGG CCTGA
 
Protein sequence
MPRAIVFKGD ETWELREVPK PALRPGCAVL RVEAVGICHS DVDQFRGHAP VPSGGVFPAV 
PGHEIVGRIE EITPEAQEEF GVREGDRVGV RSVVRGPTGS RVYGFDFPLD EGSGLFGGYA
DYMELVPGSE VHRLRDDLPA TELTVYECLT NAITWIRPVQ PGQTLVVEGP GHMGLAAIVG
ARAAGAGTII VTGLAGDRLR LDTALKVGAD HAIDVENEDV VARVAEITGG AMADVVLDAA
SGNPVTLKTA MRIARTGATI VAAGMKDRLL DGFDVSQIPL RHLTIAPGGG LDLAGACTMI
NEGTVPTGVL HGASFPLEQF EDALALADRR VPGQDAVRVS LKVA
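
The gene sequence starts with GTG while the protein sequence starts with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of how the two sequence blocks relate. It is not the database's own pipeline. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical, and `START_CODONS` lists only a subset of the initiators that table 11 allows.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, truncated to 30 bases.
print(translate_cds("GTGCCACGTG CGATCGTGTT CAAGGGTGAC"))  # prints "MPRAIVFKGD"
```

Passing the full gene sequence to `gc_content` and `translate_cds` should allow a comparison with the GC content field and the protein sequence listed above.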