Gene Franean1_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3749 
Symbol 
ID5672114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4440828 
End bp4441865 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content74% 
IMG OID641242630 
Productalcohol dehydrogenase 
Protein accessionYP_001508050 
Protein GI158315542 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.529473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCGA CATACATGTA CGGCGCCGGC GACGTGCGGA TCACCGACGT CGGCGACCCG 
GTGCTGGAGC AGCCGACCGA CGCGCTGGTG CGCGTCGTGC TGGCGTGTAT CTGCGGCAGC
GACCTGCACC CATATCACAG CCTGGCGGCA ACCCCGGCGG GCGTCCCGAT GGGCCACGAG
TTCATCGGCG TCGTCGAGGA GGTCGGCGGC GAGGTGTCGA CGCTGCGCGC CGGCGACCTC
GTCATCGCCC CGTTCGCCTG GTCGGACGGC ACCTGCGAGT TCTGCCGCGA GGGCCTGCAC
ACCTCGTGCC GTACCGGCGG GTTCTTCGCG GCCGGCGGCG TCGGCGGCGG CCAGGCCGAG
GCGATACGCG TCCCACAGGC CGACGGCACC CTGGTGAAGG TCCCGGTGCC CGAGGACTCC
GCCATTCTGC CCTCCCTGCT GACCCTCTCG GACGTGTTCG GAACCGGCTA CCACGCCGCC
GTCCGGGCCG GCGTGAACCC GCGCACCACG GTCACCGTCA TCGGCGACGG CGCGGTCGGG
CTGATGGCGG TGCTCTCGGC CCGGCGGCTC GGCGCGGAGC AGATCATCCT GATGGGGCGA
CACAAGGCCC GCACCGACCT CGGCCTCGAG TTCGGCGCGA CGGACGTCGT CGCCGAGCGC
GGCGAGGAGG GCGTCGCCCG GGTGCGGGAG CTCACCGGCG GCGACGGCAG CCACGCGGTG
CTCGAGGCCG TCGGCTACCG GGCCGCCTAC GACCAGGCCC TCGGTGTGGT CCGGCCGGGT
GGCGTGATCA GCCGGGTCGG CGTGCCCCAG TACGCCGACG CGCCGATCGG CTTCCCCAGC
CTCTTCGGCC GCAACATCAC CCTCACCGGG GGCCCGGCGC CTGTCCGGGC CTACATCGAG
ACGCTGCTCC CCGCCGTCCT CGACGGGGAG GTCGAGCCCG GCAAGGTCTT CGACCGCACA
GTCTCCCTCG AGGACGTCCC CGCCGGCTAC CGCGCGATGG ACGACCGCAA GGCCCTCAAG
GTGCTCGTCC GCCCGTAG
 
Protein sequence
MRATYMYGAG DVRITDVGDP VLEQPTDALV RVVLACICGS DLHPYHSLAA TPAGVPMGHE 
FIGVVEEVGG EVSTLRAGDL VIAPFAWSDG TCEFCREGLH TSCRTGGFFA AGGVGGGQAE
AIRVPQADGT LVKVPVPEDS AILPSLLTLS DVFGTGYHAA VRAGVNPRTT VTVIGDGAVG
LMAVLSARRL GAEQIILMGR HKARTDLGLE FGATDVVAER GEEGVARVRE LTGGDGSHAV
LEAVGYRAAY DQALGVVRPG GVISRVGVPQ YADAPIGFPS LFGRNITLTG GPAPVRAYIE
TLLPAVLDGE VEPGKVFDRT VSLEDVPAGY RAMDDRKALK VLVRP