Gene Franean1_4227 details


Gene Information

Locus tag: Franean1_4227
Symbol:
ID: 5672582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5034115
End bp: 5035149
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 71%
IMG OID: 641243100
Product: alcohol dehydrogenase
Protein accession: YP_001508517
Protein GI: 158316009
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
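
The coordinates and lengths in this record agree with one another, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5034115, 5035149)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1035 344
```

Both values match the Gene Length and Protein Length fields of the record.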


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.463097
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGTG CCATCGTCTT CAACGGCGAC GAGACCTGGG AGGAACGCGA CCTCCCCGTC 
CCCGACCCCC AGCCCGGCGG CGCCGTCCTG CGCGTCGAGG CCACCGGCCT GTGCCACAGC
GACATCGACC ACTTCCGCGG CCACGTCCAC ACCTCCTGGG GCGGTGCCTT CCCCTCCATC
GCCGGCCACG AGATCGTCGG CCGCGTCGAG AAGATCGATT CGGCCACGGC CGAGGCGTGG
GGGGTGGCGG AAGGCGACCG GGTCGCCGTC CGTGAGCTCC TCGTCACGCC CGAGGGGTAC
CGGATCTACG GCCACGACTT CTCGGTGGAC GAGGGCTCGG GCCTGTACGG CGGCTTCGCC
GAACACCTCG AACTGCTGCC CGGCTCCCAG GTCTACCGCC TGCGCGAGGA CCTCCCCGCC
GACCAGCTCA CGGTCTTCGA ACCGCTGAGC TGCGCGGTCA CCTGGGTGGC GCCGGTCAGG
AAGGGCGACG TCGTCGTCAT CGAGGGGCCA GGCCACATGG GGATGGCGAC CATCGTCGCC
GCCCGTGCGG CCGGCGCCTC GACCATCATC GTCACCGGCA CGGCCAGGGA CCGCTTCCGC
CTCGACTGGG CACTGCGCGT CGGCGCCGAC CACACCGTCG ACGTCGACTC CGAGGACCCC
CTCGAACGGG TCCGCGAGCT CACCGACGGC CGCCTGGCCG ACGTCGTCAT CGACGCCGCC
GCCGGTAATC CGGTCACCAT CAACCTCGCT ATGGACCTCG TCCGCAAGGG TGGGCACGTC
GTCATCGCCG GGATGAAGGA CCGCCCCCTC GAAGGCTTCC ACAGCGACTG GATCCCCACC
CGACGGATCA CCCTGCACCC CGGCGCCGGC CTCGACACCG AGGCGGCCGT CGACCTCATC
AACGCGGGCA AGGTACCGAC CGGCGAGCTG CTCGGCGACA CCTTCCCCCT CGAACATTTC
GAGGATGCCT TCGCGCTTCT GACCCGCAGG ACACCCGGCC GAGACTCGAT CCGGATCGCC
CTGCGCCTCA CCTAG
 
Protein sequence
MSRAIVFNGD ETWEERDLPV PDPQPGGAVL RVEATGLCHS DIDHFRGHVH TSWGGAFPSI 
AGHEIVGRVE KIDSATAEAW GVAEGDRVAV RELLVTPEGY RIYGHDFSVD EGSGLYGGFA
EHLELLPGSQ VYRLREDLPA DQLTVFEPLS CAVTWVAPVR KGDVVVIEGP GHMGMATIVA
ARAAGASTII VTGTARDRFR LDWALRVGAD HTVDVDSEDP LERVRELTDG RLADVVIDAA
AGNPVTINLA MDLVRKGGHV VIAGMKDRPL EGFHSDWIPT RRITLHPGAG LDTEAAVDLI
NAGKVPTGEL LGDTFPLEHF EDAFALLTRR TPGRDSIRIA LRLT
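
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG can serve as an initiation codon and is then read as methionine. Below is a minimal sketch of that translation, together with a GC-fraction helper. The function names are hypothetical, spaces in the pasted sequence are ignored, and the first codon is assumed to be the initiator.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCCGTG CCATCGTCTT CAACGGCGAC"))  # MSRAIVFNGD
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.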