Gene Franean1_6217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6217 
Symbol 
ID5674536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7550546 
End bp7551580 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID641245069 
Productalcohol dehydrogenase 
Protein accessionYP_001510465 
Protein GI158317957 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.51495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGG CCATCGTTTT CAACGGCGAC GAGACCTGGG AGGAACGCGA TCTGCCGGTG 
CCCGATCCCC AGCCGGGCGG AGCTGTCCTT CGGGTGGAGG CGACCGGCCT CTGCCACAGC
GATATCGACC ATTTCCGGGG TCATGTGCAC ACGTCCTGGG GCGGTGCGTT CCCGTCCATC
GCCGGTCACG AGATCGTGGG CCGGGTGGAG AAGATCGACG CCGCGGCGGC CGCCGCGTGG
GGCGTCGGGG AAGGTGACCG GGTCGCCGTC CGCGACATCG TGGTGACCCC CGCCGGTTAC
CGCATCTACG GGCACGACTT CTCCGTGGAC GAGGGCTCCG GCCTGCATGG CGGATTCGCG
GAGCACCTCG AACTGCTGCC CGGTTCCCGG GTGTATCGCC TGCGTGACGA TCTCCCGGCC
GAGGAGCTCA CGGTCTTCGA GCCACTGAGC TGCGCGGTGA CCTGGGTGGC GCCGGTGCGG
CAGGACGATG TCGTGATCAT CGAAGGTCCC GGCCACATGG GCATGGCCAC CATCGTCGCT
GCCCGCGCGG CCGGAGCCGC CACGGTGATC GTGACCGGGA CGGCGAGCGA CAGATTCCGC
CTCGACTGGG CGCTGCGTGT CGGTGCCGAC CACACCGTCG ACGTCGACAA CGAGGACCCG
GTCGAACGAG TACACGAGAT CACCGACGGC CGGATGGCGG ACGTGGTGAT CGACGCCGCG
GCGGGAAATC CGGTGACGGT GAACCTTGCC ATGGATCTCG TGCACAAGGG TGGGCATGTC
GTCGTCGCCG GTATGAAGGA CGGCCCGCTC AAGGGCTTCC ACAGCGACTG GATCCCTACC
CGACGGATCA CCCTCCACCC CGGCGCGGGC CTCGACACGG AAGGAGCGGT CGAGCTCATC
AACGCCGGCC GGGTACCGAC CGCCGACCTG CTCGGCGACA CCTTCCCCCT CGAACGTTTC
GAGGAGGCGT TCGCCCTCCT GTCACGAAGG ACACCGGGCC ACGATTCGAT CCGGGTCGCC
CTGCGCCTGT GCTGA
 
Protein sequence
MSRAIVFNGD ETWEERDLPV PDPQPGGAVL RVEATGLCHS DIDHFRGHVH TSWGGAFPSI 
AGHEIVGRVE KIDAAAAAAW GVGEGDRVAV RDIVVTPAGY RIYGHDFSVD EGSGLHGGFA
EHLELLPGSR VYRLRDDLPA EELTVFEPLS CAVTWVAPVR QDDVVIIEGP GHMGMATIVA
ARAAGAATVI VTGTASDRFR LDWALRVGAD HTVDVDNEDP VERVHEITDG RMADVVIDAA
AGNPVTVNLA MDLVHKGGHV VVAGMKDGPL KGFHSDWIPT RRITLHPGAG LDTEGAVELI
NAGRVPTADL LGDTFPLERF EEAFALLSRR TPGHDSIRVA LRLC