Gene Francci3_0264 details


Gene Information

Locus tag: Francci3_0264
Symbol:
ID: 3905704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 305474
End bp: 306472
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 74%
IMG OID: 637877592
Product: alcohol dehydrogenase GroES-like protein
Protein accession: YP_479381
Protein GI: 86738981
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID: [TIGR02822] zinc-binding alcohol dehydrogenase family protein
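
The gene and protein lengths in this record follow from the start and end coordinates. The short check below is only a sketch: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based and inclusive).
start_bp = 305474
end_bp = 306472

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 999 332
```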


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.825688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCCT GGCAAGTAGT CGCTCCCGGG CCGATGAGCA CCCGCCCGCT GCGCCCGGCC 
GAGCTGCCCG TGCCCCGGCC AGGGCCGGGC CAGGTTCGGG TCCACGTCGA CGCCTGCGGA
GTCTGCCGTA CCGATCTGCA CCTCGCCGAG GGTGATCTCC CGCCGCACCG ACCGCGCACC
GTGCCCGGAC ATGAGGTCGT CGGACGGGTC GACGCGGTTG GCGAGGGGGT GAGCGGCGTG
CGCACCGGGG ACCGGCTGGG CATCGCCTGG CTGGCCTCGA CCGACGGGAC CTGCGGGTAC
TGTCGGCGCG GAGCGGAGAA CCTCTGCCCG GCCTCGACCT ACACCGGATG GGATGTCGAC
GGCGGATACG CCGAGTACGC CTGCGTGCGC GCCGACTATG CCTACCGGCT GCCCGACGGC
TACAGCGATG CGGAGCTTGC CCCGCTGCTG TGCGCCGGCA TCGTCGGCTA CCGGGCGCTG
CGCCGGGCGG AGCTGCCGCC CGGCGGGAGG CTCGGCATCT ACGGGTTCGG CGCGTCCGCC
CATCTCGCCG CCCAGGTGGC GATCGCACAG GGCGCCACCG TCCACGTGAT GACGCGCTCG
GCGCGGGCCC GCGAGCTTGC TCTGGAACTC GGCGCCGCCT CGGCGGGCGA GGCGTATGCC
GCACCGCCCG AGCCGCTGGA CGCGGCCGTG CTCTTCGCCC CGGTGGGTGA CCTGGTGCCG
GTGGCGCTCG CCGCGCTCGA CCGTGGCGGC ACGCTGTCCA TCGCGGGCAT CCACCTCAGC
GACATCCCGC CGCTCGTTTA CTCCGACCAT CTGTTCCAGG AACGGTCCGT ACGCAGCACG
ACCGCCAACA CTCGCGCCGA CGGTGAGGAG TTCCTGGCGA TCGCCGCCGA GCACCGGCTC
GCGGTGACCG TGTCGCCCTA CCCGCTGTCC GCGGCCGACC GGGCCCTCGC CGACCTCGCC
GCCGACCGGG TCACCGGCGC GGCCGTGCTG CTGCCCTGA
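
A GC content figure like the one reported for this gene can be recomputed from the sequence. The helper below is a minimal sketch, not the database's own method: `gc_content` is a hypothetical name, and it assumes the sequence contains only A, C, G and T plus the whitespace used to separate the 10-base blocks.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Example call on the first two 10-base blocks of the gene sequence.
print(gc_content("ATGCGTGCCT GGCAAGTAGT"))
```

Passing the full gene sequence, line breaks included, works the same way because all whitespace is removed before counting.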
 
Protein sequence
MRAWQVVAPG PMSTRPLRPA ELPVPRPGPG QVRVHVDACG VCRTDLHLAE GDLPPHRPRT 
VPGHEVVGRV DAVGEGVSGV RTGDRLGIAW LASTDGTCGY CRRGAENLCP ASTYTGWDVD
GGYAEYACVR ADYAYRLPDG YSDAELAPLL CAGIVGYRAL RRAELPPGGR LGIYGFGASA
HLAAQVAIAQ GATVHVMTRS ARARELALEL GAASAGEAYA APPEPLDAAV LFAPVGDLVP
VALAALDRGG TLSIAGIHLS DIPPLVYSDH LFQERSVRST TANTRADGEE FLAIAAEHRL
AVTVSPYPLS AADRALADLA ADRVTGAAVL LP
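
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the sense-codon assignments of NCBI translation table 11, which match the standard genetic code (the tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative, and the function simply stops at the first stop codon.

```python
from itertools import product

# Amino acids in NCBI order: codon positions cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCGTGCCT GGCAAGTAGT CGCTCCCGGG"))  # MRAWQVVAPG
```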