Gene Francci3_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1547 
Symbol 
ID3904779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1855936 
End bp1856955 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content74% 
IMG OID637878884 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_480652 
Protein GI86740252 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.135035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGCCTG CCGCGGACAT CGCGGACATC GCGGAGGTCT CGACGCGGTG GGACAGCCGC 
GGCGGGACGG CCAGGGAAAC GGCGAAGGGA ACGGTGGTTG TGGTGCCGAA GGCTTACGTC
TACAACGACC ACGGCGGTCC GGAGGTGGAG GCGTTCGCTG ACCTCCCGAT CCCGGTGGCC
GGGCCGGGTC AGCTCACGAT CGCGGTACGC GCCGCCGGGG TCAACCCCGT GGACTGGAAG
CTGCGGGGCG GCCTGCGGCT GCCGGCCGCC CCGCCGGCCG TGTTTCCGGT CGTGCTCGGA
GTGGAGGCCT CCGGCGTGGT GACGCAGGTG GGCCCGGACG TCAACGGATT CGCCGTCGGC
GACGAGGTGT TCGGCAGCGC GCCGGGCGGT GGCTACGCCG AGTACACGGT GCTGACCGCC
AGGGAGAGCG CGCGCAAGCC GGCCGCGGTG TCCTTCGTGG CCGCGGCGAC GCTTCCGGTC
GCGGCGGCGA CCGCCTACGA CGGCGTGCAC CAGCTGGCCC TGCCGCCCGG CGCCACCCTG
CTGATCATCG GTGTGGGTGG CGGCGTGGGA GTGGCCGCCG CGCAGATCGC CCGGCATGCG
GGCCTGACGG TCGTCGGCAC CGCGAGCGCC GGCAAGAAGG ACTTCGTCGA GGCCCTCGGC
GTGGCGCACG TCGAGCCGGG CCCCGATGTC GCGGACCGGG TGCGGGCCGC CGCGTCCCGG
GGGGTCGACG GAATCTACGA TCTCGTCGGG GGCGAGACGC TGGACGACGT CGTCGAGGTC
CTCGCGGACC GGTCGAAGCT CGTCACGGCC TTGTCTGGGG CGAGCGACCG GTACGGCGGG
ACGACGGTCC AGCGGGCCCG GGACAGCCGC GTGCTCGACG CGGTCGCCCA GCTGGTCGTG
GACGACGCGC TGGACCCGCT CGTGACCGCG ACCTTCCCGC TGGACCAGGC CCCGGCGGCG
CTGCGCGCGG TGGAGAACGG CCACGCCCGC GGCAAGATCG TGATCAAGGT CGCCGCGTGA
 
Protein sequence
MTPAADIADI AEVSTRWDSR GGTARETAKG TVVVVPKAYV YNDHGGPEVE AFADLPIPVA 
GPGQLTIAVR AAGVNPVDWK LRGGLRLPAA PPAVFPVVLG VEASGVVTQV GPDVNGFAVG
DEVFGSAPGG GYAEYTVLTA RESARKPAAV SFVAAATLPV AAATAYDGVH QLALPPGATL
LIIGVGGGVG VAAAQIARHA GLTVVGTASA GKKDFVEALG VAHVEPGPDV ADRVRAAASR
GVDGIYDLVG GETLDDVVEV LADRSKLVTA LSGASDRYGG TTVQRARDSR VLDAVAQLVV
DDALDPLVTA TFPLDQAPAA LRAVENGHAR GKIVIKVAA