Gene Francci3_4322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4322 
Symbol 
ID3907291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5163738 
End bp5164688 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content72% 
IMG OID637881650 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_483397 
Protein GI86742997 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID[TIGR02824] putative NAD(P)H quinone oxidoreductase, PIG3 family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTGGT CGACGGTGGC GGACCCTCCT GCCCCGGGGC CGGGCGAGGT TACCCTGGAG 
GTGGTCGCGA CGGCGGTGAA CCGCGCGGAC CTGCTCCAGC GGCAAGGTTT CTACCCGCCG
CCGCCCGGTG CCTCTGAGAT CATCGGGATG GAGTGCTCCG GGCGGGTGGC GGTTCTCGGC
GCCGGCGTGG ACCGGGTTGA GGTCGGGGCC GAGGTGTGCG CGCTGCTCAG TGGGGGCGGC
TACGCGAGTC GGGTGAACGT GCCGGTCGGC CAGGTGCTGC CGGTCCCGGC CGGGGTCGAC
CTCATCACGG CGGCCGCTCT GCCCGAGGTC GCCTGCACGG TGTACTCCAC GGTGTTCGGC
ATCGCCGGTC TGCGTGACCG CGAGGTCTTC CTCGTGCACG GCGGGGCGTC CGGCATCGGG
ACCTTCGCGC TCCAGGCGGT CCGCGCGCTG CGTCCGAACG CGCTGGTCGC GACCACCGCC
GGCACCGCGG CCAAACTGGC CAGGGTGCGG GAGCTCGGCG CCCACATCGC GGTCTCCTAC
CGTGACGACG ACTTCGTCGC CAGGATCCGC GAGGCCACCG ACGGCCATGG GGCGGACGTC
ATCCTCGACA ACATGGGTGC GGCGTATCTC GCCCGCAACG TCGCCGTGCT GGCCGTGGGA
GGCCGTCTGG TCGTCATCGG CCTGCAGGGC GGGGTGAAAG GGGAGCTCAA CCTCGGCGCC
CTGCTCACCA AGCGGGCGGC GGTCCACGCC GCGTCGCTGC GCGGGCGGCC GGTCGAGGAG
AAGGCCGACA TCGTGACCGG TGTCCGCGGC GACTTCTGGC CGGCGATCGA GGCGGGGGCG
ATCCGGCCGG TCATCGATCG GGTGCTGTCG ATCACCGAGG TCGCGCGGGC GCACCAGCAT
GTGGCTGATT TCGGACATGT CGGAAAGGTG GTACTCACGA TCCCGGAATG A
 
Protein sequence
MTWSTVADPP APGPGEVTLE VVATAVNRAD LLQRQGFYPP PPGASEIIGM ECSGRVAVLG 
AGVDRVEVGA EVCALLSGGG YASRVNVPVG QVLPVPAGVD LITAAALPEV ACTVYSTVFG
IAGLRDREVF LVHGGASGIG TFALQAVRAL RPNALVATTA GTAAKLARVR ELGAHIAVSY
RDDDFVARIR EATDGHGADV ILDNMGAAYL ARNVAVLAVG GRLVVIGLQG GVKGELNLGA
LLTKRAAVHA ASLRGRPVEE KADIVTGVRG DFWPAIEAGA IRPVIDRVLS ITEVARAHQH
VADFGHVGKV VLTIPE