Gene Franean1_7221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7221 
Symbol 
ID5675522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8816830 
End bp8817909 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content75% 
IMG OID641246058 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001511446 
Protein GI158318938 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0813175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCTGCG ACGGGATCGG CCCGACCGCC CACCGGCGGA CGGCAATCGG GGGCACTATC 
GGAGCTGTGA CGACGGTCCC CGAACTGGTG GCGCCCCGCG ATCCCGGCCT GTCCAGAGCC
TGGCGCTCGG ACCGCCGTGC CGCCCGGCGC AGGCAGCCCG ACCTCCCGGT CGCGATGGCC
CTGGTCACCG ACGGTTCCGA ACGCTCGACG GCGGACGTCC TGCGTGACGC TGGGGTCGAT
GTCGTGGGGC TGCTGGCGCC GGAGCCGCTG GAGTCGCTGG CCTGGGCCGC CGAGGCGAAG
GCCCCGCGCG CCTACAGCGA CCTGATCGCC CTGCTGTCGG ACGACATCGA GGCGGTCTGC
ATCGAGATGT CGCCGCCGGC GTCCGACATC GTGGCCCGCC GGGCGGCCGA GGCCGGCCTG
CACGTCCTGC TCGCGAAGCC GGCCACGGCC GAGGCCGAGG CGCTGCGCGC GGTGGCCGAC
ATCGCCGAGG ACGCCGATCT CGCCCACGTC GTCGCCCTGG ACGGCCGGGC GTGGCCCGCG
GCGTGGCACG TCCAGGCGTC GGTGCACTCG CTGGGCCGGC TGAGCCAGAT CACCGTGGTC
GGCGCGCCGT CCGGGCCGAT CGGCCGGGCC GAGATCATCG ACCTGACGCT GCGCTGGTGC
GGCGAGATCC TCGCCGTGTG CGCCGATCCC GGCAGCATGC CGGCGACCAC GCTCACCCCC
GACGCGCCGG TCACCCTGGC CCTGCTCGCC GCCAACGGGA CGACCGTCCT GATCAACGAA
CGGATGGGCG GGGAGATCGC CACCGCCACG GTGACCGTCT GCGGCGAGGC GGGCCGCATG
GTCGTCCAGG GGCGGCGGGT CCGCCGGCAG GACGGGACGG GCATTCGTGA CCTGTGGATG
CCGACGGTGC CCGCCGAGCG GCCCGGCCTG GTCGAGGCCA CCTACGACGT CGTGCGGGCC
ACCGAGCTGA ACGACCCGGC CCTGGTGCGC GGCGCGACCT TCCACGACCT GCTCACCGCG
ACGCGCCTGC AGGTGGCGGC CGCGGCGTCT CACAAGCGGG GTGGCTGGGT CGAGCTTTGA
 
Protein sequence
MVCDGIGPTA HRRTAIGGTI GAVTTVPELV APRDPGLSRA WRSDRRAARR RQPDLPVAMA 
LVTDGSERST ADVLRDAGVD VVGLLAPEPL ESLAWAAEAK APRAYSDLIA LLSDDIEAVC
IEMSPPASDI VARRAAEAGL HVLLAKPATA EAEALRAVAD IAEDADLAHV VALDGRAWPA
AWHVQASVHS LGRLSQITVV GAPSGPIGRA EIIDLTLRWC GEILAVCADP GSMPATTLTP
DAPVTLALLA ANGTTVLINE RMGGEIATAT VTVCGEAGRM VVQGRRVRRQ DGTGIRDLWM
PTVPAERPGL VEATYDVVRA TELNDPALVR GATFHDLLTA TRLQVAAAAS HKRGGWVEL