Gene Franean1_0924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0924 
Symbol 
ID5669338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1074820 
End bp1075785 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content74% 
IMG OID641239851 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001505286 
Protein GI158312778 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00780913 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.965318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACG ACGACGTCCC GGCCGGTCAT TCCACGCGGA CCGCGCCGGA GGCCCAGGGG 
GCATCCACCG GACGACGGGC GCGACAGCCA CAGGCCGACG CCTCGGCCCC GGCGCGGGCG
GTCACGGCAC CACGCCCACG AGCCGGCACG GCGGACGGCC CCGGCGATGA CGGTAGCGTC
GCGACGGTGA CCGGCCCGTT CGCCAGGACG GCCGGCGACG CCGTCGCGGG CGGCGGCGAG
TACGCCGGCC GGGTCGCCTT CGTGACCGGA TCCGGCTCGG GAATCGGCGC CGCCTGTGCC
CGCCGGCTGG CCGCGGCCGG CGCCTTCGTC GTCCTGGCCG ACCGGGACAC CGTCGCGGCC
AAGGAGGTCG CCGGCGAGAT CGAGGCGGCC GGCGGCACCG CACTGGCGGT GGCGGTCGAC
GTCGCCGACC CGGAGTCGGT CGCCCAGGCC GTCGCCACGG CGATCGAGGC CGGTGGACGG
CTCGACCTCG CGGTCAACAA CGCGGGCATC GCCACCGACC GGGCCCCGCT GGAGGACATC
TCCCTCGCCG ACTGGGACCG GGTTCTCGCG GTCAACCTCT CCGGCGTCTT CTACAGCATG
CGCGCCGAGA TCCCGGCGAT GCTCGCGGCC GGCGGCGGCT CGATCGTCAA CATGGCCTCC
GTGCTGGGCA CGGTCGGCCT ACAGGGCACA CCGGCCTACG TCGCGGCCAA GCACGGCGTG
ATCGGACTCA CCAGGGTGGC CGCGCTGGAC AACGCGACAC GCGGAATCCG GGTCAACGCG
GTAGCACCCG GATTCATCGA CACCACGATG GTCAGCTCAC ACCGCGGAGC ACGCTTCTTC
CAGCCGATGA ACCGGCTGGG GACCGCCGAC GAGGTCGCCG AGGTCGTCCA CTTCCTACTC
TCCGACCGCG CGTCCCTGGT GACCGGCAGC GTCTACTCCG CCGACGGGGG ATTCACCGCC
CGCTGA
 
Protein sequence
MTDDDVPAGH STRTAPEAQG ASTGRRARQP QADASAPARA VTAPRPRAGT ADGPGDDGSV 
ATVTGPFART AGDAVAGGGE YAGRVAFVTG SGSGIGAACA RRLAAAGAFV VLADRDTVAA
KEVAGEIEAA GGTALAVAVD VADPESVAQA VATAIEAGGR LDLAVNNAGI ATDRAPLEDI
SLADWDRVLA VNLSGVFYSM RAEIPAMLAA GGGSIVNMAS VLGTVGLQGT PAYVAAKHGV
IGLTRVAALD NATRGIRVNA VAPGFIDTTM VSSHRGARFF QPMNRLGTAD EVAEVVHFLL
SDRASLVTGS VYSADGGFTA R