Gene Franean1_4489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4489 
Symbol 
ID5672839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5356385 
End bp5357224 
Gene Length840 bp 
Protein Length279 aa 
Translation table11 
GC content69% 
IMG OID641243356 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001508772 
Protein GI158316264 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGGC GAGTCGCCGG GAAGGTCGCG CTGATCACCG GGGCGGCGCG CGGGCAGGGC 
CGCAGCCACG CGGTTCGGCT CGCGCAGGAG GGGGCCGACA TCATCGCCGT CGACCTCTGC
GCCGACGTGC CGGGCGTGCC GTACCCGGGG GGCACCCGCG AGGATCTGGC CGAGACGGTA
CGGCAGGTGG AGGCCCTCGA TCGGCGGGCC GTGGCGACGG TCGCCGACGT GCGTGACCAC
GAGCAGCTCG CGGCGGCTGT GGTGGGGGGT GTCGCCGAGT TCGGGCGGCT CGACGTGGTC
AGCGCGAACG CGGGCATCGC CATGCCGCCC TTCCCCACCC ACGAGATGCC CGAGGAGGTG
TGGCAGGGCA TGCTTGCGGT CAACCTGACC GGCGTCTGGC ACACCTGCAA GGCCGCCATA
CCGCACCTGA TCGCGGGTGG CCGCGGCGGG TCGATCATCC TTACGAGTTC CGCGGCTGGT
CTCAGGGGTT ACGAGAACAT CGCCAACTAC GTCGCGGCCA AGCACGGCGT GGTCGGTCTG
ATGCGGACGC TGGCCAACGA GCTCGCCCGG CACTCGATCC GGGTGAATTC GGTGCATCCC
ACCACTGTCT CGACCGAGAT GATCCAGAAC GAGTCGACCT ACCGCCAGTT CCGGCCGGAC
CTGACCGACA CGCCGACCGA GGACGACGTG CGCGACGCGT TCACGTCGCT CAATCTGATA
CCGGTGCCCT GGATTGAGTC GATTGACGTG TCGAACGCGC TGCTGTTTCT CGCGTCCGAC
GAGTCTCGGT ACATCACCGG CATCACGCTG CCGATCGACG CCGGCCAGAT GGTCAAGTAG
 
Protein sequence
MAGRVAGKVA LITGAARGQG RSHAVRLAQE GADIIAVDLC ADVPGVPYPG GTREDLAETV 
RQVEALDRRA VATVADVRDH EQLAAAVVGG VAEFGRLDVV SANAGIAMPP FPTHEMPEEV
WQGMLAVNLT GVWHTCKAAI PHLIAGGRGG SIILTSSAAG LRGYENIANY VAAKHGVVGL
MRTLANELAR HSIRVNSVHP TTVSTEMIQN ESTYRQFRPD LTDTPTEDDV RDAFTSLNLI
PVPWIESIDV SNALLFLASD ESRYITGITL PIDAGQMVK