Gene Franean1_3697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3697 
Symbol 
ID5672063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4377111 
End bp4378052 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content72% 
IMG OID641242580 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001508000 
Protein GI158315492 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGG CCACACTGCC CGCAGCCCGG TTCGACGGCC GGGTCGCCGT GATCACCGGT 
GCCGGCCGGG GTCTGGGGCG CGCCTACGCC CTGCTGCTCG GCTCGCTGGG CGCCAAGGTC
GTCGTCAACG ACCCGGGCGG CAGCATGAGC GGGGAGGGCC TCGACACCGG CCCCGCCGAG
CAGGTCGTCC AGGAGATCGT CGCCGCCGGC GGCGAGGCCG TCGCCTCCAC CGACTCGGTG
GCCACGGCCG AGGGCGGACA GGCGATCATC GGCACGGCGA TCGACAGCTT CGGCCGGATC
GACATCCTCA TCCACAACGC CGGGACGCAC CGCCCGGCGC CGCTGGCGGA GATGACGTAC
GAGGACTTCG ACGCCGTCCT GGACGTCCAC CTGCGCGGGG CATTCCACGT CGTGCGGGCC
GCGTTCCCGC TGATGGTCGC GGCGGGGTAC GGCCGGATCG TGCTGACCTC GTCCATCGGC
GGGCTGTACG GCAACGCCGG GGTCGTCAAC TACGGCGTGT CCAAGGCCGG CATGATCGGG
CTGTCGAACG TGGCCGCCCT CGAAGGGGCC GCGTCGGGCG TGAAGAGCAA CATCATCGTC
CCCGCCGCGA TCACGCGGAT GGCGGAGGGG ATTGACACCT CGGCCTACCC GCCGATGGGG
CCCGAGCTGG TGGCCCCCAC CGTGGGCTGG CTCGCGCACG AGTCCTGCTC GATCACCGGG
GAGATGCTGA CCTCGATCGC CGGCCGGGTG GCCCGCGTCT TCATCGCCGA GACCCCGGGC
GTGTACCAGC CGTCCTGGAC GGTCGAGCAG GTCGGGGAGC AGCTCGAGAC CATCCGCGAC
ACGAGCGACC CCTGGATCCT GCCGGTCGTG CCCTCGGCGC ACGTCGACCA CATCGTCAAC
AGTTTCGCGA TGGCCGCGAA GGGCGCCGCG AACGCGTCCT GA
 
Protein sequence
MSEATLPAAR FDGRVAVITG AGRGLGRAYA LLLGSLGAKV VVNDPGGSMS GEGLDTGPAE 
QVVQEIVAAG GEAVASTDSV ATAEGGQAII GTAIDSFGRI DILIHNAGTH RPAPLAEMTY
EDFDAVLDVH LRGAFHVVRA AFPLMVAAGY GRIVLTSSIG GLYGNAGVVN YGVSKAGMIG
LSNVAALEGA ASGVKSNIIV PAAITRMAEG IDTSAYPPMG PELVAPTVGW LAHESCSITG
EMLTSIAGRV ARVFIAETPG VYQPSWTVEQ VGEQLETIRD TSDPWILPVV PSAHVDHIVN
SFAMAAKGAA NAS