Gene Franean1_6331 details


Gene Information

Locus tag: Franean1_6331
Symbol:
ID: 5674650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7690396
End bp: 7691514
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 71%
IMG OID: 641245184
Product: 2-alkenal reductase
Protein accession: YP_001510579
Protein GI: 158318071
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
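
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state its convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 7690396
end_bp = 7691514

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)  # 1119 0 372
```

Under these assumptions the arithmetic agrees with the reported Gene Length (1119 bp) and Protein Length (372 aa).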


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATGG GTCATCTCTC AAGGAGCAGG AGATTGCGGC GGTGGCTCGC CCTGCCGGAG 
ACTCCCAGAC CGTCGATCCG GCACAGGTCG GAGTTTTCGG CCTGGATCGT GGCCGCTGGC
CTGGCGGCTG GCGTCCTGTT CGGGACGGCG GCGTGCGGTA CCGCGACCGA CGACGCATCC
ATGTCCCGGG CCGAGACCAC GAACTCGCAG CCCGCCGGAC TCCCCGCCGT GGTACGCGAC
GCCGAGTCGT CCGTGGTCAC GATCTTCGTC GGAAACGGGC TGGGCAGTGG CGTCGTCTAC
CGAGCCGACG GCGTCATCGT CACCAACGAG CATGTCGTAC GGTCCGCGGC CGACCGCAGG
GTCGAGGTGG CCTTCGCCGA CGGCCGCCGA GCCCCGGGCC GAGTGCAGGC CGCCGACCGG
ATCAGCGACA TCGCGGTCGT CAAGGTCGAT CGGAGCGGCC TGCCCACCCT GACGTTCCGT
AGCGAGCTTC CCCAGGTGGG CGAGCTGGCC GTGGCGATCG GTAGCCCGCT TGGGTTCGAG
AACAGCGCCA CCGCCGGCAT CGTCTCCGGG CTGAATCGCA CCCTGCCGGC ATCCGGCCAG
CCCGGCCGCC TAGGCCAGCC GCTGGTCGAC CTGATCCAGA CCGACGCGGC GATCTCCCCC
GGCAACTCCG GTGGGGCGCT CCTGGACGGG CAGGGCCGCG TCCTGGGCAT CAACGAGGCC
TACGTGCCCC CGTCGGAGGG GGCTGTCTCC CTGGGCTTCG CGATCCCGTC AGCCACCGTT
GTCGACGCTG CCGATCAACT GCTGCGCACT GGCGAGGTGC AGCACGCCTT CCTCGGCGTC
CAGGTCACCA GCCTCACGCC CGAAGTCGCC CGGCAGCTCG ACATCCAGGT CGACAGCGGC
GTCCTGGTGC TCTTTGTCGC CGACCAAGGT CCGGCCGACC GTGCCGGTGT CCGGCTCGGC
GATGTGATCC GCACCTTCAA CGGCGAGCCG GTGAGATCCC CCACCGACTT CCTGGCCCAA
CTCCGCAGTG TCGACCCCGG CCGGCAGGTG ACGCTCGGTA TCCGCCGCAA CGGCGACGAC
CTCGAGGTGA AGGCCACCGT CGCAGACCGG CCCGCCTGA
 
Protein sequence
MDMGHLSRSR RLRRWLALPE TPRPSIRHRS EFSAWIVAAG LAAGVLFGTA ACGTATDDAS 
MSRAETTNSQ PAGLPAVVRD AESSVVTIFV GNGLGSGVVY RADGVIVTNE HVVRSAADRR
VEVAFADGRR APGRVQAADR ISDIAVVKVD RSGLPTLTFR SELPQVGELA VAIGSPLGFE
NSATAGIVSG LNRTLPASGQ PGRLGQPLVD LIQTDAAISP GNSGGALLDG QGRVLGINEA
YVPPSEGAVS LGFAIPSATV VDAADQLLRT GEVQHAFLGV QVTSLTPEVA RQLDIQVDSG
VLVLFVADQG PADRAGVRLG DVIRTFNGEP VRSPTDFLAQ LRSVDPGRQV TLGIRRNGDD
LEVKATVADR PA
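
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence (start of the CDS).
first_row = "ATGGACATGG GTCATCTCTC AAGGAGCAGG"
# Final row of the gene sequence; it is in frame because 1080 bases precede it.
last_row = "CTCGAGGTGA AGGCCACCGT CGCAGACCGG CCCGCCTGA"

print(translate(first_row))  # MDMGHLSRSR
print(translate(last_row))   # LEVKATVADRPA
```

Both outputs match the corresponding ends of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give the figure to compare with the reported GC content.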