Gene Franean1_5513 details

Gene Information

Locus tag: Franean1_5513
Symbol:
ID: 5673843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6679634
End bp: 6681277
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 73%
IMG OID: 641244368
Product: aldehyde dehydrogenase
Protein accession: YP_001509773
Protein GI: 158317265
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
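
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record for Franean1_5513.
    gene_length = span_length(6679634, 6681277)
    print("Gene length (bp):", gene_length)
    print("Protein length (aa):", expected_protein_length(gene_length))
```

Under these assumptions the coordinates give 1644 bp and 547 aa, which agrees with the Gene Length and Protein Length fields in the record.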


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCGCG TGCGCGACGG CATTGAAGGC TGTTTGGTGG CGCCGCGATC GGATCAGAGA 
GACCCTCGTC AGATGGACGC CGCAGCCAGC ACTCCCCCGT CCCGCCTGCA TCTCAAGCCG
GGCACCGCCT GGGCCGACGC CTACTCGCGC GCCCGGCAGG AGGCCCCGGA AGCGTTCCAC
GAGGACCGCC TGCTCAACCT GTGGGGCGGT CAGTGGCGTC GCACCGGCAA CCCGCTGCAC
AGTCTCACGC CGGTGGACGG CACCCCGATC GCCGGCCCGC CGATGATCGA GCCCGACGAG
GCGCGCGAGG CCATCCGCGC GACCCTCGAC GACCACAAGG AATGGCGCGA CGTCCCGCTG
GCCGACCGCA AGGCCCGGGT GACCGCCGCG ATCGAGGCGA TGGAGGAGCA CCGCGACCTG
CTCGCACTGC TGCTGGTCTG GGAGATCGGC AAGCCGTGGC GGCTCGCCCG CACCGACGTC
GACCGCGCCC TGGACGGCGT GCGCTGGTAC GTCGACGAGA TCGACTCCAT GATCGGCGGC
CGGGCGGCCC TGCCGGGCCC GGTCAGCAAC ATCGCGAGCT GGAACTACCC GATGAGCGTG
CTCATGCACG CCATGCTCGT CCAGGTGCTC GCCGGCAACG CGGCGATCGC CAAGACGCCG
ACCGACGGCG GGGCGGCCTG CCTGACGCTG GCCTGCGCGC TCGCCCGCCG GGCCGGGCTG
CCGGTGTCGC TGGTCTCCGG GTCGGGGTCG CGGCTGTCGT CCGCGCTGGT GCGGGCGCCG
GAGATCGGCT GCCTGGCGTT CGTGGGCGGG CGCTCCGCCG GCGGCCAGGT GGCGGCCGCG
CTCGTCGACA CCGGCAAGCG GCACTTCCTC GAGCAGGAGG GCCTCAACGC CTGGGGCATC
TGGGACTTCT CCCAGTGGGA CCTGCTGGCC TCGCACCTGC GCAAGGGCTT CGAGTACGGC
AAGCAGCGCT GCACCGCCTA CCCGCGCTAT GTCGTCCAGC GCCAGCTCTT CGACAAGTTC
CTGGAGATGT ACCTGCCGGT GGTCTCCTCG GTGCGGTTCG GGCATCCGCT CGCCGTCGAG
AACGACTCCG ACCCGCTGCC CGACCTCGAC TACGGGCCGG TGATCACCGC GGAGAAGGCG
GCGGAGCTCG CCGCCAAGAT CGACGAAGCG GTGACCAAGG GCGGCGTGCC GCTCTACCGC
GGCGACCTCG CCGACGGCCG GTTCCTGCCC GGCCAGGACC GGGCCGCCTA CGTCCCGCCG
GTGGCGGTCC TCAACCCGCC GCCGTCGGCC GCGCTGCACC ACGCGGAGCC GTTCGGGCCG
GTCGACAGCA TCGTGGTCGT CGACTCCGAG GCCGAGCTGC TGTCCGCGAT GAACGCCTCG
AACGGCGCGC TCGTCGCCTC ACTGGCCTGC GACGACGACG CCACGGCGCG CCGGCTGGCC
GGGGAGCTCG CCGCGTTCAA GGTCGGCGTC AACAAGCCCC GCTCGCGAGG CGACCGGTCC
GAGCCGTTCG GCGGGCGCGG CGCGTCGTGG AAGGGCGCGT TCGTCGGCGG GGAGCACCTC
GTCCGCGCGG TCACCGTCGG CGCGGACCCG AACGAACGCC TCTACGGCAA CTTCCCCTCC
TACTCCCTCT ACCCGGAGAC GTGA
 
Protein sequence
MYRVRDGIEG CLVAPRSDQR DPRQMDAAAS TPPSRLHLKP GTAWADAYSR ARQEAPEAFH 
EDRLLNLWGG QWRRTGNPLH SLTPVDGTPI AGPPMIEPDE AREAIRATLD DHKEWRDVPL
ADRKARVTAA IEAMEEHRDL LALLLVWEIG KPWRLARTDV DRALDGVRWY VDEIDSMIGG
RAALPGPVSN IASWNYPMSV LMHAMLVQVL AGNAAIAKTP TDGGAACLTL ACALARRAGL
PVSLVSGSGS RLSSALVRAP EIGCLAFVGG RSAGGQVAAA LVDTGKRHFL EQEGLNAWGI
WDFSQWDLLA SHLRKGFEYG KQRCTAYPRY VVQRQLFDKF LEMYLPVVSS VRFGHPLAVE
NDSDPLPDLD YGPVITAEKA AELAAKIDEA VTKGGVPLYR GDLADGRFLP GQDRAAYVPP
VAVLNPPPSA ALHHAEPFGP VDSIVVVDSE AELLSAMNAS NGALVASLAC DDDATARRLA
GELAAFKVGV NKPRSRGDRS EPFGGRGASW KGAFVGGEHL VRAVTVGADP NERLYGNFPS
YSLYPET
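
The sequence blocks above are printed in groups of ten characters, so they need light cleaning before use. The following sketch, with hypothetical helper names, strips the whitespace, computes the GC fraction, and translates a coding sequence. It assumes the sequence is given 5' to 3' in frame and uses the amino-acid assignments of NCBI translation table 11, which are the same as the standard code; alternative start codons that table 11 permits are not handled specially.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (first base varies slowest, T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    excerpt = "ATGTACCGCG TGCGCGACGG CATTGAAGGC"
    print(translate(excerpt))
    print(round(gc_fraction(excerpt), 2))
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows a comparison against the protein sequence and the GC content reported in the record.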