Gene Franean1_4276 details

Gene Information

Locus tag: Franean1_4276
Symbol:
ID: 5672631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5111805
End bp: 5115407
Gene Length: 3603 bp
Protein Length: 1200 aa
Translation table: 11
GC content: 75%
IMG OID: 641243149
Product: aldehyde dehydrogenase
Protein accession: YP_001508566
Protein GI: 158316058
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
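
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5111805
end_bp = 5115407
reported_gene_length = 3603      # bp
reported_protein_length = 1200   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "3603 0 1200"
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.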


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24568
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.171168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGG CAGTGAGCAG TAGCCCGCAC GACCTGTCCG ACCTGGGCGA CGAGGCTGTC 
GCGCTGGTCC GCCGCTGGGT CGCCGAGGCG GCGTCCGAGC CGGTGGACCC GGCCGCCGCC
CGGCTGGCCG CCGTCCTGCG GGAGCCGGGC GGGCTGGCGT TCACCGTGCG CTTCGTCGAC
GGGGTGATCC GTCCGGAGGA CCCGCGGGTA GCGGCGCGCA ACCTGGCCCG GCTCGCGCCG
TCCGTGCCCG CCTTCCTGCC CTGGTACCTG CGCGGCGCGG TGCGGGCAGG GGGAGTAGCC
GGGCCCGTGG TGCCGTGGGT CGTCGTCCCG GCCGCCCGGC GGGTGCTGCG CCGGATGGTC
GGCCACCTCG TCGTCGATGC CACCGACCGC GGGCTCGGAC GGGCCATCGA GCGGCTGCGC
CGGCCCGGCG TCCAGCTGAA CATGAACCTG CTGGGGGAGG CCGTCCTCGG TGAGCGGGAG
GCCGCGCGCC GGCTGGCGGG CACGACGGCG CTGCTCGCCC GGGACGACGT CGACCACGTG
TCGATCAAGG TGTCGGCGAG CGTCGCGCCG CACGCCGCCT GGGCCTTCGA GGAGACCGTG
GCGCATGTCG TGGAGACGCT GACACCGCTG TTCGAGCAGG CCATGGCAGC GCGGCCGGCG
GAGTCGCGGC CGGCCACGTT CGTCACGTTG GACATGGAGG AGTACCGCGA CCTGGACCTG
ACGATCGAGG TGTTCACCAC GCTGCTCGAC CGGCCGGCCC TGCGGCGCCT GGCGGCCGGG
ATCGTCCTGC AGGCCTACCT GCCCGACGCG CTCGGCGCCC TTACCCGCCT GCAGGAGTGG
AGCGCGTCCC GGCGGGCGGC CGGCGGTGCC CCGGTCACCG TACGGCTGGT GAAAGGCGCG
AACCTGCCGA TGGAACGGGC CGAGGCGTCC CTGCGTGGCT GGCCGCCGGC GCCGTACGAC
ACCAAGCAGG ACACCGACGC CAACTACAAA CGGCTGCTGG ACTACGCCCT GCATCCCGAC
CGGGTCGCCA ACGTGCGCGT CGGGGTCGCC GGCCACAACC TGTTCGACCT GGCCTTCGCG
TGGCTCCTCG CCGGCCGCCG CGGCGCCCGC GGCGGCCTCC GCTTCGAGAT GCTGCTCGGG
ATGGCGCAGG CGCAGGCGCG GGTCGTCGCG CGTGAGGTGG GCGGCCTGCT CCTCTACACC
CCGGTGGTGC GGCCGGCGGA GTTCGACGTC GCCATCGCCT ACCTGGTCCG CCGGCTCGAG
GAGGGCGCGA GCCGGGAGAA CTTCATGTCG GCCATGTTCG ACCTCGCCAC CAACGAGACG
CTGTTCGCCC GTGAGGAGAA GCGGTTCCGC GCCTCGCTGG CCGACGTCGA CGACACCGTC
CCGTCCCCGC GCCGCACCCA GAACCGCCTG CGCGCCACGC CTCCCATCGG CGCAGCGGGG
CCGGCCGGGC CTGATGGCAC AGGAGGCTTC CGGAACGCTC CGGACACCGA CCCCTCCCTG
GCGGCCAACC GGACGTGGGC GCGGCGCGTC CTGGCCCGGG CGCCGGCGTC CACGCTGGGC
GTCGACCTCG TCGCGGTTAC CACGATCCGC TCGGCGGACG AGCTGGACGC GGTGCTGGCA
CGCGCGGTCG CCGCTGGCCC GGGTTGGGCG GCGCTGGGTG GGGCGGGCCG CGCGGCCGTT
CTACGCCGGG CCGCCGAGGT GCTCGAACAG CGGCGCGGCG AGCTGGTCGA GGTCATGGCC
ACCGAGACCG CGAAGACCTT CGACCAGGCC GATCCGGAGG TCTCGGAGGC CGTCGACTTC
GCCCGCTACT ACGCCGAGCG GGGTGCCGGG CTCGACGACG TCGACGGGGC GCTGCTCGCG
CCGGTGCGCC TCACCGTGGT CACGCCGCCG TGGAACTTCC CGGTCGCGAT CCCCGCCGGG
TCCACCCTCG CCGCCCTCGC CGCCGGCTCG CCGGTGGTGA TCAAGCCGGC CGGTCAGGCC
CGTCGCTGCG GCGCCGTCCT GGTAAGGGCG CTGTGGGACG CCGGCATCCC CCGCGAGGTC
CTCCAGCTCG TCAACGTGGA CGAGGGCGAC CTCGGCCGGG CGCTGGTCGG TGACCCTCGC
GTCGATCGGG TCATCCTCAC CGGCGCGTTC GAGACGGCGG AGCTGTTCCG GTCGTTCCGC
CGCGACCTGC CGCTGCTCGC GGAGACGAGC GGCAAGAACG CGATCGTGGT GACGCCGAGC
GCCGACCCCG ACCTGGCCGT GCGCGACGTC GTTGCCTCCG CGTTCGGCCA CGCCGGCCAG
AAGTGCTCGG CGGCGTCGCT GCTCATCCTC GTCGGCTCTG CGGCGGCGTC CCGCCGGCTG
CGCGACCAGC TCGTCGACGC GGTCCGCTCG CTGGTCGTGG GGGAGTCGGC CGAGCCGCGG
ACCCAGCTCG GGCCGCTCAT CGAACCGGCT TCGGGCAAGC TGCTGCGCGC GCTGACCGAG
CTCGGCCCGG GCGAGCGCTG GCTCGTCGAG CCGCGGCGCC TCGACGAACA GGGCCGGCTG
TGGTCGCCCG GAGTGCGTGC GGGGGTGTCC CGCGGGTCGG AGTTCCACCG GACGGAGTAC
TTCGGCCCGG TCCTGGGGAT CATGCCGGCG GCCGATCTCG CTGAGGCCGT GGAGATCCAG
AACGAGGTCG ACTTCGGCCT CACCGCGGGC CTGCACTCGC TCGACGCGGC GGAGCTCGAC
TACTGGCTGC GCCACGTCCA GGCGGGCAAC CTCTACGTGA ACCGCGGCAT CACCGGCGCG
ATCGTGGGCC GCCAGCCGTT CGGCGGCTGG AAGCGCTCGG CGGTGGGCCC GGGAACCAAG
GCGGGCGGCC CGACCTACCT CTACGCCCTC GCTGACTGGA CCAGCAGACC CAGCTCCGCG
ACGGCCGAGC TCGGACCGAC CGCCCGCCGC CTGCTCGCCG TCGTCGCTGA AAGTGACTGT
GTAGGAGAGT TCGGCGACGG TGCCTCCGTT GGTGGCGCCT CCGCTAGTGA TGCCTCCGCC
GGCGGCGCGT CCGCTGGCGG TGCCGCGCCT GGCGGAGGCG CCGGTGGTGG GGTCGGCGCG
GTCGCGGATC TCGCTGGGCT GGAACGGGCG CTGCGCAGCG ACGCGGCCGC CTGGGCCGGC
GGGTACGGCG CCGCCCGCGA GCTCGCCGGG CTGTCCGCCG AGCGCAACAT GCTGCGCTAC
CAGCCCGTTC CGGTCGAGAT CCGCTTCGCG GGCGGCGGGA TCCACCAGCT CGTCCGCGTG
GTCGCGGCCG GGCTGCTGGC CGGCTCGCCG TTGCGGGTGA GCTCCGCGGT CGAGCTCCCG
CGGTGGCTAC GCGCGGAGCT CGCCGGGCTG GCGGTCGGTA ACAGCTGCCA GTCCGACGAC
GAGTGGCTCG CTGACCTGGC CCGACGCCCG GCGAGCCGGG CCGGGCTGCG GGTTCGGCTG
ATCGGCGGGG ACGCCGGCGC GCTGACGGCC GCGGTCGGTG ACCGCCTGGA GATCGCCGTC
CACGCGCGGC CGGTGACCGA GTCCGGGCGG CTGGAGTTGC TGCCGTTCCT GCGCGAGCAG
GCGGTCAGCA TCACCGCCCA CCGCTTCGGT ACCCCGGACG GCCTCTCCGA CGGCCTCCTC
TGA
 
Protein sequence
MTAAVSSSPH DLSDLGDEAV ALVRRWVAEA ASEPVDPAAA RLAAVLREPG GLAFTVRFVD 
GVIRPEDPRV AARNLARLAP SVPAFLPWYL RGAVRAGGVA GPVVPWVVVP AARRVLRRMV
GHLVVDATDR GLGRAIERLR RPGVQLNMNL LGEAVLGERE AARRLAGTTA LLARDDVDHV
SIKVSASVAP HAAWAFEETV AHVVETLTPL FEQAMAARPA ESRPATFVTL DMEEYRDLDL
TIEVFTTLLD RPALRRLAAG IVLQAYLPDA LGALTRLQEW SASRRAAGGA PVTVRLVKGA
NLPMERAEAS LRGWPPAPYD TKQDTDANYK RLLDYALHPD RVANVRVGVA GHNLFDLAFA
WLLAGRRGAR GGLRFEMLLG MAQAQARVVA REVGGLLLYT PVVRPAEFDV AIAYLVRRLE
EGASRENFMS AMFDLATNET LFAREEKRFR ASLADVDDTV PSPRRTQNRL RATPPIGAAG
PAGPDGTGGF RNAPDTDPSL AANRTWARRV LARAPASTLG VDLVAVTTIR SADELDAVLA
RAVAAGPGWA ALGGAGRAAV LRRAAEVLEQ RRGELVEVMA TETAKTFDQA DPEVSEAVDF
ARYYAERGAG LDDVDGALLA PVRLTVVTPP WNFPVAIPAG STLAALAAGS PVVIKPAGQA
RRCGAVLVRA LWDAGIPREV LQLVNVDEGD LGRALVGDPR VDRVILTGAF ETAELFRSFR
RDLPLLAETS GKNAIVVTPS ADPDLAVRDV VASAFGHAGQ KCSAASLLIL VGSAAASRRL
RDQLVDAVRS LVVGESAEPR TQLGPLIEPA SGKLLRALTE LGPGERWLVE PRRLDEQGRL
WSPGVRAGVS RGSEFHRTEY FGPVLGIMPA ADLAEAVEIQ NEVDFGLTAG LHSLDAAELD
YWLRHVQAGN LYVNRGITGA IVGRQPFGGW KRSAVGPGTK AGGPTYLYAL ADWTSRPSSA
TAELGPTARR LLAVVAESDC VGEFGDGASV GGASASDASA GGASAGGAAP GGGAGGGVGA
VADLAGLERA LRSDAAAWAG GYGAARELAG LSAERNMLRY QPVPVEIRFA GGGIHQLVRV
VAAGLLAGSP LRVSSAVELP RWLRAELAGL AVGNSCQSDD EWLADLARRP ASRAGLRVRL
IGGDAGALTA AVGDRLEIAV HARPVTESGR LELLPFLREQ AVSITAHRFG TPDGLSDGLL
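
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a hedged example, not the method the database used. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first line of the gene sequence is translated here. The GC helper is included for reference and is not used to confirm the reported 75% figure.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: amino-acid assignments for
# translation table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed above.
first_line = "ATGACCGCGG CAGTGAGCAG TAGCCCGCAC GACCTGTCCG ACCTGGGCGA CGAGGCTGTC"
print(translate(first_line))  # prints "MTAAVSSSPHDLSDLGDEAV"
```

The 20 residues produced match the first two blocks of the protein sequence listing (MTAAVSSSPH DLSDLGDEAV).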