Gene Franean1_4515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4515 
Symbol 
ID5672864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5386466 
End bp5387878 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content71% 
IMG OID641243380 
Productaldehyde dehydrogenase 
Protein accessionYP_001508796 
Protein GI158316288 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAATC AGGACCTCAT CCTCATCGAC GGTGAGTGGG TCCCGTCCTC CGGCACTGGC 
GTCATCAACG TCGAGAACCC GGTCACAGAG GAGATCATCG GCACGATTCC GGACGGCACG
CCCGAGGACG TCGACCGGGC GGCGGCCGCT GCCGCCCGGG CCTTCGACGG CTGGTCCCGT
TCGACGCTCG ACGAGCGGGC CGAGGTACTG CGAAGCCTGG CCGACTTCGT GGAGGGCCGG
GCCGAAGAGA TCACCGAGGC GATCGTCCAG GAGATCGGCG AGCCGCTGAG CATCGCCACA
GCTCACCAGA CGCTGTCCAC GGTCAAGCAC CTTCGCGGCA CGGCCGACGC GCTGGCGGAG
GTCGACTGGA ACGTGGCGGT CGGGGACACG CTCGTGCACC GGGCCCCGGT CGGGGTGGTC
GGCGCGATCA CGCCGTGGAA CGTGCCGCTG TTGATGATCG CGATGAAGGT CGGCGCCGCC
GTCGCGGCGG GCTGCACCGT CGTTCTCAAG GGCACCGAGA TCGCCCCGCG CAGCTCCTTC
TTCTTCGCCG AGGGCACGCT GAAGGCCGGC CTGCCCGCCG GCGTGGTCAA CCTGGTCAGC
GGCACCGGCC CGGTGGTGGG CGAGGCCATC GCCGGTCATC GCCTGGTCGA CATGGTCTCG
ATCACCGGCT CGGTGCGCGC GGGCAGCCGC GTCATGGAGA TCGCCTCCCG CACGGTCAAG
CGGGTCGGAC TCGAGCTCGG CGGCAAGTCC GCGAACGTCA TCCTCGAGGA CGCCGACGTC
GCCAGGGCCG TCACCGCCGG CATCGCCGAC GCCTTCCGTA ACAGCGGGCA GGTGTGCGGC
GGCCTCACCC GCGTGCTCGT GCCCCGCGCG CGGCTGGCCG AGGCCGAGAA TGCCGCGCGT
GCTGCCGCCG AGTCCTACGT GCTCGGCGAT CCGTTCGCGG AGGGGACCAG CCTCGGGCCG
GTGGTGACCC GGGCCCAGCG GGACCGCGTC CGCGAACTGA TCCAGTCGGG CATCGACCAG
GGCGCCACGC TGGTGACCGG CGGCCCCGAG CAGCCCGAGG GCCTGGCCAC CGGCTACTAC
GTGAAGCCGA CCATCTTCAG CGGCACACCG GACATGCGGA TCGCCCGCGA GGAGATCTTC
GGACCGGTGG TGACGATCCT CCCGTACGAC ACCGAGGAAG AGGCTGTCGC GATCGCCAAC
GACTCCGACT ACGGTCTCGC CGGTGGTGTC TGGGCCGCCT CGCCCGCGCG CGCCCGCGAA
GTCGCGCTCC GCCTGCGCAC GGGCCGGATC CGCATCAACG GCTCCCCGGT CAACCCGCGT
GCGCCGCACG GCGGTTTCAA GCTCTCGGGT ATCGGCCGGG AGAACGGCCG TTTCGGTATC
GAGGAGTTCC TGGAGTACCA GTCGATCGGG TGA
 
Protein sequence
MINQDLILID GEWVPSSGTG VINVENPVTE EIIGTIPDGT PEDVDRAAAA AARAFDGWSR 
STLDERAEVL RSLADFVEGR AEEITEAIVQ EIGEPLSIAT AHQTLSTVKH LRGTADALAE
VDWNVAVGDT LVHRAPVGVV GAITPWNVPL LMIAMKVGAA VAAGCTVVLK GTEIAPRSSF
FFAEGTLKAG LPAGVVNLVS GTGPVVGEAI AGHRLVDMVS ITGSVRAGSR VMEIASRTVK
RVGLELGGKS ANVILEDADV ARAVTAGIAD AFRNSGQVCG GLTRVLVPRA RLAEAENAAR
AAAESYVLGD PFAEGTSLGP VVTRAQRDRV RELIQSGIDQ GATLVTGGPE QPEGLATGYY
VKPTIFSGTP DMRIAREEIF GPVVTILPYD TEEEAVAIAN DSDYGLAGGV WAASPARARE
VALRLRTGRI RINGSPVNPR APHGGFKLSG IGRENGRFGI EEFLEYQSIG