Gene Franean1_4456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4456 
Symbol 
ID5672807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5322919 
End bp5324346 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content69% 
IMG OID641243324 
Productaldehyde dehydrogenase 
Protein accessionYP_001508740 
Protein GI158316232 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC ACGAGCAGAT CTACCTTGAC GGAACATGGG TGACGCCGGA CGGCACCGAG 
GAGGTGTCGG TTCCGAACTC CACGACCGGG AAGGAGATCG GCACCATCCG ACTCGGGTCG
GAGAAAGACG TCACCGCCGC CGTGGAGGCA GCCCGCACAG CCTTCACCAC GTGGAGCGAT
ACTCCGATCG CGGAGCGCAT CCGGATCCTC GACGAGATGA CCCGGATCCT GACCTCCCGC
CATTCCGAGC TCAGCGACCT GTTCTCCCAG GAGGTCGGGA CGCCCACCGC CCAGGCGGGG
GCCTTCCAGG GGCTGGCGAC GAGCCTGTTC GGGATGTACG CCCAGAAGGC GTCGACGTTC
GACTACGAGG ACCGACGAGG AAACATCACC GTCCTCCGAG AGCCGATCGG GGTGGTCGGC
GCAATCACGC CGTGGAACTT CCCCCTCCTC CTCATGTGCC TCAAGGTCGC TCCGGCACTC
GCGGTGGGAT GCACCGCCGT GCTTAAGCCC GCGCACATCG CGCCGCTCTC GGCATACGCG
CTGGCGGAGG CGGCGGACGC GGCCGGCCTC CCGCGCGGGG TCCTGAACGT CGTACCGGGC
GGTCCGCGGG CCGGAGAGGC CCTCGTCGCG CACCCGGACG TCGACATGAT CTCCTTCACC
GGCTCGACCA AGGCCGGCCG CCGGATCAGC GCTGTCGCCG GTGAGACGAT CAAGCGGGTC
ACCCTCGAAC TGGGCGGCAA GTCCGCGCTC GTCGTGTTGG ACGACGCCGA TCTCCAGGCG
GCCGTCACCA CCGGCATGCA GTCGGTGACG ACCAACAATG GCCAGGGCTG CGCCTGCCTG
ACCCGCGTCG TCGTTCCCCG TGCACGGCTC GCCGAAGCCG AGGAGATGGC CCGCGCCTTC
GCCGAGTCCA CCGTCATCGG CGATCCTCGC GACCCGGCCA CGACGCTGGG GCCGATGGCC
TCGGCCGAAC ACCGCGACCG GGTCGACGGC TACATCCGTA CGGGGATCGA CGAGGGCGCC
CGACTGGTGT CCGGAGGTCC GGGCGCTCCG GACCGGTGGT CTGGCGGCTT CTTCGTCCGC
CCCACGGTCT TCACGGTCGA GGATGTCTCC GCCACCATCG CCCAGCAGGA GATCTTCGGG
CCCGTGCAGG TGCTCATCCC ACATGACGGC GACGACGACG CGGCTCGGAT CGCCAACGCG
ACCGTCTACG GGCTCGCCGG CGCCGTCTTC GGTACCGACC CGGACCGGAT TCGGCGGGTC
ACCCGGCTGA TGCGCACGGG GAAAGTCGAC ATCAACGGTA CGACCTTCGA CCCGGAGGCC
CCGTTGGGCG GCTACCGGCA GTCGGGCAAC GGCCGCGAGT TCGGCGACTA CAGCCTGGAG
GAGTTCCTGG AGACGAAAGC GGTGGCGGAC CTGACCGCCG GAGCCTGA
 
Protein sequence
MTTHEQIYLD GTWVTPDGTE EVSVPNSTTG KEIGTIRLGS EKDVTAAVEA ARTAFTTWSD 
TPIAERIRIL DEMTRILTSR HSELSDLFSQ EVGTPTAQAG AFQGLATSLF GMYAQKASTF
DYEDRRGNIT VLREPIGVVG AITPWNFPLL LMCLKVAPAL AVGCTAVLKP AHIAPLSAYA
LAEAADAAGL PRGVLNVVPG GPRAGEALVA HPDVDMISFT GSTKAGRRIS AVAGETIKRV
TLELGGKSAL VVLDDADLQA AVTTGMQSVT TNNGQGCACL TRVVVPRARL AEAEEMARAF
AESTVIGDPR DPATTLGPMA SAEHRDRVDG YIRTGIDEGA RLVSGGPGAP DRWSGGFFVR
PTVFTVEDVS ATIAQQEIFG PVQVLIPHDG DDDAARIANA TVYGLAGAVF GTDPDRIRRV
TRLMRTGKVD INGTTFDPEA PLGGYRQSGN GREFGDYSLE EFLETKAVAD LTAGA