Gene Franean1_2940 details


Gene Information

Locus tag: Franean1_2940
Symbol:
ID: 5671326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3456969
End bp: 3458615
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 71%
IMG OID: 641241846
Product: aldehyde dehydrogenase
Protein accession: YP_001507266
Protein GI: 158314758
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCTGG TGGAGGAGTC GCTGCTCGCC GGTAGCGCCC TTCGCCCGTG GCGGAGCGCG 
GCAGGAGCCC GCGAACGCCG ACGCTGCCGC CCGTTCGAGG AGTCGCTGAT GACCCGATCC
GCCACGACGG TGGCACCTGC CGCCACAGCC GCTCAGCCGG CGGCGCTCGA GTCGCTGAGC
CCGGGAACCG GGGAGCGTGT CGGCGTCTTC CCGGTCATGG GGGAAGCCGA GGTCGCGGGG
GTCGTTGCCC AAGCGAGACA AGGCTTCTCG TGGTGGAACA CCCGCACCGG GCGCCAACGC
CGCCACTGCC TGCTGAACTG GGCCGCTCAC CTCGTACGCC ACGCCGAAGA GCTGAGTGAT
CTGGTCAGCC GAGAGCAGGG CAAGCCGGCG GACGAGGCCT ACCTGGAGAT CGTCCCGGCC
CTCGAGCAGA TCCGCTGGGC CGCGCTGCAT GCGGGCCGGG TCCTGCGCCG GCGTCGGGTC
TGGCCCGGGC TCGTCATGGC CAACCATGTG GCGGCCGTGG ACTACCGTCC GTACGGTGTC
GTCGGTGTGA TCGGCCCGTG GAACTACCCG GTCTTCACGC CCGCGGGCTC AATCGCCTAT
GCCCTGGCGG CCGGCAACAC CGTGATCTTC AAGCCGAGCG AGTACACCCC GGCCACGGGA
GAGTTCCTGG TACGGGCCTT CGCCGCTGCG AACCCCGACG CGCCCGGCGG CGTGCTGAAC
CTGGTGACCG GTAGTGGGCC GACTGGCGCG GCGCTCATCG CGGCGGGCGT GGACAAGGTC
GCGTTCACGG GGTCGCCCGC CACAGGCCGT CGTGTCATGG CTGCCTGCGC CGCCTCGCTC
ACCCCGGTCG TGCTCGAGTG CGGCGGCAAG GACGCAATGA TCGTCGCGGA CGATGCGGAC
ATACGGGCAG CCGCGAAGGC CGCGGTCTGG GGCGGCATGT CGAACAGCGG CCAGACGTGC
GTAGGCATCG AGCGCGTCTT TGTCGTCGCC GCCGTCCGTG ACAGGTTCCT CCAGGAGATC
CAGCGTCAGC TCGACGGGAT CCGTCCCGGG CTCGCGGCGG ACGCGCCCTA CGGACCGATG
ACGACACCCG GACAGATCGA CATCGTACGG CGCCAGATCG ACGAGGCGCT CGCGGCCGGC
GCGATCCCGC TGGTCGGCGA CGGCTCATCC GTGCACGCGC CCTTCATCGA ACCAGTTGTC
CTTGTGGACC CGCCAGATGA CTGCAGTGTG ATGACCGAGG AGACCTTCGG GCCGGTACTG
GCGGTGCGCA CCGTCGAGGA TGTCGACGAA GCCGTACGGC TCGCCAACGA CAGCCGCTAC
GGCCTGGGCG CCAGTGTCTT CTCGCGTAGC CAGGGCCGGC GGATCGCCGA GCGGGTGACG
TCCGGCATGG TCGCGGTCAA CTGCGGGCTC GCCTTCGTGG GGATTCCCGC CCTTCCGTTC
GGCGGGCAGC ACGACAGCGG CTTCGGTCGT CTGCACGGCG CCGAGGGCCT TCGCGAGTTC
GCCCAGCCAC GCAGCTACGC CCGCCAGCTG ATCTCCCTGC CCGGGATCGA TGCCACCACC
CTGCGGCGGA CCCCGCGCAC GCTTCGGGTC CTGCGCCTGC TCATGAACGC ACGGCACGCT
CGGCGCTCAA CGCTTGGAAG GCGGTGA
 
Protein sequence
MLLVEESLLA GSALRPWRSA AGARERRRCR PFEESLMTRS ATTVAPAATA AQPAALESLS 
PGTGERVGVF PVMGEAEVAG VVAQARQGFS WWNTRTGRQR RHCLLNWAAH LVRHAEELSD
LVSREQGKPA DEAYLEIVPA LEQIRWAALH AGRVLRRRRV WPGLVMANHV AAVDYRPYGV
VGVIGPWNYP VFTPAGSIAY ALAAGNTVIF KPSEYTPATG EFLVRAFAAA NPDAPGGVLN
LVTGSGPTGA ALIAAGVDKV AFTGSPATGR RVMAACAASL TPVVLECGGK DAMIVADDAD
IRAAAKAAVW GGMSNSGQTC VGIERVFVVA AVRDRFLQEI QRQLDGIRPG LAADAPYGPM
TTPGQIDIVR RQIDEALAAG AIPLVGDGSS VHAPFIEPVV LVDPPDDCSV MTEETFGPVL
AVRTVEDVDE AVRLANDSRY GLGASVFSRS QGRRIAERVT SGMVAVNCGL AFVGIPALPF
GGQHDSGFGR LHGAEGLREF AQPRSYARQL ISLPGIDATT LRRTPRTLRV LRLLMNARHA
RRSTLGRR
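
The listed gene length, protein length, and GC content can be cross-checked against the coordinates and the sequence text. The following is a minimal Python sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon that is not counted in the protein length. All function names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-character blocks."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    return gene_length_bp // 3 - 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Values taken from the record above.
print(span_length(3456969, 3458615))   # 1647
print(expected_protein_length(1647))   # 548
```

Passing the full gene sequence to `gc_percent` should give a value that rounds to the listed GC content if the record is internally consistent.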