Gene Franean1_2374 details

Gene Information

Locus tag: Franean1_2374
Symbol:
ID: 5670770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2820615
End bp: 2822216
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 75%
IMG OID: 641241291
Product: aldehyde dehydrogenase
Protein accession: YP_001506712
Protein GI: 158314204
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
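
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record: the function names are illustrative, and it assumes that the start and end coordinates are inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 2820615
END_BP = 2822216


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, "bp")
print(protein_length(length), "aa")
```

Under these assumptions the computed values agree with the Gene Length (1602 bp) and Protein Length (533 aa) fields.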


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.238793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTCG TTCATCCACG CCGGGTGGCG GCGGTGATCG GCGGCCGCCA GCACGACCTG 
TACGCCGCGG CCGGCCCGGC GGGCATCGCC CCGGAGACCG CGCGCGTCGT CAGCTCGACG
AACCCGGCCC GCCTGAGCGA CGTCGTCGCC GAGGTCGTGC TGGACGGGGC GGAGGCCATC
GTCGCGGCCG CGGCCGCCGC CCGCGCCGCC CAGCGGGAGT GGGCCGACGT CCCGGCGCCG
GTGCGTGGCG AGGTCATCGG CGCCTTCGGC CGGCTCGTCG AGGACAACGC GGAGACGCTG
GCCCGGCTGG TCACCCGGGA GATCGGCAAG CCGCTGGCGG AGGCGCGCGG GGAGGTCCGC
GAGATCATCG ACACCTGCGC GCTGTTCCGC GGCGAGGGCC GGCGCCCGCA CGGCCAGACG
GTGCCGTCCG AGATGCCCGA CCGGGAGCTG TTCACCTACC GTGAGCCGCT CGGCGTGGTC
ATGGTGATCA CTGCTGGCAA CTTCCCGGTG GCGGTGCCGT CCTGGTATCT GGTGCCGGCG
CTGCTGACCG GCAACGCGGT GGTGTGGAAG CCCGCCGAGT ACGCCGCCGC CTGCGCCGCG
GCCCTGATGG ACCTGCTGAC CGCCGCCGGC GTTCCGCCCG GGGTGGCGAA CCTAGTCCTC
GCCGACGGCC CGGCCACCTC GCTCGGCCTC GAACGCGCCC TGGACGCCGG CCTCGTCGAC
AAGGTCGGCT TCACCGGCTC GACGTCCGTC GGCCGCTTCG TCGGCGCGCT GTGCGGGCGG
CACCTGCAGT CGCCGTGCCT GGAACTGGGC GGCAAGAACC CGATGGTGCT GGCCCCCGAC
GCCGACCTGG ACGCCGCCGT CGCCGCCGCG CTGTTCGCCG GGTTCGGGAC GGCCGGCCAG
CGGTGCACCT CGCTGGGCAC GGTGATCGCG CACGAGTCGG TGCACGGCGC GTTCCGGCGC
CGGCTGGACG CCGCCGTCAG CGGCGCCGTG CTCGGCGACC CCACCCGGGA CGTCCTCTAC
GGCCCGCTGC TCGACGCCCG GTTCGCCGCC GGCTTCGAGG ACCACCTGGC GTGCGTGCGC
GACCATCACG AGCCGTTCGG GTCCACCGCG CTCGGCCGGA TCGGCCCGGC CAGCCCGCGG
CGCGGGTTCG TCGGCGACCC CGAGACCGGC CTGTACTACC ACCCGGTCGT CGTCGACCGG
GTGCGCCCCG AGGACGAGCT GTTCACCGCG GAGACCTTCG GCCCGATCGT CGGGCTGACC
ACCTACCGCC ACCTGGAGGA GGCCGTCGAG CTCGCGAACC TGCCCGGCTA CGGGCTGTCC
TCGTCGATCT TCACCGGTGA CCCGGTGAGC GTCCGGCGCT TCCGGCGCGG GGTGCGTGCC
GGAATGGTCA GCGTGAACAC CTCCACCTCC GGCGCCGAGG CGCACCTGCC CTTCGGCGGC
AACGGGCGCT CCGGCAACGG CGCCCGCCAG TCCGGCCAGT GGGTGCTGGA CCAGATGACC
CGCTGGCAGT CCCTGACCTG GGAACTGTCC GGCCGCCTGC AGAAGGCCCA GCTCGACGTC
TCCGTCCCCC CGGCCGACCT CGGCTTCCGC CTGCCGCGGT GA
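
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal Python sketch of one way to do that; the helper names are hypothetical, and the exact method the database used to arrive at its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten-base block of the gene sequence above, used as a small example.
first_block = "ATGACACTCG"
print(gc_fraction(first_block))
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the GC content field of the record.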
 
Protein sequence
MTLVHPRRVA AVIGGRQHDL YAAAGPAGIA PETARVVSST NPARLSDVVA EVVLDGAEAI 
VAAAAAARAA QREWADVPAP VRGEVIGAFG RLVEDNAETL ARLVTREIGK PLAEARGEVR
EIIDTCALFR GEGRRPHGQT VPSEMPDREL FTYREPLGVV MVITAGNFPV AVPSWYLVPA
LLTGNAVVWK PAEYAAACAA ALMDLLTAAG VPPGVANLVL ADGPATSLGL ERALDAGLVD
KVGFTGSTSV GRFVGALCGR HLQSPCLELG GKNPMVLAPD ADLDAAVAAA LFAGFGTAGQ
RCTSLGTVIA HESVHGAFRR RLDAAVSGAV LGDPTRDVLY GPLLDARFAA GFEDHLACVR
DHHEPFGSTA LGRIGPASPR RGFVGDPETG LYYHPVVVDR VRPEDELFTA ETFGPIVGLT
TYRHLEEAVE LANLPGYGLS SSIFTGDPVS VRRFRRGVRA GMVSVNTSTS GAEAHLPFGG
NGRSGNGARQ SGQWVLDQMT RWQSLTWELS GRLQKAQLDV SVPPADLGFR LPR
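
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch with hypothetical names. It builds the codon table from the standard NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("ATGACACTCG TTCATCCACG CCGGGTGGCG"))
```

The translated prefix can be compared with the first block of the protein sequence (MTLVHPRRVA). The final codon of the gene, TGA, is a stop codon and is not represented in the protein.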