Gene Franean1_5389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5389 
Symbol 
ID5673721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6497646 
End bp6499127 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content67% 
IMG OID641244245 
Productaldehyde dehydrogenase 
Protein accessionYP_001509651 
Protein GI158317143 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGACA GACCTCAGTT CGAGTCGAGG ATGCTCATCG ACGGCAAGCT CGTCGAGGCC 
GCGACCGGCA AGACGTTCGA CAACGTCAAC CCGGCGACCG AGGAGGTGCT CGGCCAGGTC
ACCGATGCCT CCGCCACCGA CATGCACCGC GCCATCGACG CCGCCCGCCG GGCCTTCGAC
GAGACCGACT GGCCGACGAA CCGCGCGTTG CGCAAGCTCT GCCTCTCCCA GCTACAGAAG
GCGCTCGAGT CGGAACGCGA GCAGTTCCGC GAGGAGCTGA TCGCAGAGGT CGGCTGCCCG
CGGACGATCA CCAACGGCGA ACAGCTCGAC GCGCCGCTGG CGAACGCGCT GCGCCACCCG
ACCAAGCTCA TCGACACCTA CCCCTGGGAG ACCGACCTCG GTAACACGGT CGACGAGCGA
AGCGGAAGGC TGACATCGCG GAGGATCTGG CGCGAGCCGA CCGGCGTCGT CGGGGCCATC
GTGCCCTGGA ACTTCCCGCT CCAGATCGCA CTGCACGCAC TCGGCCAGGC GCTGGCCACC
GGCAACACGG TCGTGCTCAA GCCTGCGCCG GACACCCCGT TCCACGCCAC CCGCCTCGGC
CGCCTCATCG CCGAACACAC CGACTTCCCG CCTGGAGTCG TCAACGTCGT CACGGCGTCG
GACCACCTCG TCGGCGAGGA ACTGACCCTC TCCCCCAAAG TCGACCTGAT CTCCTTCACC
GGGTCGACCG CCGTCGGAAA ACGGATCATG GAGAAGGGTG CGGCCACCCT CAAACGACTG
TTCCTGGAGC TGGGCGGCAA ATCGGCGACG ATCGTGCTCG ACGACGCCGA CCTGCAGACC
GCCACGAGGA TGGGCATCGC CGCATGCGTC CACGCCAGCC AAGGCTGTGC CATCCCGACC
CGGATGCTGC TCCCCCGCTC CCGCTACGAC GAGGGCGTAG CCCTGCTCGA GGCCATGTAC
GCGGGCGTCA AGGTCGGTGA TCCCCAGCAG GCGGATACCG TCACCGGACC GGTCATCTCG
ACAAAACAGC GTGAGCGGGT GCTCGGCTAC ATCCGCAAGG GCATCGATGA GGGAGCCAAA
CTTCTCGTCG GCGGCACCGA GCGGCCCGAA GGGCTCGACA AGGGGTACTT CGTCAAACCG
ACGCTGTTCG TCGACGTCGA CAACTCCATG ACCATCGCCC AGGAAGAAAT CTTCGGGCCC
GTCCTGGCGG TGATCCCGTT CGAGGACGAC GAGGACGCGA TCAGGATCGC CAACGACAGC
TCCTACGGCC TGTCCGGCGC GGTGATGTCG GGATCGCTGG AACGGGCGCT GGCGGTGACC
CGACGTATCC GGACGGGCAC CGTCCACGTC AATGGTGGCC TGGCGTCCGG CCCGGACATG
CCCTTCGGCG GCTACAAGGC CAGCGGCATC GGCAGGCACA AAGGACATGC GGGCTTCGAC
CAGTACCTCG AGACCAAGTC CACAGCTTGG CCGGCGAGCT GA
 
Protein sequence
MADRPQFESR MLIDGKLVEA ATGKTFDNVN PATEEVLGQV TDASATDMHR AIDAARRAFD 
ETDWPTNRAL RKLCLSQLQK ALESEREQFR EELIAEVGCP RTITNGEQLD APLANALRHP
TKLIDTYPWE TDLGNTVDER SGRLTSRRIW REPTGVVGAI VPWNFPLQIA LHALGQALAT
GNTVVLKPAP DTPFHATRLG RLIAEHTDFP PGVVNVVTAS DHLVGEELTL SPKVDLISFT
GSTAVGKRIM EKGAATLKRL FLELGGKSAT IVLDDADLQT ATRMGIAACV HASQGCAIPT
RMLLPRSRYD EGVALLEAMY AGVKVGDPQQ ADTVTGPVIS TKQRERVLGY IRKGIDEGAK
LLVGGTERPE GLDKGYFVKP TLFVDVDNSM TIAQEEIFGP VLAVIPFEDD EDAIRIANDS
SYGLSGAVMS GSLERALAVT RRIRTGTVHV NGGLASGPDM PFGGYKASGI GRHKGHAGFD
QYLETKSTAW PAS