Gene Franean1_4526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4526 
Symbol 
ID5672875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5399139 
End bp5400626 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content70% 
IMG OID641243391 
Productaldehyde dehydrogenase 
Protein accessionYP_001508807 
Protein GI158316299 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCGTAT ATGCGACCAC GGTAGTACAT GTGACCCCGC GTCCCCGTGG TGCTCGCGCA 
AGGGAGACAG TCGTGATCGA AAAAAGAACG ATCTTCATCG ACGGTGAGTG GGTCAGCTCG
GCGGGCACCG GCACCCTGAC CGTCATCAAC CCGGCCACCG AGGAACCGCT CGCGACGTTA
CCGCGGGGCC ACGTCGACGA CGTCGACCGC GCCGCCCAGG CCGCCGCGCG AGCGTTCGAG
TCCTGGTCGC GCTCGACGGT CGACGACCGG ATCGACATGC TCACCCGCAT CGCCGACATT
CTCGACACGC GGGCCGAGGA GCTGGCGCGC ACGATCGTGA GCGAGGTCGG CACACCGATA
ACGCTCGCCC GCGGCTCGCA GTCCGCCACC GCCATCAACG ACCTGCGCAT CGCCGCGGCG
AGCCTGAAGG ACATCGTGTG GGAGGAGCGG TTCGACGACA CGATCGTGCG GCGCATCCCC
GCGGGTGTGG CCGGGGCGAT CACCCCGTGG AACGGGCCGA TGCGGATGAT CGCGCTCAAG
GCGGGCGCGG CCATCGCCGC GGGCTGCACG ATGGTGCTCA AGGGCACCGA GGTCGCACCG
CTGAGCGCCT TCCTGTTCGC CGAGGCCGCC GCCGAGGCCG GCCTGCCCAG GGGCGTGTTC
AACCTGGTCA GCGGAACGGG CCCGGAGATC GGCGAGGCGC TCGCGACCCA TCCGCTCGTC
GACATCGTCT CGCTCACCGG CTCGGTGCGC GCGGGCAGCC GGGTGATGGA GCTGGCGTCG
CGGTCGGTCA AGCGCGTGGC GCTGGAGCTC GGCGGCAAGT CCGCCAACAT CATTCTTGCG
GACGCCGACC TGGAGAAGGC CGTCGTCGAC GGGCTCGGCG ACGCCTTCCG CAACTCCGGC
CAGGTCTGTG GCGGCCTCTC GCGCATGCTG GTCCCCCGCG GGCGGCTGGC CGAAGCCGAG
GAGATCGCCG CGGCGAAGGC GACGAGCTAC GTGATCGGCG ATCCTCTGGA CGAGGCCACG
ACCCTCGGGC CGGTGGTCTC CGATGCCCAG CGCGACCGCG TGCGCCGCTA CATCCAGACC
GGTGTCGACG AGGGACTGCG GCTGGTCGCC GGCGGCCCGG AGGCGCCGGA GCACCTCGAC
CGGGGCTACT ACGTGCAACC GACCGTCTTC ACCGGTGACA ACAGCAGCAG GCTGGCCCAG
GAGGAGATCT TCGGACCGGT GGTCATCATC ATCCCGTTCG ACGACACCGA CGAGGCCGTC
GCCATCGCCA ACGACTCCGA CTACGGGCTC GCGGGCGCGG TCTGGGCGGC CGACCCCGCG
CGGGCACGGG ACGTCGGCCG ACGCATCCGC ACCGGACGCG TACGGATCAA CGGCGCGCCC
ATCGACATGC GCGCCCCGCA CGGTGGCCTC AAGCGGTCGG GCATCGGCCG GGAGATGGGC
CGGTACGGGA TCGAGGAGTA CCTCGAGTAC CAGTCGCTCA TCAGCTGA
 
Protein sequence
MFVYATTVVH VTPRPRGARA RETVVIEKRT IFIDGEWVSS AGTGTLTVIN PATEEPLATL 
PRGHVDDVDR AAQAAARAFE SWSRSTVDDR IDMLTRIADI LDTRAEELAR TIVSEVGTPI
TLARGSQSAT AINDLRIAAA SLKDIVWEER FDDTIVRRIP AGVAGAITPW NGPMRMIALK
AGAAIAAGCT MVLKGTEVAP LSAFLFAEAA AEAGLPRGVF NLVSGTGPEI GEALATHPLV
DIVSLTGSVR AGSRVMELAS RSVKRVALEL GGKSANIILA DADLEKAVVD GLGDAFRNSG
QVCGGLSRML VPRGRLAEAE EIAAAKATSY VIGDPLDEAT TLGPVVSDAQ RDRVRRYIQT
GVDEGLRLVA GGPEAPEHLD RGYYVQPTVF TGDNSSRLAQ EEIFGPVVII IPFDDTDEAV
AIANDSDYGL AGAVWAADPA RARDVGRRIR TGRVRINGAP IDMRAPHGGL KRSGIGREMG
RYGIEEYLEY QSLIS