Gene Franean1_4593 details


Gene Information

Locus tag: Franean1_4593
Symbol:
ID: 5672938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5474610
End bp: 5476073
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 71%
IMG OID: 641243454
Product: aldehyde dehydrogenase
Protein accession: YP_001508870
Protein GI: 158316362
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
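
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based inclusive positions and the CDS ends in a single stop codon. Both are assumptions here, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
# Values copied from the record above.
START_BP = 5474610
END_BP = 5476073
GENE_LENGTH_BP = 1464
PROTEIN_LENGTH_AA = 487


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```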


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCGC TGACCTACCA GCTGTACATC GACGGCGCGT GGCGGGACAG CGACGGCGAC 
GGCGTCCTCG AGGTCCTCAA CCCGGCGACC GAAGAGGTCA TCGGCGCCGT CCCCGACGGA
ACCGTCAGCG ACGTCGACCG GGCCGTCGCC GCCGCCCGGC GGGCGTTCGA CGAGGGCCCG
TGGCCGACGC TCAGCGCCAA CGAGCGCGCC ACCGCGCTGC TGCGCATGGC CGACGTGATG
GAGCGGCGCG TCGACGAACT CAAGGAACTC AGCGTCCGGG AGGCCGGCTC GACGCGGGCC
CTGGCCGACA CGCTGCAGGT CAGCGTCCCA CTTCACCATT TCAGGGACAT GGCTGAGCGG
GTGCTGCGGC AGTTCCCGTT CGAGCGGGCG ATGCTGCCGA CGGTCGGGCC GACGCTCGCC
CAGGGGGTCG TCCGCCGCGA GCCCTACGGC GTCGCCGCGC TCATCTCGGC CTACAACTTC
CCGCTCTTCC TCAACATCCT CAAGCTGGCC CCGGCACTGG CCGCGGGGTG CACGGTCGTC
CTCAAGCCGG CGCCGACCAC CCCGCTCGAG GCGTTCGTCC TGGGCGAGAT GGCCGACGAG
GCCGGCCTGC CGCCCGGCGT GCTCAACATC GTGAGCGGCG GCATAGCGGC CGGCGAGGCG
CTGACCACCC ATCCCGGGGT CGACATCGTC AGCTTCACCG GCTCCGACAC CGTGGGCCGG
CTGGTCTACA CCCAGGCGGC GCAGTCGCTG AAGAAGGTCG TGCTCGAGCT CGGCGGCAAG
TCCGCCAACA TCATCACCGC CGACGTCGAC CTCGACCTCG TCGTCCCGAC GATCGTCAAC
GGCATGACCA CCCACGCCGG CCAGGGCTGC AGCCTGCTGA CCCGGACGCT GGTGCACCGC
TCACGTCTCG ACGAGCTCGT CGGCCTGGTC AAGCAGAGCC TTGATCACAT CACGGTCGGC
GACCCGGCCG ACCCCGCCAC GACCATGGGA CCGCTGATCA GCGCGGCCCA GCGGGCGAAG
GTCGAGAGCC TCATCTCCGC CGGCCGCGCC GAGGGCGCCC AGGTCGCCTA CGGCGGGGGC
CGGCCCGCCC ATCTCGACAA GGGGTTCTTC GTCGAGCCGA CGCTGTTCGT CGATGTCGAC
AACTCGATGA CGGTCGCCCG CAAGGAGTTC TTCGGCCCGG TCGGCGTCGT CATCGCCTTC
GACGACGACG ACGAGGCGGT CCGGCTCGCC AACGACAGCG AGTTCGGGCT CGGCGGCGGG
GTCTGGGCGC AGTCCCCGGT ACGCGCCTAC GAGATCGCCA AGCGGCTGCG CACCGGAATG
ATCTACATCA ACGGCGGCGG CGCGGGCTCC AGCCCGCACA CCGCGTTCGG CGGCTACAAG
CAGAGCGGGC TCGGCCTCGA GCGCGGCGAG TTCGGCCTCG AGGAGTTCCT GCTGTCCAAG
AGCATCATCT GGAGCGCCCG CTGA
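
The GC content field can be recomputed from the gene sequence for comparison with the reported 71%. The sketch below is one way to do that in Python. The function name `gc_percent` is hypothetical, and it assumes the sequence is pasted as shown above, with spaces and line breaks between the 10-base blocks.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are ignored, so the 10-base
    blocks above can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_percent("ATGTCGGCGC"))
```

Passing the full gene sequence instead of the first block gives the whole-gene value.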
 
Protein sequence
MSALTYQLYI DGAWRDSDGD GVLEVLNPAT EEVIGAVPDG TVSDVDRAVA AARRAFDEGP 
WPTLSANERA TALLRMADVM ERRVDELKEL SVREAGSTRA LADTLQVSVP LHHFRDMAER
VLRQFPFERA MLPTVGPTLA QGVVRREPYG VAALISAYNF PLFLNILKLA PALAAGCTVV
LKPAPTTPLE AFVLGEMADE AGLPPGVLNI VSGGIAAGEA LTTHPGVDIV SFTGSDTVGR
LVYTQAAQSL KKVVLELGGK SANIITADVD LDLVVPTIVN GMTTHAGQGC SLLTRTLVHR
SRLDELVGLV KQSLDHITVG DPADPATTMG PLISAAQRAK VESLISAGRA EGAQVAYGGG
RPAHLDKGFF VEPTLFVDVD NSMTVARKEF FGPVGVVIAF DDDDEAVRLA NDSEFGLGGG
VWAQSPVRAY EIAKRLRTGM IYINGGGAGS SPHTAFGGYK QSGLGLERGE FGLEEFLLSK
SIIWSAR
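
The protein sequence is the translation of the gene sequence under translation table 11. A minimal Python sketch of that translation follows. It uses the standard codon-to-amino-acid assignments, which table 11 shares with the standard code. It does not handle table 11's alternative start codons, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids they encode; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCGGCGC TGACCTACCA GCTGTACATC"))  # MSALTYQLYI
```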