Gene Franean1_2968 details


Gene Information

Locus tag: Franean1_2968
Symbol:
ID: 5675705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3494230
End bp: 3495678
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 71%
IMG OID: 641241872
Product: aldehyde dehydrogenase
Protein accession: YP_001507292
Protein GI: 158314784
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
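
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Number of residues for a complete CDS: total codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3494230, 3495678)
print(length)                              # 1449
print(expected_protein_length_aa(length))  # 482
```

Both results agree with the Gene Length and Protein Length fields of the record.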


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGACC GGGAGCAGCT GTTCGTCGGA GGCACCTGGG TCGCGCCGAG CGCCGACCGC 
TTCATCGAGG TGATCTCCCC GCATACGGAA GAGCCGGTCG GTCGCGTGGC AGCGGCGGGG
ACCGCGGATG TCGACCGCGC TGTCGCCGCG GCGCGGCAGG CTTTCGACGA GGGAGCGTGG
CCCCACACCG ACCCGGCCGA GCGCGTCGAG GCGATCCGCC GGCTGGTCAC GCTCTACGGC
AAGCATCGCG ACGAGCTCGC CGAGCTGATC ACCACCGAGC TGGGTGCGCC GATCTCCTTC
GCCAGACGCG CGCAGGTCGC CCTCCCCGGC GCGATGATGA GCGCACTCAC CGACATCGCG
GCCGACTACC GCTGGCGGGA GAACCGGCCG GGAACCTACG GCCAGGACAT CATCCTGCGC
AAGGAGCCCG TGGGCGTCGT AGCCGCCGTC GTCCCCTGGA ACATGCCGCA GTTCCTGACC
GTCACCAAAG TCATCCCGGC CCTCCTCGCC GGCTGCACCG TCGTGCTCAA GCCGGCGCCC
GAGTCGTCGC TCGACGCCCT GTTCTTCGCC GACCTGCTCG ACCAGACCGG CCTGCCACCC
GGCGTCGTCA ACGTCATCCC CGCCGACCGT GAGGTGAGCG CCCACCTCGT CGCCCACCCC
GGCATCGACA AGGTCTCCTT CACCGGCTCG ACGGCGGTCG GCCGGCAGGT GGCGGCGGCA
TCCGCTCCCC ACCTCACCAA GGTCAGCCTG GAGCTCGGGG GAAAGTCGGC GGCCATCGCG
CTGGACGACG CCGACCCGGC CACCGTGGCG CGTGCCGTCC GCCTCTCGGG CATGGGCATG
GCCGGGCAGA TCTGTAACTC CCTCACCCGT GTGCTCGCGC CCGCACGCCG CATCGGCGAC
TACGCCGAGG CACTCGCGGC AACCCTCTCG GCCATCAGAA TCGGCGATCC GGCCGATCCC
GGGACCGAGA TGGGCCCGCT CGTGGCCAGG CGCCAGCAGG AACGGGTGCG CGAGTACATC
GACACCGGCG TGCGCGAGGG CGCCCGACTC GTCCTGGGCG GCACCGACCT ACCCGCGGGC
ATCGAACGCG GCTGGTACGT GCGACCCACC GTCTTCAGCA ATGTCGACAA CTCGATGACG
ATCGCCCGCG AGGAGATCTT CGGTCCGGTC CTCGCCGTCA TCCCCTACCA CGACGAGGCG
GACGCCATCC GCATCGCCAA CGACTCCGAC TACGGCCTGG CCGGCTCCGT GTTCACCGCC
GACACCGAAC ACGGCCTCGA CATCGCCAGC CGGGTCCGAG CCGGCACCTT CGGCGTCAAC
CAGGGCTACT CCATGGACCC CGCCGCCCCC TTCGGAGGAC TGAAAGCCAG TGGTTACGGC
CGTGAACTCG GCCGTGAAGG GCTCGAGGGC TACCTCGACA TCAAATCGAT CTCCGTCGCG
GCCCCCTGA
 
Protein sequence
MQDREQLFVG GTWVAPSADR FIEVISPHTE EPVGRVAAAG TADVDRAVAA ARQAFDEGAW 
PHTDPAERVE AIRRLVTLYG KHRDELAELI TTELGAPISF ARRAQVALPG AMMSALTDIA
ADYRWRENRP GTYGQDIILR KEPVGVVAAV VPWNMPQFLT VTKVIPALLA GCTVVLKPAP
ESSLDALFFA DLLDQTGLPP GVVNVIPADR EVSAHLVAHP GIDKVSFTGS TAVGRQVAAA
SAPHLTKVSL ELGGKSAAIA LDDADPATVA RAVRLSGMGM AGQICNSLTR VLAPARRIGD
YAEALAATLS AIRIGDPADP GTEMGPLVAR RQQERVREYI DTGVREGARL VLGGTDLPAG
IERGWYVRPT VFSNVDNSMT IAREEIFGPV LAVIPYHDEA DAIRIANDSD YGLAGSVFTA
DTEHGLDIAS RVRAGTFGVN QGYSMDPAAP FGGLKASGYG RELGREGLEG YLDIKSISVA
AP