Gene Franean1_3377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3377 
Symbol 
ID5671748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4001735 
End bp4003183 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content71% 
IMG OID641242265 
Productaldehyde dehydrogenase 
Protein accessionYP_001507685 
Protein GI158315177 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGAGC GGGAGCAGAT GTTCGTCGGA GGCGCGTGGG TCGCGCCGAG CACCGACAGC 
AGGATCGAGG TGACCTCGCC GCACACCGAG AAGCTGATCG GTCGCGTCGC CGCGGCGAGC
ACGGAGGACG TCGACCGCGC CGTCGCCGCA GCGCGGCGGG CCTTTGACGA GGGGCCGTGG
CCGCGGACCG ATCCGGCCGA GCGGGTCGAG GTGATCCGGC GGCTGGCCAC GCTCTACCGC
GGGCATCGCG ATGAGCTGGC CGAGCTCATC ACCGCCGAGC TGGGCGCGCC GATCTCGTTC
GCCAGGCGCG CGCAGGTCGC CCTGCCCGGG GCGCTGATGA CCGCGCTGGC CGACACAGCG
GCCGGCTACT CCTGGCAGGA GAAGCGGCCG GGAACCTACG GCCAGGACAT CATCCTGCGC
AAGGAGCCCG TGGGCGTCGT CGCCGCGGTG GTCCCCTGGA ACATGCCGCA GTTCCTGACC
GTCACCAAGG TCATCCCGGC GCTGCTCGCC GGCTGCACCG TCGTGCTCAA GCCGGCGCCC
GAGTCGTCGC TCGACGCGCT GTTCTTCGCC GACCTGCTCG ACCAGGTCGG CCTGCCGCCG
GGCGTCGTCA ACGTCCTCCC CGCCGACCGT GAGGTGAGCG CCTACCTAGT CGCCCACCCC
GGTATCGACA AGGTCTCCTT CACCGGCTCG ACGGCGGCCG GCAGGCAGGT GGCGGCGGCA
TGCGCTCCCA ACCTCACCAA GGTCAGCCTC GAGCTCGGGG GAAAGTCGGC GGCCATCGCT
TTGGACGACG CCGACCCGGC CACCGTGGCG CGCGCCGTCC GCCTCTCGGG CATGGGCATG
GCCGGGCAGA TCTGCAACTC CCTCACCCGG GTACTCGTGC CCACGAGCCG CGTCGGCGAC
TATGCCGATG CGCTCGCGGC AACCCTGTCG GCCATCAGAA TCGGCGATCC GGCCGATCCC
GCGACCGAGA TGGGCCCGCT CGTGGCCAGG CGCCAGCAGG AACGGGTCCG CCAGTACATC
GACACCGGCG TGCGGGAGGG CGCCCGGCTC GTCGTCGGTG GCACCGATCT GCCCGACGGG
ATCGATCGCG GTTGGTACGT GCGACCGACC GTCTTCAGCG ACGTGGACAA CGCGATGACG
ATCGCCCGGG AGGAGATCTT CGGCCCGGTC CTTGTCGTCA TCCCCTACGG CGACGAGGAC
GAGGCCGTTC GCATCGCCAA CGACTCGGAG TACGGCCTGG CCGGCTCCGT GTTCACCGCG
GACACCGGGC ACGGGCTCGA GGTGGCGGGC CGGGTCCGGG CCGGGACTTT CGGCGTGAAC
CAGGGCTACT CCATGGATCC CGCGGCCCCC TTCGGGGGAG TGAAGGCCAG CGGCTACGGC
CGCGAGCTCG GCCGTGAGGG CCTCGAGGGC TACCTCGACG TCAAATCGAT CTCCGTCGCC
ACCGCGTAG
 
Protein sequence
MLEREQMFVG GAWVAPSTDS RIEVTSPHTE KLIGRVAAAS TEDVDRAVAA ARRAFDEGPW 
PRTDPAERVE VIRRLATLYR GHRDELAELI TAELGAPISF ARRAQVALPG ALMTALADTA
AGYSWQEKRP GTYGQDIILR KEPVGVVAAV VPWNMPQFLT VTKVIPALLA GCTVVLKPAP
ESSLDALFFA DLLDQVGLPP GVVNVLPADR EVSAYLVAHP GIDKVSFTGS TAAGRQVAAA
CAPNLTKVSL ELGGKSAAIA LDDADPATVA RAVRLSGMGM AGQICNSLTR VLVPTSRVGD
YADALAATLS AIRIGDPADP ATEMGPLVAR RQQERVRQYI DTGVREGARL VVGGTDLPDG
IDRGWYVRPT VFSDVDNAMT IAREEIFGPV LVVIPYGDED EAVRIANDSE YGLAGSVFTA
DTGHGLEVAG RVRAGTFGVN QGYSMDPAAP FGGVKASGYG RELGREGLEG YLDVKSISVA
TA