Gene Franean1_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3895 
Symbol 
ID5672256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4659541 
End bp4660509 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content71% 
IMG OID641242774 
ProductShort-chain alcohol dehydrogenase of unknown specificity-like protein 
Protein accessionYP_001508191 
Protein GI158315683 
COG category[R] General function prediction only 
COG ID[COG4221] Short-chain alcohol dehydrogenase of unknown specificity 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.926603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.100351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCA GACTCGACGG CGCGGTCGCG GTGATCACCG GGGCGGGCAG CGGCATCGGC 
CGGGCCGCCG CGCACTCGCT GACCCGGTTC GGGCGGGTCG ACGTGGTCAT GAACAACGTC
GGCATCCTGG CCGTCGGCGC GGTCGAGGAC ATCCCGCTCG AGGCGTGGCA GCGGGTCATC
GACGTCAACC TGCTCGGCGT CGTGCGCAGC AACCTCGTCT TCCTGCCGCT GCTGCTCGCG
CAGGGCTCGG GGCACGTCGT CCGTGAGCAC GGTGGCACCC GGGCGGTGAT CGTCCGGGAC
TACGCGGGCG GGAGCACCTC GGTGCAGTTC GCCGACCAGC TGATCGTGAG TCTGCGAGCC
GCCGGCGTCG AGATTCCCGC GGAGCAGGTC ACCGAGTACC AGCCGGGACG GGCGAGCGCC
TCCCGGATCG CCGAGTGGAT ACTGAACAAC GGCGTCGACA CCCTCGTCGC CGCGATGGAC
ACCGAGACGC TCGCTCAGCT CGTCGACGCC GCGCACGAGG CCGAGGTACC ACTCAAGGTC
ATCCTGGCCG GCCGCGAGGT CAGCGCGGAG CTGCTGCAGA CCTACGGCGC CCGGCTCGCG
GGGGTCACCT CCTATGCCAA CTACCTGCCG TTCCAGGTCA GCTCCCCGGC TCTCGGCGCC
TACCGGGCGG CGGTGGCCCG GTACGCCCCC CAGCTCGTCG ACCCGGACCA GACCCTCGCC
CTGACCGCCT ACGTCGTCGC GGACATGCTC GTCCGCGGGC TGGAGGAGGC CGGGGAGTGC
CCGAGCAGGC AGTCCTTCAT GGACGGCCTG CGCGCCGTGG AGGACTATGA CGCTGGCGGT
CTCATCACCA GAACCGACTT CGGCGAGGAT TTCGGGCGCC TCCGCGAGTG CTACGCCTTC
GTCCGGGTCA ACGCCGAGGG CACGGGCATC GAGGTGGTGG ACCCCGACTT CTGCGGCAGT
AGGCTCTGA
 
Protein sequence
MTARLDGAVA VITGAGSGIG RAAAHSLTRF GRVDVVMNNV GILAVGAVED IPLEAWQRVI 
DVNLLGVVRS NLVFLPLLLA QGSGHVVREH GGTRAVIVRD YAGGSTSVQF ADQLIVSLRA
AGVEIPAEQV TEYQPGRASA SRIAEWILNN GVDTLVAAMD TETLAQLVDA AHEAEVPLKV
ILAGREVSAE LLQTYGARLA GVTSYANYLP FQVSSPALGA YRAAVARYAP QLVDPDQTLA
LTAYVVADML VRGLEEAGEC PSRQSFMDGL RAVEDYDAGG LITRTDFGED FGRLRECYAF
VRVNAEGTGI EVVDPDFCGS RL