Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3895 |
Symbol | |
ID | 5672256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4659541 |
End bp | 4660509 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242774 |
Product | Short-chain alcohol dehydrogenase of unknown specificity-like protein |
Protein accession | YP_001508191 |
Protein GI | 158315683 |
COG category | [R] General function prediction only |
COG ID | [COG4221] Short-chain alcohol dehydrogenase of unknown specificity |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.926603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.100351 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCCA GACTCGACGG CGCGGTCGCG GTGATCACCG GGGCGGGCAG CGGCATCGGC CGGGCCGCCG CGCACTCGCT GACCCGGTTC GGGCGGGTCG ACGTGGTCAT GAACAACGTC GGCATCCTGG CCGTCGGCGC GGTCGAGGAC ATCCCGCTCG AGGCGTGGCA GCGGGTCATC GACGTCAACC TGCTCGGCGT CGTGCGCAGC AACCTCGTCT TCCTGCCGCT GCTGCTCGCG CAGGGCTCGG GGCACGTCGT CCGTGAGCAC GGTGGCACCC GGGCGGTGAT CGTCCGGGAC TACGCGGGCG GGAGCACCTC GGTGCAGTTC GCCGACCAGC TGATCGTGAG TCTGCGAGCC GCCGGCGTCG AGATTCCCGC GGAGCAGGTC ACCGAGTACC AGCCGGGACG GGCGAGCGCC TCCCGGATCG CCGAGTGGAT ACTGAACAAC GGCGTCGACA CCCTCGTCGC CGCGATGGAC ACCGAGACGC TCGCTCAGCT CGTCGACGCC GCGCACGAGG CCGAGGTACC ACTCAAGGTC ATCCTGGCCG GCCGCGAGGT CAGCGCGGAG CTGCTGCAGA CCTACGGCGC CCGGCTCGCG GGGGTCACCT CCTATGCCAA CTACCTGCCG TTCCAGGTCA GCTCCCCGGC TCTCGGCGCC TACCGGGCGG CGGTGGCCCG GTACGCCCCC CAGCTCGTCG ACCCGGACCA GACCCTCGCC CTGACCGCCT ACGTCGTCGC GGACATGCTC GTCCGCGGGC TGGAGGAGGC CGGGGAGTGC CCGAGCAGGC AGTCCTTCAT GGACGGCCTG CGCGCCGTGG AGGACTATGA CGCTGGCGGT CTCATCACCA GAACCGACTT CGGCGAGGAT TTCGGGCGCC TCCGCGAGTG CTACGCCTTC GTCCGGGTCA ACGCCGAGGG CACGGGCATC GAGGTGGTGG ACCCCGACTT CTGCGGCAGT AGGCTCTGA
|
Protein sequence | MTARLDGAVA VITGAGSGIG RAAAHSLTRF GRVDVVMNNV GILAVGAVED IPLEAWQRVI DVNLLGVVRS NLVFLPLLLA QGSGHVVREH GGTRAVIVRD YAGGSTSVQF ADQLIVSLRA AGVEIPAEQV TEYQPGRASA SRIAEWILNN GVDTLVAAMD TETLAQLVDA AHEAEVPLKV ILAGREVSAE LLQTYGARLA GVTSYANYLP FQVSSPALGA YRAAVARYAP QLVDPDQTLA LTAYVVADML VRGLEEAGEC PSRQSFMDGL RAVEDYDAGG LITRTDFGED FGRLRECYAF VRVNAEGTGI EVVDPDFCGS RL
|
| |