Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0621 |
Symbol | |
ID | 5669038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 721716 |
End bp | 722495 |
Gene Length | 780 bp |
Protein Length | 259 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239548 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001504986 |
Protein GI | 158312478 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTCG AAGGCAAGAT CGCCGTGGTG ATGGGCGCCG GCTCAGGGGT CGGCCGCGCC TCGGCACTGC GGTTCGCCGA GGAGGGCGCC CGGGTGCTGT GCGCCGACAT CCGGCCCGAG GGGATCAAGG AGACGGCGGC CGCGATCGAG GCGGCCGGCG GCACCGCGGC GGCAGCCGAG TGCGACGTCT CGAAGGAGGC CGACGTGGCC GCCGCGATCG GGGCGGCCGT GGCGCAGTTC GGCCGGATCG ACATCGTCTT CAACAACGTC GGCGTCCCGA CCCCGCGGCT GGGCGCGAAG CTGGAGGACC ACACCGCCGA GGATTTCCAG CGGCTGGTCG CGGTCAACTT CGGGGGCGTG TTCAACGGCT GCAAGCAGGC CGTCCTGCAG TTCAAGCGGC AGGGCGGCGG CGGGGCGATC CTCAACACCG GGTCGGTCGC CGGCCTCGTC GCCTGGGGCG GCTCCGTCTA CGGCGCCACC AAGGGCGCGG TGCACCAGCT GACCCGCGCG GTGGCCATCG AGGGCGCGCC GTTCGGCATC CGGGCGAACG CGATCTGCCC GGCCGGGATG CCGCACACCA ACTTCATGGC CGCCGGCGGG CTCACCATCT CGGGGGACGC CCGGGAGAAG ATGGTGGAGA ACGTCGGGGC CACCCATCCG CTCGGCCGCC CGATCACGGC CGAGGACTGC GCCGAGGCCG CCGTCTACCT GGTCTCCGAC CGCGCGCTCA ACATCACCGG TGTGCTGCTG CCGGTCGACG GCGGATACGT GGCGAGGTGA
|
Protein sequence | MMLEGKIAVV MGAGSGVGRA SALRFAEEGA RVLCADIRPE GIKETAAAIE AAGGTAAAAE CDVSKEADVA AAIGAAVAQF GRIDIVFNNV GVPTPRLGAK LEDHTAEDFQ RLVAVNFGGV FNGCKQAVLQ FKRQGGGGAI LNTGSVAGLV AWGGSVYGAT KGAVHQLTRA VAIEGAPFGI RANAICPAGM PHTNFMAAGG LTISGDAREK MVENVGATHP LGRPITAEDC AEAAVYLVSD RALNITGVLL PVDGGYVAR
|
| |