Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0246 |
Symbol | |
ID | 5668671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 300704 |
End bp | 301759 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239176 |
Product | short chain dehydrogenase |
Protein accession | YP_001504619 |
Protein GI | 158312111 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.153805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.728482 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGC AGGCGACACC ACCCGCGACG GCCCGCACGC CGGCCGCGGC GGCCCGCACG CCCTGGGCTC CCGACAGGAG CGCCCTTCGC GGGCGCGTGG CCGTCGTCGC CGGCGCCACC CGCGGCGCGG GTCGCGGGAT CGCGGCGGCG CTCGGTGAGG CCGGCGCCAC CGTCATCTGC ACCGGCCGCA GCAGCAGGAC GGGCGTCCTG CGCTCCGACT ACGACCGCGC CGAGACGATC GAGGAGACCG CGGAGCTCGT CACCAAGCTC GGTGGTGCCG GCATCGCCGT CCCCGTCGAC CATCTGGACC CGGAGCAGGT GCGACGACTG GCCGACCGCG TCCGCGCCGA GCACGGGCAC CTCGACGTGC TCGTCAACGA CATCTGGGGC GGCGAGGTCC TCAAGGGCGG GCCGAGCGAG TGGGACACGC CGGTCTGGGA GCACGACCTC GACCGCGGGA TGCGCATCCT GCGGCTCGCC GTGGACACCC ACCTGATCAC CTCCCACCAC CTGCTCCCGC TGCTGATCGA CCGCCCAGGC GGGCTGGTCG TCGAGGTGAC CGACGGGACG ACGGACTACA ACGCGGCCAA CTACCGGATC TCCGTGTACT ACGACCTCGC CAAGGTCGCC GTGAACCGGC TCGCCTTCTC ACAGGGCCAC GAGATCGCCT CGCACGGCGG GACCGCCGTC GCCGTCACGC CCGGCTGGCT GCGCTCGGAG ATGATGCTCG AGGCGTTCGG CGTCACCGAG GAGACCTGGC GCGACGCCCT CACCCCGCCC GACAGCGCCT CGCCCAGCGG TGCTCCGCCC AGCGGTGCTC CGCCGGACTT CGCGTTCTCC GAGTCGCCGC GTTATGTCGG GCGCGCCGTC GCCGCGCTGG CCGCCGATCC GGGACGGGCC CGCTGGAACC AGCGCTCGGT GACCTCCGGC CGGCTCGCGG CCGAGTACGG CTTCACCGAC GTCGACGGGT CACAACCGGA CATCTGGCCG CGCCTGGAAC GTCCGGCGGA GGCCCCGGCG CCGGCGGCCG GTCAGGAGCC GCACCCGCGC CGGTAG
|
Protein sequence | MSAQATPPAT ARTPAAAART PWAPDRSALR GRVAVVAGAT RGAGRGIAAA LGEAGATVIC TGRSSRTGVL RSDYDRAETI EETAELVTKL GGAGIAVPVD HLDPEQVRRL ADRVRAEHGH LDVLVNDIWG GEVLKGGPSE WDTPVWEHDL DRGMRILRLA VDTHLITSHH LLPLLIDRPG GLVVEVTDGT TDYNAANYRI SVYYDLAKVA VNRLAFSQGH EIASHGGTAV AVTPGWLRSE MMLEAFGVTE ETWRDALTPP DSASPSGAPP SGAPPDFAFS ESPRYVGRAV AALAADPGRA RWNQRSVTSG RLAAEYGFTD VDGSQPDIWP RLERPAEAPA PAAGQEPHPR R
|
| |