Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2032 |
Symbol | |
ID | 5670433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2441642 |
End bp | 2442952 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240953 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001506375 |
Protein GI | 158313867 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0304773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.779517 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCCCG GCGCCCGGGA CGCGGGCTGG AGCCTGGCCG ACGTGCCGGA CCAGCGGGGA CGGACCGCCG TCGTCACCGG TGCGAACAGC GGCATCGGCT TCGAGGCGGC GAAGGCGCTC GCCGAGCGCG GCGCGACCGT CGTCCTGGCC TGCCGCGAGC CGGCCCGGGC CGAGCAGGCC GCCGCACGCA TCCGGGCCGC CGCGCCGCGG GCGGACGTCC ACGTCGAGCC GCTCGACCTG ATCTCGATGA CCTCCGTGCG CGCGGCCGCC GAACGGATCC GGGCCGCCCA CCCGCGGCTC GACCTGCTCG TCAACAACGC CGGGGTGATG CTGCCGTCGC CCGGTCCCGG CGGTGCCGCC CTCGGTGCCG GCTTTGGTGA CGACGGCCTC GGTGATCACG GCATCGAGCC GACGTTCGGG GTCAACCACC TGGGGCACTT CGCGCTGACG GGCCTCCTGC TCGACCGCCT TCTGGCGACG CCGCACTCGC GGATCGTGAC GGTCAGCAGC CTCGCCCACC TCGTCGGCCG CCTCGACCTC GAGCAGCTGC GCCGCCTCCC GCCGCACGCC GACCGACCGG GGGGCAGCCC GCCCGTGGTC GTGCCCGACC AGCGTGCCGG GTCGCCGTCC GAACCACCCC TGGCGTCGCC GGCCCGCCAG CGTTCACAGC GGAGGTCAGA AACACAGCGG AGGTCAGAAA CACAGCGGAG GTCAGAAACA CAGCGGGGGT CAGGAACACC GCGCGCGCAG GGCATGCAGC GCCTCCAGAG TCTTCAGGGC ACGCGGGGCC CGCAGAGCCG GCGGCTGCCC CGGCAGCGCC TGCCGTATCC GACATCCAAG CTGGCGAACC TGATGTTCAC CTTCGAGCTG CAGCGCCGGC TCGCGGCCAC CGGAGCGTCG ACGATCGCGG TGGCCGCGCA CCCGGGCATC GCCCGCACGG GGCTCACCCG CAACCTCATG CCCCCGGCGC GGGTGTTCAT GGGCCAGCGA CTGGCACCGC TGATGTCGTG GCTGGTCCAG GAGCCGGAGA TCGGCGCGCT CGCCACCGTC CGGGCCGCCC TCGACCCGGC GGTGCACGCC GGTGAGTACT ACGGACCGTC AGGGCTCTTC CGCAGCACCG GTCCGCCCGC TGTCGGGCGA TCCAGCGCCC GCTCGCGCGA CGCCCGCCAG CAGCAGCGGC TGTGGGCGGA GTCCGAGCGG CTCACCGGCG TCACTTACGA CTTCGATCCC TCCTCGGGGA CAGGGCCCGG CGGTGTCGGA TCCTGGAGGA ACAACGGGAG AAGCGCGTCG CGGAGCGCGG AACTGACCTG A
|
Protein sequence | MRPGARDAGW SLADVPDQRG RTAVVTGANS GIGFEAAKAL AERGATVVLA CREPARAEQA AARIRAAAPR ADVHVEPLDL ISMTSVRAAA ERIRAAHPRL DLLVNNAGVM LPSPGPGGAA LGAGFGDDGL GDHGIEPTFG VNHLGHFALT GLLLDRLLAT PHSRIVTVSS LAHLVGRLDL EQLRRLPPHA DRPGGSPPVV VPDQRAGSPS EPPLASPARQ RSQRRSETQR RSETQRRSET QRGSGTPRAQ GMQRLQSLQG TRGPQSRRLP RQRLPYPTSK LANLMFTFEL QRRLAATGAS TIAVAAHPGI ARTGLTRNLM PPARVFMGQR LAPLMSWLVQ EPEIGALATV RAALDPAVHA GEYYGPSGLF RSTGPPAVGR SSARSRDARQ QQRLWAESER LTGVTYDFDP SSGTGPGGVG SWRNNGRSAS RSAELT
|
| |