Gene Information |
Locus tag | Francci3_4138 |
Symbol | |
ID | 3907103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4939458 |
End bp | 4940369 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881466 |
Product | short chain dehydrogenase |
Protein accession | YP_483215 |
Protein GI | 86742815 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
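The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4939458, End bp 4940369.
length_bp = gene_length(4939458, 4940369)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 912 303
```

Under these assumptions the results agree with the listed Gene Length (912 bp) and Protein Length (303 aa).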
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.452621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGACA CGGAATCTGG TCACAGCCCT CAGCCGGCTT CGGCGCTCGG CCGGGGCGAT GCCGCACCAC CGGGCCCCGT CCGCCGCACC TGGCTGGTCA CCGGGGCTTC GCGGGGACTG GGCCGCGCCA CCAGCGAGGT TCTGCTGGCC GCGGGCGACC GGGTCGTCGC GACGGCCCGC GACCGTGACA CGCTCAAGGA GCTGGCTGAA CGGTATCCCG ACCAGCTTCT CCGGTTCGCC CTGGACGTGA CCGACCGGTC AATGGTGTTC GACGTCGTGG ACCGAGCGGT CGCCGCTGCG GGAAGCATCG ACGTACTGCT GAACAACGCT GGCTACGGGC TTTCGGGTGC AGTCGAGGAA GTCACCGAGA AGCAGGCCCG CGCACAGCTG GACGTCAACT TCTTCGGTGC ATTGTGGTGC ACTCAGGCGG TGCTGCCTGT TATGCGCAGG CAGGGTGGCG GCCACATTCT TCAGATGTCC AGCATCGCCG GCGTGGCGAC CTATCCTAAC ATCGGCATGT ACCACGCCAG CAAGTGGGCG CTCGAAGGCA TGAGCGAGAC TCTGGCACAC GAGGTCGCCG ATTTCGGCAT CCGTGTGACG ATCGTCGAGC CGGGCGAGTT CCGGACCGAC TGGAGCGCCT CCAGCATGGA GCGGGCGACG CCGATGACCG AGTACGACGA CGTGCTCGCC CGGCGTCGGC ACGGCATGTC CGGGGTGTAC GCGCACGTCC AGCCTGGTGA TCCCCACAGG CTGGGCAAGG CGCTGCGGAC CGTCGTCGAC GCCGCGCAGC CACCGCTGCG GATCCTGCTC GGCAACAGCG CGGCCGACCT CGCGCCGCAG GTGTACCGCG AGCGGCTCGC GGAATGGGAG CGGTGGGATG CGCTGGCGCG GACGACGGAT TTCGCCAGCT GA
|
Protein sequence | MSDTESGHSP QPASALGRGD AAPPGPVRRT WLVTGASRGL GRATSEVLLA AGDRVVATAR DRDTLKELAE RYPDQLLRFA LDVTDRSMVF DVVDRAVAAA GSIDVLLNNA GYGLSGAVEE VTEKQARAQL DVNFFGALWC TQAVLPVMRR QGGGHILQMS SIAGVATYPN IGMYHASKWA LEGMSETLAH EVADFGIRVT IVEPGEFRTD WSASSMERAT PMTEYDDVLA RRRHGMSGVY AHVQPGDPHR LGKALRTVVD AAQPPLRILL GNSAADLAPQ VYRERLAEWE RWDALARTTD FAS
|
| |