Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1984 |
Symbol | |
ID | 3903692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2331352 |
End bp | 2332392 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637879320 |
Product | short chain dehydrogenase |
Protein accession | YP_481087 |
Protein GI | 86740687 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGAT CAGCTGACCA GCCGGGCGGC CGGCGCGGAG TGGTGGTGGT CACCGGGGGT TCGGCGGGGC TCGGTCGGGC GATCGTCCGG CAGTTCGCGC GCCGCGGCTA CGACGTGGGT GTCCTGGCCC GTGGCGAGGA TGGTCTGGCC GGTGCGGTAG CGGACGTGGC GGCGGCGGGC CGGCGCGGCC TCGCGCTGCC CGCGGACGTC GCCGACGCCG CGGGCGTGCA GGAGGCCGCC GAGCGGGCGC GCGCGGAGCT GGGGCCGGTC GACGTCTGGG TCAACAACGC GATGGCCAGC GTCTTCGCTC CCTTTCCCCA GATCACCCCG GAGGAGTTCG AACGCGCCAC AGTCACCACC TACCTGGGCC ATGTCAACGG AACCCGTGCG GCGCTGCGGC ACATGATGGC ACGTGACCAC GGGGTGATCA TCCAGGTGGG TTCCGCCCTC GCCTTCCGCG GTATCCCGTT GCAGTCGGCC TACTGCGGCG CCAAGCACGC CATCGTCGGC TTCACCGAGT CGGTGCTGAC CGAACTGCTG CACGACGGCA GCAACGTGCG GGTTGTGATG GTACACATGC CCGCGCTCAA CACCGTCCAG TTCAACTGGG TGTGCTCCCG GCTGCGTCAC CACCCGCGGC CCGTCCCGCC GATCTACCAG CCCGAGGTGG GCGCGCGTGC GGTGGTCCAC GCGGCCGAGC ACCCACGGCG CAGCATGTGG GTCGGCGTCT CCACCGTCGC CACCATCCTT GGCAACCGGA TCGCGCCCGC CCTGCTGGAC CGCTACCTCG CCCGCACCGG CTACGCCAGC CAGCAGGCAC CCGACGACCA CAACCCGATG CTCGGCGAGA ACATCTTCTA CCCGGTGCCC GGCGACCACG GCGCGCACGG CGATTTCGAC GACCGCGCCC ACGCCCACAG CCCGGAACTC TGGCTCAGCC AGCACCGCCG CGGGGTGCTC GCCGCCGTGG TCACCGCGGC CGGCATGGGA GTCGGGGCCG GCATGGGAGT CGGGGCCGAC CGGATGCCCC GGCACCGGTA G
|
Protein sequence | MRRSADQPGG RRGVVVVTGG SAGLGRAIVR QFARRGYDVG VLARGEDGLA GAVADVAAAG RRGLALPADV ADAAGVQEAA ERARAELGPV DVWVNNAMAS VFAPFPQITP EEFERATVTT YLGHVNGTRA ALRHMMARDH GVIIQVGSAL AFRGIPLQSA YCGAKHAIVG FTESVLTELL HDGSNVRVVM VHMPALNTVQ FNWVCSRLRH HPRPVPPIYQ PEVGARAVVH AAEHPRRSMW VGVSTVATIL GNRIAPALLD RYLARTGYAS QQAPDDHNPM LGENIFYPVP GDHGAHGDFD DRAHAHSPEL WLSQHRRGVL AAVVTAAGMG VGAGMGVGAD RMPRHR
|
| |