Gene Information |
Locus tag | Francci3_0087 |
Symbol | |
ID | 3905131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 106217 |
End bp | 107347 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637877417 |
Product | dehydrogenases and related proteins-like |
Protein accession | YP_479210 |
Protein GI | 86738810 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
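The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither is stated in the record, but both reproduce its values. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(106217, 107347)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 1131 376
```

The strand field does not affect this calculation: for a minus-strand gene the coordinates still describe the same span on the replicon.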
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.467087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCACG GACCTGCTCC GGGCTCGGAT GGCGAGCGCT CGGCCTCCGA TCGGTTCTCG GCGACAATCG GGTTGACCGG CGACCCTCGC CACCGGACGG GGAGCACTAT CAATGCTGTG ACGACGCTCC CCGATCTGGT GCCGCCGCGC GATCCCGCCC TTTCTCAGGC CTGGCGGGAG CTGCGGCGCG TCGTGCGTCG GCGGCAGCCG GACCTGCCCC TCGCGATGGC CCTGGTCACC GACGGCTCGG ACAGCACTAC CGCCGAGGTG CTGCGCGATG CCGGGGTCGA CGTCGTCGGT CTCCTCGCCC CGGAGCCGCT GGAGTCCCTG GCCTGGGCCG CCGAGGCGGG TGTCCGACGG GCGTACGCCG ATCTGCTCAC CTTGCTCTCG GACGACATCG AGGCGGTCTG CGTGGAGATG CCGCCGCCCG CCTCCGACAT CGTCGCCCGG CAAGCGGCCG AGATCGGCCT GCACGTGCTC CTGGCCGGGC CGGCGACCGT CGAGGCCGAG GCCCTGCGCG CGGTCGCCGA CCTCGCCGAG GAGGGTGATC TCGCCCACGT GGTCGCGCTC GACGGGCGGG CCTGGCCGGC TGCCACCCAC GTCGCGGCGA CCGTTCCCTC CCTGGGCCGG CTCACCCAGA TGACCGTGGT GGGCGCGCCG AACGGGCCCG CCGGGCGAGT CGAGATCATC GATCTGGCGA TGCGCTGGTG CGGCGAGATC CTCGCGGTCT GCGCCGATCC GGCCGCGATG CCCGCCCCGG CGCTCACCCC GGACGCACCG GTGACGCTGG CCCTGCTGGC GGCGAGCGGC GCGACGGTCC TGATCAACGA GCAGATGGGC GGGAACCTCG GCACAGCCAC CCTCACGGTA TGCGGCGACT CCGGACGCAT CGTCGTCCGG GGCCGTCTGG TCCGCCGCCA GGACGGCAGC GGCATCCGGG ATCTGATGAT GCCGACCGTA CCGACCTCCC GGCCGGGGCT GATGGAGGCC ACCTACGACG TGGTGCGTGC CACCGAACTC GACGACGCCC GGCTGGTCCG CGGCGCCACC TTCCACGACC TGCTCACCGC GAACCACCTG ATGGCCGCCG CGCAGACCTC CCATCAGCAG GGCGGTTGGG TGGAGCTCTG A
|
Protein sequence | MRHGPAPGSD GERSASDRFS ATIGLTGDPR HRTGSTINAV TTLPDLVPPR DPALSQAWRE LRRVVRRRQP DLPLAMALVT DGSDSTTAEV LRDAGVDVVG LLAPEPLESL AWAAEAGVRR AYADLLTLLS DDIEAVCVEM PPPASDIVAR QAAEIGLHVL LAGPATVEAE ALRAVADLAE EGDLAHVVAL DGRAWPAATH VAATVPSLGR LTQMTVVGAP NGPAGRVEII DLAMRWCGEI LAVCADPAAM PAPALTPDAP VTLALLAASG ATVLINEQMG GNLGTATLTV CGDSGRIVVR GRLVRRQDGS GIRDLMMPTV PTSRPGLMEA TYDVVRATEL DDARLVRGAT FHDLLTANHL MAAAQTSHQQ GGWVEL
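The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, compute a GC percentage, and check that a sequence looks like a complete coding sequence under translation table 11 (the bacterial code, in which GTG is an accepted start codon and is translated as M at the initiation position). The helper names are hypothetical, and the short input is a toy example rather than the full gene.

```python
# Start and stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def is_complete_cds(seq: str) -> bool:
    """True if the sequence is a whole number of codons with a valid start and stop."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS_TABLE_11
        and seq[-3:] in STOP_CODONS_TABLE_11
    )


# Toy input in the same blocked layout as the record (not the real gene).
toy = join_blocks("GTGCGGCACG GATGA")
print(len(toy), is_complete_cds(toy))
print(f"GC content: {gc_percent(toy):.1f}%")
```

Passing the full gene sequence field through `join_blocks` and `gc_percent` should give a value comparable to the GC content reported in the record, allowing for rounding.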