Gene Information |
Locus tag | Francci3_2209 |
Symbol | |
ID | 3906348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2583822 |
End bp | 2584937 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637879541 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_481307 |
Protein GI | 86740907 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
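The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2583822
END_BP = 2584937


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1116 371
```

Both results match the Gene Length (1116 bp) and Protein Length (371 aa) fields in the record.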
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00644027 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAGCGG ATGGGGCCGG CCCGGGGGTG CGGACGTTGC GGCTGGTGAA CCAGGATCAC GGGTTTGACT GTCCGGGGTG CGCGTGGCCG GATCCGCCGG TGGGGGAGCG GTCGGTGGCC GAGTTCGCCT TCACCCCGCC GCGCCGCCAC GGGTTCGATG CGGTGGGGAC GATCCGGGCG ATGCGCGACG GCCGGGTGAG GGTGTTCCTC GGCATGGGCG GGAACTTCGT GGCCGCGAGC CCGGACACCG CGGTGACGGA GGCGGCGATG CGGTCGTGCC GGCTGACGGT CCAGGTGTCG ACGACGCTGA ACCGGTCGCA TGTGGTGACG GGCCGGGCGG CGTTGATCCT GCCGGCGCTG GGCCGTACGG AGATCGACGT GCAGGCCGCC GGACCGCAGC AGGTCAGCGT CGAGGACTCG ATGGGGATGG TGCACGCCTC CCGCGGCGGT CTGGCGCCGG CCGGCCCGGG GCTGCGCTCG GAGGTGGCGA TCGTCTGCGG CGTCGCGGCG GCCACCCTGG CCGGCCAGCC GGAGGTGGCC GAATCGGGGA CAGCGGACCG GGTGGGGCTC GCCGGGGACT ACCGGCGGAT CCGCGCCCAC ATCGCCCGGG TCGTCCCCGG GTTCACCGAT TACGAGGCGG GCCTGGCCGA GCTGGGGGGA TTCCCGCTCC CGCACCCGCC GCGGGACAGC CGGACGTTCC CGACGCCGAG CGGGCGGGCC GCGCTGACGG TCAACACCTG TGAGGTGCTG CGGGTCCCGC CGGGACACCT GCTGTTGCAG ACCGTCCGCT CCCACGACCA GTACAACACG ACGATCTACG GCATGGACGA CCGGTACCGC GGGGTGCGCC GCGGCCGGCG CGTCGTGTTC GTCCACCCGG ACGACCTCGA CGACCTCGGT ATCGCCGACG GCACCCACGT CGACCTCGTT GGGGTCTGGA CGGACGGGAT GGACCGGCGC GCGGAGAACT TTCGCGTCGT GGCCTACCCG ACCGCCCGCG GCTGCGCCGC CGCCTACTTC CCGGAGACCA ACGTCCTGGT CCCCCTCGAC AGCACCGCCG CCCGCAGCAA CACCCCCACC TCGAAATCCC TGATCATCCG CCTGGAGGCA GGCTGA
|
Protein sequence | MGADGAGPGV RTLRLVNQDH GFDCPGCAWP DPPVGERSVA EFAFTPPRRH GFDAVGTIRA MRDGRVRVFL GMGGNFVAAS PDTAVTEAAM RSCRLTVQVS TTLNRSHVVT GRAALILPAL GRTEIDVQAA GPQQVSVEDS MGMVHASRGG LAPAGPGLRS EVAIVCGVAA ATLAGQPEVA ESGTADRVGL AGDYRRIRAH IARVVPGFTD YEAGLAELGG FPLPHPPRDS RTFPTPSGRA ALTVNTCEVL RVPPGHLLLQ TVRSHDQYNT TIYGMDDRYR GVRRGRRVVF VHPDDLDDLG IADGTHVDLV GVWTDGMDRR AENFRVVAYP TARGCAAAYF PETNVLVPLD STAARSNTPT SKSLIIRLEA G
|
| |
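The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where GTG is an accepted alternative start codon and the initiating residue is methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The `translate` function is a hypothetical helper, and the codon table is built from the standard NCBI base ordering (T, C, A, G), whose amino acid assignments table 11 shares with the standard code.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an allowed start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator methionine regardless of codon
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGGGAGCGG ATGGGGCCGG CCCGGGGGTG"))  # MGADGAGPGV
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should reproduce the listed protein, provided the sequence is copied without loss.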