Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4527 |
Symbol | |
ID | 5672876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5400663 |
End bp | 5401703 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243392 |
Product | alcohol dehydrogenase |
Protein accession | YP_001508808 |
Protein GI | 158316300 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCACCG CAAAGGCCCG GGCCATGAGC GGCCCGACGG CACCGTTCAG CACGATCACC GTCGAGCGCC GCGACGTCGG CCCGCGCGAC GTGCTGATCG ACATCGCCTA CGCCGGGGTC TGCCACACCG ACGTCCACCA CGCCCGCGCG GAGTTCGGGC ACACCCGCTA TCCGATCGTG CCCGGTCACG AGATCGCCGG CATCGTCCGC GAGGTGGGCG CGGAGGTCGC CGGGCTGACC GCCGGCGACC ACGTGGGCGT CGGCTGCCTG GTCGACTCGT GCCGGGACTG CCCCGCCTGC CGCGCGGGGC AGGAGTCCTA CTGCCGCCGC GGCAAGGTGC TGACCTACAA CGGGGTGGGT CGTGACGGGG CGACCACGCT GGGCGGGTAC AGCGAACTCG TGGTCGTCGA CCAGCGGTTC GTCGCCCGCA TCCCCGATGC CCTACCGCTG GACGCCGCGG CCCCCCTGCT CTGCGCGGGC ATCACGATGT ACCAGCCGCT GCAGCGCTGG GGCGCGGGCC CCGGCAGGCG GGTCGGCATC CTGGGGTTCG GCGGGCTCGG GCACATCGGC GTCCAGATCT CCCACGCGCT CGGCGCGCGC ACGACGGTCC TGGAACTCAC CGAGGACCGC CGCGCCGACG CCGAGCGCCT CGGGGCGGAC GACTACCGGA CGACCGGCGA CCTGGGCGCG CTGCGGGACT CGTTCGACCT GATCGTGTCG ACGGTCCCGA CGAACTACGA TCTGTCCTCC CACCTCGACC TGCTCGACCT GGACGGCACG TTCGTCAACC TCGGCGTGCC CGACGAGCCG CTGCGCGTCG ACCCCTACAC GCTGCTGACG AACCGGCGCG TGCTGGCCGG TTCGATGAGC GGCGGCATGC CGCAGACGCA GGAGATGCTC GACTTCTGCG CCGAGAACGG CATCAGGGCC GAGGTGGAGG TCGTCGCGGC GAAGGAGCTC GACCAGGTCT ACGACCGCCT CAGTGCCGGC GACGTCCGGT ACCGGTTCGT GCTCGACGTC GCGACCATCG CCGAGTCCTG A
|
Protein sequence | MITAKARAMS GPTAPFSTIT VERRDVGPRD VLIDIAYAGV CHTDVHHARA EFGHTRYPIV PGHEIAGIVR EVGAEVAGLT AGDHVGVGCL VDSCRDCPAC RAGQESYCRR GKVLTYNGVG RDGATTLGGY SELVVVDQRF VARIPDALPL DAAAPLLCAG ITMYQPLQRW GAGPGRRVGI LGFGGLGHIG VQISHALGAR TTVLELTEDR RADAERLGAD DYRTTGDLGA LRDSFDLIVS TVPTNYDLSS HLDLLDLDGT FVNLGVPDEP LRVDPYTLLT NRRVLAGSMS GGMPQTQEML DFCAENGIRA EVEVVAAKEL DQVYDRLSAG DVRYRFVLDV ATIAES
|
| |