Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0588 |
Symbol | |
ID | 5669005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 680387 |
End bp | 682402 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239515 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001504953 |
Protein GI | 158312445 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGA TATCGGTTAC CGACACGAGG ACGGTCCGCA GCTTCTGCCG GATCTGCACG TCCGTGTGCG GCATCCTCGT CGAGACGGCT GGTGACAAGG TAGTTCGGGT ACGGGGCGAC CGTGACCACC CACTGTCGCG GGGATACACC TGTCCGAAGG GCCGGTCACT CCCGCAGATG CACCATCATC CGGATCGCAT CGAGCGTCCG CTGATGAAGG TCGACGGGGA GCTGCGGCCG ACGACGTGGG AGGAGTGCCT GGACGATCTC GGCGCCCGGC TGCGAAACAT CATCGAGCGG TACGGGCCCG AGTCGGTCGG TGTCTTTTTC GGGAGCGGCA TCGGCATGGA CGCCGCCGGT TACCGGATGG CGCAGGCCCT GCACGCCGCG ATCGGCACGC CGGCGAAGTT CAGTCCCATG ACCATCGACG GAACGGCCAA GGTGCTGACC GCGGATCTGG TGGGCGGTTC ACCGGCTCTC AGCGGCCGGC CCGACTACGA CAACGCCTCG TTCGTCCTCT TCGTCGGCAG CAACCCGGTG GTGTCCCACG GGCACACCGT CGCGATGCCG AACCCCACGG GCACCTTACG GGCGCTGCGG GAGCGGGCGG AGGTGTGGGT CATCGACCCC CGTCACACCG AGACCGCCCG CCTGGCCGGC CACCATCTCG CGCCGCGTCC CGGCACCGAC TACGCGGTCC TCGCCTACCT TGTCCGTGAG ATCCTCCGCG ACGGCGCCGA CCGCGAGATG CTCTCCCGTC ACACCCAGGG TGGCGAGATC CTGGCTGCCG CCGTTGAGCC GTTCACTCTG GAGCACGCCG CCCGAATCGC CGATGTCTCC GCCGACGAGC TGGCCGCGCT CCTCGCCGGC GTGCGACGAG CGGGGCGCGT CGCGATCGAA ACCGGAACCG GCGTCACCAT GGCGTCCAGC GCGAACGTCA CGCAGTGGCT CGCCTGGTCA CTAATGATTA TCACTGGGTC GATGAACCAG CCCGGCGGCG CATGGTTCCA CCCCGGCTTC AAAAACCAGC TGGAGGCCTT CAAGCTGCCG ATCTCGCCGC CCGAAGGCTC GTTCGGGCCG GGCCCGCGCA GCCGTCCGGA GACACAGTCC TTTCTCGGCG AGTGGCCCTG TGCCGTTCTG GCCGACGAGA TCCGCGCGGG CAACATCCGG GCGGTCCTCA ATCTCGGCGG CCATCTCGTC GCGGCCTTCC CCGACACCGA GACGCTGGTT CCCGCGCTGC GGGACCTGGA GCTGTTCGCC ACCATCGAGA TCATCGGCAA CGAGACGACG GCCCTGTCCA CCCACGTCCT GCCGACCAAG GACCAGCTGG AGCGGGCCGA CGTGAGCCTG TGGGACTTCC TGATACAGCG CGTCGCCGTC CAGCACACCC CTGCCGTCGT CGAACCGGTC GGGGACCGGC GTTCCGTGTG GTGGGTGCTC GCGGAACTCG GACAGCGCCT CGGTTACCAG CTCGCCGACA GCAGATCCGG GCAGGTCACC GACGACACCC TGCTCGCCGA GATCACCGCC CACGCCCGGC GCCCGTTCGG TGAGGTCGTC TCCGAAGGCT GGGTCGAGGT ACCCCGCGAG ATTCCCGCGC CGTGGGTGGA CGGGCACGTC GAGCGGATGG GGGGATGGCG CCTCGCTCCC CGGCTGCTCG TCGACCAGCT GGCCGCGCTC CAGCCTCCCG CCCCGCTCGT CCTCACACCA CGACGCCAGA AGCGCCATCT GAACTCCCAG TTCGACTACC TCGGAGAACA GCCCGAGATC ATCCTGCATC CCGACGACGC GGCAGCGGCC GGCGTGGTCG ACAGTCAGCC GGTGACCGTC CGCTCGACCA GCGGCGAGAT CACCGGGATC GCGAAGGTCG ACGGCACCAT CCGCCGTGGA GCGGTCTCGA TACCCCACGG CCACCAGTCG GCGAACGTCA ACCGGCTGAC GGACAAGAGC CAGGTCGACA TCGTCACCGG CATGGTCCGC TACTGCGGCA TCCCGGTGAG CGTCCACCCG GCATAG
|
Protein sequence | MTEISVTDTR TVRSFCRICT SVCGILVETA GDKVVRVRGD RDHPLSRGYT CPKGRSLPQM HHHPDRIERP LMKVDGELRP TTWEECLDDL GARLRNIIER YGPESVGVFF GSGIGMDAAG YRMAQALHAA IGTPAKFSPM TIDGTAKVLT ADLVGGSPAL SGRPDYDNAS FVLFVGSNPV VSHGHTVAMP NPTGTLRALR ERAEVWVIDP RHTETARLAG HHLAPRPGTD YAVLAYLVRE ILRDGADREM LSRHTQGGEI LAAAVEPFTL EHAARIADVS ADELAALLAG VRRAGRVAIE TGTGVTMASS ANVTQWLAWS LMIITGSMNQ PGGAWFHPGF KNQLEAFKLP ISPPEGSFGP GPRSRPETQS FLGEWPCAVL ADEIRAGNIR AVLNLGGHLV AAFPDTETLV PALRDLELFA TIEIIGNETT ALSTHVLPTK DQLERADVSL WDFLIQRVAV QHTPAVVEPV GDRRSVWWVL AELGQRLGYQ LADSRSGQVT DDTLLAEITA HARRPFGEVV SEGWVEVPRE IPAPWVDGHV ERMGGWRLAP RLLVDQLAAL QPPAPLVLTP RRQKRHLNSQ FDYLGEQPEI ILHPDDAAAA GVVDSQPVTV RSTSGEITGI AKVDGTIRRG AVSIPHGHQS ANVNRLTDKS QVDIVTGMVR YCGIPVSVHP A
|
| |