Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_6804 |
Symbol | |
ID | 6134381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 7485624 |
End bp | 7487786 |
Gene Length | 2163 bp |
Protein Length | 720 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641646885 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001773483 |
Protein GI | 170744828 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.981132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA TCGTCGACCC CGATTCCTTG ATCGGCGCCG AGTCCGAGAG CCGGACCGGG ACCTTCTTCG GCGGCTGCCC GCACGACTGC CCCGACACCT GCTCGATGCT GTTCGAGGTG AAGGACGGCG AACTCCTCGG CGTGCGCGGC AACCCCGACC ACCCGATGAC CCGCGGCGGC CTCTGCGTGA AGCTGAAGGA CTACGAGAAG CGCCACTACC ACCCGGACCG TCTTCTCTAC CCGATGCGCC GGGTCGGCCC GAAGGGCTCG AAGCAGTTCG AGCGCATCAC CTGGGACGAG GCGCTGGACA CGATCGTCGC CCGGTGGAGG GCGATCATCG ACCGTTACGG CCCGCAGGCC ATCGCCCCCT ACAGCTACCT CGGCAACCAG GGGCTCGTGC ACGGGCTCAA CGGCGGCGAC GCGTTCTTCA ACCGCCTGGG CGCCACCGTC ACCGAGCGGA CCTTCTGCGG TGAGGGCTCC TGCACCGCCT GGCTGCTCAC GGTCGGCCCG ACCGCGGGCC TCGACCCGGA CAGCTACATC CACTCCAAGT ACATTGTCAT CTGGGCCTGC AACTCGGTCA GCACGAACCT GCACCACTGG GCGATCGTCA AGGACGCCCA GCAGAAGGGT GCCAAGGTCG TCGTGATCGA CGCCTACGCC TCGCGCACCG CCAAGGGGGC CGACTGGCAC ATCGCACCGA AGCCCGGCAC CGACGGCGCG CTGGCGATGG CGCTGATCAA CGCGATCATC GCGCAGGGCC TCGTCGACCA GGATTACGTC GACAACCACA CGATCGGCTT CGAGGATCTC AAGGAGCGCG CGCGCACGCG CACGCCGGAA TGGGCCGCGG AGATCACCGG CGTCCCGGCC GAGGACATCC GCAAGCTCGC CTACGAGATG GCGACCGCCC AGCCGGTGGG CATCCGCATC GGCGTGGCGC TGGAGCGCCA CTACGGCGGC GGCCAGACCA TCCGGGCCGT CGCCTGCATC CCGGCGCTCA CCGGCGCGTG GCGCCACGTC GGCGGCGGCA TCACGCAGTT CGGCGTCTGG GAGCATCCCT ACAAGTTCGA CGTCATCTGC CGTCCCGACC TGATCCCGGA GGGCACCCGC GTCGTCAGCA ACCTGCAGAT CGGTCGGGCG CTCACCGGCG AGTTGAAGCT CGACCCCCCG ATCATGTCGA TGATGTGCTG GAACTCGAAC CCGGTCACCC AGGCGCCCGA GACCGACAAG ATCGTCGAGG GATTGATGCG CGAGGACCTG TTCCTGGTCT CGGCCGAGCA CTTCATCTCG GACACGGCCT CCTACGCCGA CATCCTGCTG CCGGCCGCGA TGGGCGCCGA GATGGAGGAC ATGATCCTCT CCTGGGGTCA CCTCTACCTG ACCTACAACA CCAAGTGCGC GCAAGCGCCC GGCGAGGCCA TTCCCAACAA CGAGATCTTC CGGAGGCTGG CCGCCCGGCT CGGCTTCGAG GAGGAGAACT TCAAGTGGTC GGACGCGGAG TGCCTGGAGC ACTACGTCGA CTGGAACTCT CCGGCCTGCG AGGGCATCGA CCTGCAATAC CTGCGCGAGC ACGGCTTCGC CCGGCTGAAG GTCGGCACGC CCGACGACCG GGCGCCGCAC CGCGAGGGCA ACTTCCCGAC GCCGACCGGC AAGTGCATGT TCAAGGTCGA GGGCGCCACG AACTTCGTCG CCCCGCCGTT CCGGCAAATG TACGAGGGGT TCCAGCCCGG CGAGGCGCTC GACCCGCTGC CCGACTATCT CGGCCCGCGC GAGTCCCCCG CGAGCGACCC CGCACTCGCC GCGCGCTACC CGCTCAACAT CGTCTCGCCC AAGAGCCACT ACTTCCTGAA CTCCTGCTAC GCGAACATGG AGGACAAGCA GAAGGGGCAG GGGGAGCAGT TCGTGATGAT CAGCCCCCGG GATGCCGAGG CGCGCGGCAT CGTGGATGGC GACCGCGTGC GGGTCGCCAA CGGCCGCGGC GGCTTCAAGG GCGTGGCGCG CGTCACCGAC GACGTGAAGT CCGGGATCGT GGTGGCCACG CTCGGCTACT GGCGCCAGCT CAACGAGGGC ACGGTGAACA GCATCTCGTC GTCGGCCTTC ACCGACATGG GGCATGCGCC CTCGTTCTCC GACAACCTCG TCGAGGTCTC GCGCGTCAAT TGA
|
Protein sequence | MNEIVDPDSL IGAESESRTG TFFGGCPHDC PDTCSMLFEV KDGELLGVRG NPDHPMTRGG LCVKLKDYEK RHYHPDRLLY PMRRVGPKGS KQFERITWDE ALDTIVARWR AIIDRYGPQA IAPYSYLGNQ GLVHGLNGGD AFFNRLGATV TERTFCGEGS CTAWLLTVGP TAGLDPDSYI HSKYIVIWAC NSVSTNLHHW AIVKDAQQKG AKVVVIDAYA SRTAKGADWH IAPKPGTDGA LAMALINAII AQGLVDQDYV DNHTIGFEDL KERARTRTPE WAAEITGVPA EDIRKLAYEM ATAQPVGIRI GVALERHYGG GQTIRAVACI PALTGAWRHV GGGITQFGVW EHPYKFDVIC RPDLIPEGTR VVSNLQIGRA LTGELKLDPP IMSMMCWNSN PVTQAPETDK IVEGLMREDL FLVSAEHFIS DTASYADILL PAAMGAEMED MILSWGHLYL TYNTKCAQAP GEAIPNNEIF RRLAARLGFE EENFKWSDAE CLEHYVDWNS PACEGIDLQY LREHGFARLK VGTPDDRAPH REGNFPTPTG KCMFKVEGAT NFVAPPFRQM YEGFQPGEAL DPLPDYLGPR ESPASDPALA ARYPLNIVSP KSHYFLNSCY ANMEDKQKGQ GEQFVMISPR DAEARGIVDG DRVRVANGRG GFKGVARVTD DVKSGIVVAT LGYWRQLNEG TVNSISSSAF TDMGHAPSFS DNLVEVSRVN
|
| |