Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3267 |
Symbol | |
ID | 6132130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3619865 |
End bp | 3621583 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641643454 |
Product | respiratory-chain NADH dehydrogenase domain-containing protein |
Protein accession | YP_001770106 |
Protein GI | 170741451 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00072788 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.294343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGC ACGATGTGTC GCGCGTCAAG CGCTTCGAAC ACCCCGGACG CGGTCGCGAG CGGGCGAGGC CGGTGCCGAA GGGGCGCCAG GTCGATCCCC GGGCGAAGGC CGAGATCGCC GCGCTCCTCG GCGAGGCGCC GCGCCGCCGC GACCTCCTCA TCGAGCACCT GCACCTCGTC CAGGACACCT ACGGGCAGAT CAGCGCCCCC CACCTCGCGG CGCTCGCCGA CGAGATGGGG CTCGCCTTCG CGGAGGTGTT CGAGACCGCG ACCTTCTACG CCCATTTCGA CGTGGTGAAG GAGGGCGAGG CGGCGATCCC GGCGCTGACC GTGCGGGTCT GCGACAGCCT CACCTGCGCG ATGCACGGCG CCGAGGAGCT CCTCGCCGCG CTCCAGGCGG AGATCGGCGC GCAGGTGCGC GTGGTGCGCG CGCCCTGCGT CGGCCTGTGC GACCACGCCC CGGCCGCCGA GGTCGGGCAC AATTTCCTGC ACCGGGCCAC GGTCGAGACT GTGCGGGCGG CGGTCGCGGC CGGCGACACG CACGCGCATC TTCCCGACAC GATCGATTTC GACGCCTACC GGGACGCGGG CGGCTACCGG ACCCTGGAGC GCCTGCGCGC CGGCGATCTC TCGGTCGAGG ACGTGCTCAA GGTCCTCGAC GACGGCTCGC TGCGGGGCCT CGGCGGGGCG GGCTTCCCGA CCGGTCGCAA GTGGCGCTCG GTGCGCGGCG AGCCCGGCCC GCGCCTGATG GCGGTGAACG GCGACGAGGG CGAGCCCGGC ACCTTCAAGG ACCAGCTCTA CCTCAACACC GACCCGCACC GCTTCCTCGA AGGCACCCTG ATCGGCGCCC ACGTGGTCGA GGCCGAGGAC GTCTACCTCT ACATCCGCGA CGAGTACCCG ATCGCCCGCC AGATCCTGGC GCGGGAGATC GCCAAGCTGC CGCCGGGCGG GCCGCGCCTG CACCTGCGGC GCGGGGCCGG AGCCTATATC TGCGGCGAGG AATCCTCGCT GATCGAGTCG ATCGAGGGCA AGCGCGGCCT GCCGCGCCAC AAGCCCCCGT ACCCGTTCCA GGTCGGCCTG TTCGGGCGCC CGACGCTGAT CAACAACGTC GAGACGCTGT ACTGGATTCG CGACCTGATC GAGCGCGGCC CGGATTGGTG GAAGAGCCAC GGCCGCAACG GCCGCACGGG CCTGCGCTCC TACTCGGTGT CGGGCCGGGT GAGGGAGCCG GGCGTGAAGC TCGCGCCGGC CGGGGTCACC ATCCGCGAGT TGATCGAGGA ATTCTGCGGC GGCATGGCCG AGGGCCACCG CTTCGCGGCC TACCTGCCGG GCGGCGCCTC GGGCGGCATC CTGCCGGCCG CGATGGACGA CATCCCGCTC GATTTCGGGA CCCTCGAGAA GTACGGCTGC TTCATCGGCT CGGCCGCCGT GGTGGTGCTC TCCGACAAGG ACGACATCCG CGGTGCGGCC CTCAACCTGA TGCGGTTCTT CGAGGACGAG TCCTGCGGCC AATGCACGCC CTGCCGGGTC GGCACCCAGA AGGCCCGCAT GCTGATGGAG AGCGGGGTCT GGGACACCGA CCTCCTCGGC GAGTTGTCGC AATGCATGCG CGACGCCTCG ATCTGCGGCC TCGGCCAAGC GGCCTCGAAT CCGCTCACCA GCGTCATCAA GTATTTCCCG GACCTCTTCC CGACCCCGCG GCCCATGGCG GCCGAGTAG
|
Protein sequence | MSQHDVSRVK RFEHPGRGRE RARPVPKGRQ VDPRAKAEIA ALLGEAPRRR DLLIEHLHLV QDTYGQISAP HLAALADEMG LAFAEVFETA TFYAHFDVVK EGEAAIPALT VRVCDSLTCA MHGAEELLAA LQAEIGAQVR VVRAPCVGLC DHAPAAEVGH NFLHRATVET VRAAVAAGDT HAHLPDTIDF DAYRDAGGYR TLERLRAGDL SVEDVLKVLD DGSLRGLGGA GFPTGRKWRS VRGEPGPRLM AVNGDEGEPG TFKDQLYLNT DPHRFLEGTL IGAHVVEAED VYLYIRDEYP IARQILAREI AKLPPGGPRL HLRRGAGAYI CGEESSLIES IEGKRGLPRH KPPYPFQVGL FGRPTLINNV ETLYWIRDLI ERGPDWWKSH GRNGRTGLRS YSVSGRVREP GVKLAPAGVT IRELIEEFCG GMAEGHRFAA YLPGGASGGI LPAAMDDIPL DFGTLEKYGC FIGSAAVVVL SDKDDIRGAA LNLMRFFEDE SCGQCTPCRV GTQKARMLME SGVWDTDLLG ELSQCMRDAS ICGLGQAASN PLTSVIKYFP DLFPTPRPMA AE
|
| |