Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3186 |
Symbol | |
ID | 6134073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 3525451 |
End bp | 3526971 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641643374 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001770026 |
Protein GI | 170741371 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00775198 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCCGA CCGTCCTCTC CATCCGCGAC GACCTCCTCG CCGTGCTGGA GCGGCTCGGC GTCCCGGCCT CCTTCGCCAG GACGGGCCTG CCGGTCCGCT CGCCCATCGA CGGGCTGGTG ATCGGCCACC TGCCCGAGAC CCGGGAGGCC GGGCCGGCCA TCGCGGCGGC CGCGGAGGCC TTCCCGGCCT GGCGCAGGGT GCCGGCGCCC CGCCGCGGCG AGCTGGTGCG GCTCCTCGGC GAGGAGTTGC GGGCCGCCAA GGCCGATCTC GGCCGGCTCG TCACCCTGGA GGCCGGCAAG ATCCTCTCCG AGGGCCTCGG CGAGGTCCAG GAGATGATCG ACATCTGCGA CTTCGCGGTC GGCCTCTCGC GCCAGCTCCA CGGCCTCACC ATCGCGACCG AGCGCCCGGA CCACCGCATG ATGGAGGTCT GGCACCCGCT CGGGCCCTGC GGGGTGATCA CGGCCTTCAA CTTCCCGGTG GCGGTCTGGT CCTGGAACGC GGCGCTCGCC CTGGTCTGCG GCGATCCGGT GGTCTGGAAG CCCTCGGAGA AGACGCCGCT CACGGCCCTG GCCGTACAGG CCATCGCCGG GCGCGCGCTC GCGCGCTTCG GGCCCGAGGC GCCCCCCGGG CTGCTCTCGG TGCTGATCGG CGGGCGGGCC CTGGGCGAGG CGCTGGTGGC CGATCCGCGC ATCCCCCTCG TCTCGGCCAC CGGCTCGACC GCGATGGGGC GTCGGGTGGC GCCGATCCTG GCCGCGCGCT TCGCGCGCGC CATCCTGGAA CTCGGCGGCA ACAACGCCGC GATCATCGCG CCCTCGGCGG ATCTCGACCT CGCGCTGCGG GCCGTCGCCT TCGCGGCGAT GGGCACGGCC GGCCAGCGCT GCACCACGCT GCGGCGCCTC TTCGTGCACG AGAGCGTCGC CGGGAGCTTC CTGCCGCGGC TGCGGCAGGC CTACGCCTCG GTGCGGATCG GCGACCCTCG CGACCCGGCG ACGCTGGTCG GCCCGCTGAT CGACGTCGCG GCGGCCGAGG CCATGGCCCG CGCCCTCGAC GAGGCCCGCG CCCTCGGCGC CGCGATCCAT GGCGGGCAGC GCCTGACCGA CATCCGCGGC GAGGAGGCGG CCTATCTCCG CCCGGCCCTG GTCGAGATGC AGGCGCAGGC CGGCCCGATG CTGCGCGAGA CCTTCGCGCC GATCCTCTAC GTGGTGCCCT ACCGCGACCT CGCCGCGGCG ATCGCGGCCC AGAACGCGGT GGCGGCGGGC CTCTCCTCCT CGATCTTCAC CCGCGACCTC GGGGAGGCCG AGACCTTCCT GTCGGCGGCC GGGTCCGATT GCGGCATCGC CAACGTCAAT ATCGGCCCCT CGGGCGCCGA GATCGGCGGG GCCTTCGGCG GCGAGAAGGA GACGGGCGGC GGGCGCGAGG CCGGCTCGGA TGCCTGGAAG GCCTACATGC GGCGGGCGAC GAACACGATC AATTACGGCC GCGCCCTGCC CCTCGCCCAG GGCGTCAGCT TCGATCTCTG A
|
Protein sequence | MAPTVLSIRD DLLAVLERLG VPASFARTGL PVRSPIDGLV IGHLPETREA GPAIAAAAEA FPAWRRVPAP RRGELVRLLG EELRAAKADL GRLVTLEAGK ILSEGLGEVQ EMIDICDFAV GLSRQLHGLT IATERPDHRM MEVWHPLGPC GVITAFNFPV AVWSWNAALA LVCGDPVVWK PSEKTPLTAL AVQAIAGRAL ARFGPEAPPG LLSVLIGGRA LGEALVADPR IPLVSATGST AMGRRVAPIL AARFARAILE LGGNNAAIIA PSADLDLALR AVAFAAMGTA GQRCTTLRRL FVHESVAGSF LPRLRQAYAS VRIGDPRDPA TLVGPLIDVA AAEAMARALD EARALGAAIH GGQRLTDIRG EEAAYLRPAL VEMQAQAGPM LRETFAPILY VVPYRDLAAA IAAQNAVAAG LSSSIFTRDL GEAETFLSAA GSDCGIANVN IGPSGAEIGG AFGGEKETGG GREAGSDAWK AYMRRATNTI NYGRALPLAQ GVSFDL
|
| |