Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mfla_0535 |
Symbol | |
ID | 3999405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | + |
Start bp | 560102 |
End bp | 562030 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637937433 |
Product | NHL repeat-containing protein |
Protein accession | YP_544646 |
Protein GI | 91774890 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000386483 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA TAAGTAGATT GATTGGCTTG CTGGCTGCGG TGGGTTGGAT GACTGCAGCC AACGCTGATC TCGCAGACGT TGAGGCTAAC CTGGGCCGGC TCAAGGTGCC GGAGGGGTTC AAGGTGGAGG TCTATGCCGA AGTGCCGGGC GCACGGCAAA TGGCACTGGG AACTTCCGGT ACAGTCTACG TCGGCACTCG CGGCAACAAG GTCTATGCTG TCGTGGACAA GAACAAGGAT CACAAGGCAG ACGAGGTGAT CACTATCCTT GATGACCTCA AGGTAGGCAA TGGTGTCGCC ATGTGGGAGG GGAATCTCTA CGTGGCCGAG CAGCACCGCA TCACCCGTTA CGCCGCCCCC GATTTTGACC TCAACCTGCC GTTCAAGCAG ATGCGCGAGG TGATTTACGA CCAACTGCCC GACAAGGTAC ATCATGGCTG GCGCTATATC GCCTTCGGCC CCGACAAGAA GCTCTATGCG ACGATTGGCG CGCCATGCAA CGTCTGTGAC CCGCAAGGCA TCGAAGCATC CATCATTCGC ATGGATCCGG ATGGCAAGAA TGTGGAGGTG TTTGCCAAAG GCGTGCGTAA TTCGGTCGGC ATGGATTTTC AGCCGGGCAC CAATGTCCTG TATTTCACCG ATAATGGTGT GGACATGATG GGCGATGATA TTCCGCCTGA CGAACTCAAT GCTGCTCCCC AGGCTGGACT GCATTTCGGT TTCCCTTACG TCGGTGGCCG GGATGCGCGT CCTAAGGACT GGCAGAACAA GAAGCCGCCC CAGGCCGTGA CGCCGCCTGT TGTCGAATTC CAGGCGCATA GCGCAAACCT GGGTTTCAAG TTCTATACCG GCAAGCAGTT TCCCCGTGAT TATCAGGGCA ATGCCATCGT TGCCCAGCAT GGTTCATGGA ACCGCAGCCA GCCCGTCGGT TACCAGCTGA TGCGTGTCGT GTTCGACGAG CAGCATCAGG TCAAGTCGCA CGAAGTTTTT ATCGAGGGAT GGCTCAACGA TGGTGAAGCA TGGGGTCGTC CTGTTGACGT ATTGCAGCTC AATGACGGTT CCTTGCTGGT ATCGGATGAT TACAGCGGCG TCATCTACCG CGTCAGCTAT GGCGAGTCTT CTGCCGCTGC TGCGCCTGGC CGTGGCCAGG CGTCCAAAGT GACCGGCTTG AACATGCCGG AGTCTGCTGT TGCACATCCA GACGGGCGTA TCTTCGTCAG CGAGATTGGC GAGTTCGGCA AGTCCGGTGA CGGTAAGATT ACCGTGATCA ACAAGGACGG CAGTCGCCGG ACGCTGGCCG ACGGTTTGAA TGACCCCAAG GGGCTCGACC TCTTCAACAA TCAGCTGTAT GTCGCGGATA TGGACCAGGT CGTGAGGGTT GGTCTGGACG GCAGCAAAAC CGTCATTGCC AAGTCCGGGG ACTTCCCGGA GAAGCCAATG TTCCTCAATG ACATTGAGAT TGATGGGCTG GGCAACGTCT ATGTCTCGGA TAGTGGGGAC GATGATGGCA AGCATGGCGC GATCTACCAG ATATCCCCTG AAGGGAAGAT CACCCAGCTG ATCAATGACA AGTCCGGCAT CAAGCGTCCG AACGGGCTGT TGCTGGATGG CCCCGGCAAG CTGCTGGTGG CCGATTTCGG CAATGGCAAG CTATTCCAGG TGAATTTTGC CAGCAAGAAG GCGAGCGTCA CGCTGCTCAA CCAGGGCTTT GGCGGCGCTG ACGGGCTGGT GCGCGATACT GATGGGTTGC TTTATGTCAG CGACTGGGCC GGCGGCAAGG TCTGGCAATT GACTGAGCCG CGCGCAACGC CGCAATTGAT CACCGAAGGC CATCAGTCTG CCGCTGATAT TGCCCTATCG GCCGATGGGC GCTTCCTGCT GGTGCCTGAT ATGAAGGCTG GCGAGCTGGT GCCATTGCCC ATCAAGTAA
|
Protein sequence | MKTISRLIGL LAAVGWMTAA NADLADVEAN LGRLKVPEGF KVEVYAEVPG ARQMALGTSG TVYVGTRGNK VYAVVDKNKD HKADEVITIL DDLKVGNGVA MWEGNLYVAE QHRITRYAAP DFDLNLPFKQ MREVIYDQLP DKVHHGWRYI AFGPDKKLYA TIGAPCNVCD PQGIEASIIR MDPDGKNVEV FAKGVRNSVG MDFQPGTNVL YFTDNGVDMM GDDIPPDELN AAPQAGLHFG FPYVGGRDAR PKDWQNKKPP QAVTPPVVEF QAHSANLGFK FYTGKQFPRD YQGNAIVAQH GSWNRSQPVG YQLMRVVFDE QHQVKSHEVF IEGWLNDGEA WGRPVDVLQL NDGSLLVSDD YSGVIYRVSY GESSAAAAPG RGQASKVTGL NMPESAVAHP DGRIFVSEIG EFGKSGDGKI TVINKDGSRR TLADGLNDPK GLDLFNNQLY VADMDQVVRV GLDGSKTVIA KSGDFPEKPM FLNDIEIDGL GNVYVSDSGD DDGKHGAIYQ ISPEGKITQL INDKSGIKRP NGLLLDGPGK LLVADFGNGK LFQVNFASKK ASVTLLNQGF GGADGLVRDT DGLLYVSDWA GGKVWQLTEP RATPQLITEG HQSAADIALS ADGRFLLVPD MKAGELVPLP IK
|
| |