Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2544 |
Symbol | |
ID | 5734422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3266966 |
End bp | 3270508 |
Gene Length | 3543 bp |
Protein Length | 1180 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279684 |
Product | methionine synthase |
Protein accession | YP_001545310 |
Protein GI | 159899063 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0646] Methionine synthase I (cobalamin-dependent), methyltransferase domain [COG1410] Methionine synthase I, cobalamin-binding domain |
TIGRFAM ID | [TIGR00640] methylmalonyl-CoA mutase C-terminal domain [TIGR02082] 5-methyltetrahydrofolate--homocysteine methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00356448 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCA TAAAACCTTA CATCGCCGAA TTGGCGAAAC GTGTTCTGAT TTACGACGGC GCAATGGGTA CTAGCTTGGA TTTGTACGAT CTCAAGCCTG AGGATTTTGG CGGCGAACAG TACTTCGGCG CACGCGATTA TCTCGCCTTG ACGCGCCCCG ACATTATCGA ACAAATTCAT AGCTCGTTTA TGGAAGCTGG GGTTGATGTG TTGGAAACCA ATACCTTCCA ATCAACCCGC ATTCGCTTGC ATGAATGGGG CTTGGCCGAT AAAACCTACG AGCAAAATTT TAAAGCTGCC CAACTGGCCC GCAAGGTCGC CGACCGCTAC CAAGCCCAAG ATGGACGCAA ACGTTTTGTG GCTGGCTCGA TGGGGCCAAC TGGCTTTTTG CCTTCATCCG ACGACCCAAC CCTTTCAAAT ATTACCTGGG ATGCGCTTTC CGATACTTTT AAAGAGCAAA CCATCGCCTT GGTCGAAGGT GGGGTTGATG TGCTCTTGGT TGAAACCAGC GTTGATATTC TCGAAGTCAA AGCGGCAATC GATGGTATTC GGCGCGGCTT GGCCGAACTC AAACGCCCTG ATATTGCGAT TCAAGCCCAA GTCTTTTTGG ATCTCTCAGG GCGTATGTTG CTAGGCACGG ATATTCAGGC CACCATGGTC ACGCTCGAAG CCTTGGCCGA TGTTGATGTG ATTGGATTGA ATTGCTCGAC TGGCCCTGAA CACATGCGGC CAGCGATCAC CTACCTCGCC GCTAACTCCA AATTGCCAAT TTCGTGTATT CCCAACGCTG GCTTGCCGAT GGAAGTCAAT GGCGAAACCG TGTTTCCGTT GGAGCCAGCA GGCTTTGCCA AGAGCTTGAC TGAGTTTGTG CAAGATTTGG GTGTGCGCGT CGTTGGTGGT TGTTGTGGCA CAACTCCGGC CCACCTTCGC GCGTTGGTCG ATTCAATTGG CAATAATTGC CCACCCAAAC CGAACCAAGC TCCAATTGTT CCACAAGTTT CATCGGCCAT GCGTTCCTTG GCGCTCCACC AAGATCCAGG CCCATTTATC ATTGGCGAAC GGGTCAACTC GGTTGGTTCG CGTAAAGTTA AAAAATTGTT GCTCAAAGAT GATTATGATG GTGTGTTGCA AATTGCTCGT GAGCAGGCCG ATAGTGGCTC GCACGCGCTC GATGTATGTG TTGCCATGAC CGAGCGCCCC GATGAAAAAG AGCAAATGCA AAAGCTGCTC AAAAAATTAA CCATGGGCGT TGAATTACCA CTCGTCATCG ACTCAACTGA GTATGAAGTG ATCGACGCAG CGCTGCAAAT CTATCCAGGC CGCGCGATTA TCAACTCAAC CTCGCTCGAA GGCGGGCGTG AACAGAAATT CGACAAAACC ATGCCCTTGG TTGCTCGTTA TGGCGCAGCA ATTATCGCCT TGACGATCGA TGAAGAAGGC ATGGCCCATA CTGCTGAAGC CAAAGTTGCG ATCGCCAAAC GTATGGCCGA AATGGGCCGC CAAGAATATG GCGTGCAAAA TGATGCTTGG CTCTTTGATG CATTGACCTT TCCATTAACC ACGGGCCAAG CCGAATTGCG CGAATCAGCC AAAGAAACCA TGGAAGGCAT TCGCCGGATC AAGGCCGAAA TTCCAGGCGC AATGACGGTG CTTGGGGTTT CGAATCTCTC ATTCGGGATT CAGCCCCATG CCCGTTCAGC CCTCAACTCG GTCTTTTTAT ATCACGCCGT CAAGGCTGGG CTTGATGCAG CGATCATCAA TCCAACCCAC GTTATGCCAT TTGCCGAAAT CTCCGCCGAA ACCCGCAAAG TTTGTGAAGA TGTAATTTTT AACAGCGACG AAGAAGCCTT GCCGCGCTTC ATCCAATATT TCGAAGCCAA TACAGCGGTT GCTGGCGAAG CCAAAGTTGA CCCAACCGCC ACCATGAATG CCGAAGAGCG AATTCACTGG AAGATTCTGC ATCGCAAAAA AGATGGCATC GAAGCCGATA TCGACGAATC GCGGGCATGG CGCGAAGCTG GCGGCCAAAC CCAAGGCGAA GCAGCGATTG ATATTCTCAA TGGTGTGATG CTGCCAGCCA TGAAAGAAGT TGGCGATAAA TTCGGCGCTG GTGAATTGAT TCTGCCCTAT GTGCTGCAAT CTGCCGAAGT GATGAAACGT GCCGTCGCCC ATCTCGAACA ATTCCTTGAT CGGGTCGAAG GCACAACCAA GGGTAAAGTT GTGCTAGCCA CGGTCTATGG CGATGTGCAC GACATTGGCA AAAACTTGGT TTACACGATT TTGTCGAATA ACGGCTACAC CGTTTACGAC CTTGGCAAGC AAGTTCCAGT CAATACAATC ATCGAAAAAG CGCTTGAAGT TGGGGCCGAT GCAATTGGGC TTTCGGCTTT GTTGGTCAGC ACATCCAAGC AAATGCCATT GTGTGTGCAG GAGTTGCACA AGCGTAATTT GGCTTTCCCA GTGCTGGTTG GTGGCGCGGC AATCAATCGC CAATATGGCC AACGTATTCT CTTCCCTGAG CCAGAAGTGG CCTACACTGG TGGGGTTTTC TATTGCAAAG ATGCCTTTGA AGGCCTCGAA ACCATGGATA AACTCTCCGA CCCAACTGTG CGCGAAGAAT ATTTGAGCGT GATGGAAGAT GCTGCCCGCG AATCGTTGAA AGCGCCAGAG CGAGGTCGCA CCCTGTTGGA AAAACTGGGC AAAAATAATC CTAGCGGTGT GCGTTCAACC ACTCGCAACG ATAATCCAGT GCCAACTCCA CCGTTCTGGG GCGTACAAGT TGCCAAAAAA ATTCGGCTCG ATGATGTTAC GCCATATCTT GACCTGAATA CGCTTTGGCG TTTGGGTTGG GGCATCAAAA ACCTCAAGGG CGCTGAATAC GATCGAGTGG TCAATGATGA GTTCAAACCA CGGTTGCGCC GCATGATCGA CGATGCCAAG CGCAATGGTT GGCTGCAACC ACAAGCAGTC TATGGCTATT TCCCAGCTCA AGCCGCTGGC GAAGAGTTGG TGGTCTATGA TCCTAAGGAT CGCAAAACCG TGCTGACGCG CTTTGCCTTC CCGCGCCAAC CAAGCCGTGA ACGCTTATGT CTAGCTGATT ACTTCCGACC TGTTGAAAGC GGCGAATTCG ATGTGGTTGG CTTGCAAATC GTCACCATGG GCCATGTTGC AGCTGAGCGC GTTGAAAGCT TGCAAAAAGC CAACGATTAT AGCGAATCGT TCTTTAGCCA TGGCTTGAGC ACAACCTACG CCGAAGCACT CGCCGAATTT ACCAACAAAA TTATCAGCCA AGGCTTAGGT TTGAGTTATC AGGCTAAACG CTACTCGTGG GGCTACCCAG CCTGCCCCGA TCTTGAGGAG CATACCAAGC TGTTTAGTGT TTTGCCCGCC AAAGAGATCG ATGTCAGTTT GACCTCGGCC TTCCAACTAG ACCCTGAGCA AAGCACCGCC GCCATCGTCG TGCATCACCC TGATGCTAAA TATTTCTCAA TTGGCTCAGT CGCCGAACGC GCCGAAAGTG ATGTCGCCGA GCTGACCAGT TAA
|
Protein sequence | MTSIKPYIAE LAKRVLIYDG AMGTSLDLYD LKPEDFGGEQ YFGARDYLAL TRPDIIEQIH SSFMEAGVDV LETNTFQSTR IRLHEWGLAD KTYEQNFKAA QLARKVADRY QAQDGRKRFV AGSMGPTGFL PSSDDPTLSN ITWDALSDTF KEQTIALVEG GVDVLLVETS VDILEVKAAI DGIRRGLAEL KRPDIAIQAQ VFLDLSGRML LGTDIQATMV TLEALADVDV IGLNCSTGPE HMRPAITYLA ANSKLPISCI PNAGLPMEVN GETVFPLEPA GFAKSLTEFV QDLGVRVVGG CCGTTPAHLR ALVDSIGNNC PPKPNQAPIV PQVSSAMRSL ALHQDPGPFI IGERVNSVGS RKVKKLLLKD DYDGVLQIAR EQADSGSHAL DVCVAMTERP DEKEQMQKLL KKLTMGVELP LVIDSTEYEV IDAALQIYPG RAIINSTSLE GGREQKFDKT MPLVARYGAA IIALTIDEEG MAHTAEAKVA IAKRMAEMGR QEYGVQNDAW LFDALTFPLT TGQAELRESA KETMEGIRRI KAEIPGAMTV LGVSNLSFGI QPHARSALNS VFLYHAVKAG LDAAIINPTH VMPFAEISAE TRKVCEDVIF NSDEEALPRF IQYFEANTAV AGEAKVDPTA TMNAEERIHW KILHRKKDGI EADIDESRAW REAGGQTQGE AAIDILNGVM LPAMKEVGDK FGAGELILPY VLQSAEVMKR AVAHLEQFLD RVEGTTKGKV VLATVYGDVH DIGKNLVYTI LSNNGYTVYD LGKQVPVNTI IEKALEVGAD AIGLSALLVS TSKQMPLCVQ ELHKRNLAFP VLVGGAAINR QYGQRILFPE PEVAYTGGVF YCKDAFEGLE TMDKLSDPTV REEYLSVMED AARESLKAPE RGRTLLEKLG KNNPSGVRST TRNDNPVPTP PFWGVQVAKK IRLDDVTPYL DLNTLWRLGW GIKNLKGAEY DRVVNDEFKP RLRRMIDDAK RNGWLQPQAV YGYFPAQAAG EELVVYDPKD RKTVLTRFAF PRQPSRERLC LADYFRPVES GEFDVVGLQI VTMGHVAAER VESLQKANDY SESFFSHGLS TTYAEALAEF TNKIISQGLG LSYQAKRYSW GYPACPDLEE HTKLFSVLPA KEIDVSLTSA FQLDPEQSTA AIVVHHPDAK YFSIGSVAER AESDVAELTS
|
| |