Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3580 |
Symbol | |
ID | 6134378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3996692 |
End bp | 3997903 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641643747 |
Product | homocitrate synthase |
Protein accession | YP_001770395 |
Protein GI | 170741740 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.483441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0700115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAC CGTCCTCCGC CTCCCCTCCC CCCGCACCGT CCGGGCCGCC CTCCGCCCGG ACCGTGTTCC TCAACGACAC CACCCTGCGC GACGGCGAGC AGGCCCCGGG CGTCGCCTTC ACCCGCCGCG AGAAGATCGA GATCGCCGAG GCCCTCGCGG CGGCCGGGGT CCCGGAGATC GAGGCCGGCA CGCCGGCCAT GGGCGAGGAC GAGATCGAGA CGATCCGCTC CATCGTCTCG CTGCGGCTGC CCCTGCGGGT GATCGCGTGG TGCCGGATGC GCGAGGACGA CCTCCTCGCC GCGGTCGCCG CGGGCGTGCC GGCGGTCAAC CACTCGATCC CGGTCTCGGA CGCCCAGCTG CGCGGCAAGC TCGGCCGCGA CCGCGCCTTC GCCCTCGACG CGGTCGCCGC GACGGTGGCG CGCGCGCGGC GCCTCGGCCT CGCGGTGGCG GTCGGCGCCG AGGACGCCTC GCGCGCCGAT CCCGACTTCC TCTGCCGCGT CGCCGAAGCG GCGCGGGCGG CGGGCGCCGA GCGCCTGCGC CTCGCCGACA CGCTCGGCGT GCTCGATCCC TTCGCCGCCG ACGCCCTGGT CCGGCGCCTC GCCGCGGCCA CCGACCTCGC CCTCGAATTC CACGCCCACG ACTATCTCGG CCTCGCCACC GCCAACACGC TGGCGGCGCT GCGGGCGGGG GCGCGCCACG CCAGCGTCAC CGTGACGGGG CTCGGCGAGC GGGCCGGCAA TGCCGCCCTG GAGGAGGTGG CGGTGGCGCT GGCGCGGTTC GGCCAGGGGC CGACCGGGAT CGACCTTCGC GCGCTGCGCC CGCTCGCCGC CGCCGTCGCG GCGGCGGCCG AGCGTCCCCT GCCGCGCGGC AAGGCCATCG TGGGCGAGGA CATCTTCACC CACGAATCCG GCATCCACGT CGCCGGGCTG CTGCGGGACC GGGCGACCTA CGAGGCGCTC GATCCCGGGA TGCTCGGGCG CAGCCACCGC ATCGTGATCG GCAAGCATTC GGGGGTGGCG GCGCTCGCCA GCGCCCTCGC GGCGCAGGGG CGCAGCCTCG ACGCGGAGGT CGCCCGCGAC CTCCTGGAAC GGGTCCGGGC GGCGGCGGTG CGCACCAAGG CGGCGGTGCC GCCGGGCCTG CTGCGGCGCC TCCACGACGA GTGCCTGATG AGCGCGCGGC CGCTGCCGCG CTTCGCCGCC GCGGGGAGCT GA
|
Protein sequence | MPEPSSASPP PAPSGPPSAR TVFLNDTTLR DGEQAPGVAF TRREKIEIAE ALAAAGVPEI EAGTPAMGED EIETIRSIVS LRLPLRVIAW CRMREDDLLA AVAAGVPAVN HSIPVSDAQL RGKLGRDRAF ALDAVAATVA RARRLGLAVA VGAEDASRAD PDFLCRVAEA ARAAGAERLR LADTLGVLDP FAADALVRRL AAATDLALEF HAHDYLGLAT ANTLAALRAG ARHASVTVTG LGERAGNAAL EEVAVALARF GQGPTGIDLR ALRPLAAAVA AAAERPLPRG KAIVGEDIFT HESGIHVAGL LRDRATYEAL DPGMLGRSHR IVIGKHSGVA ALASALAAQG RSLDAEVARD LLERVRAAAV RTKAAVPPGL LRRLHDECLM SARPLPRFAA AGS
|
| |