Gene Information |
Locus tag | Mext_2990 |
Symbol | |
ID | 5835484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3339759 |
End bp | 3340859 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641368790 |
Product | chorismate synthase |
Protein accession | YP_001640450 |
Protein GI | 163852407 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACA ACACCTTCGG CCACCTGTTC CGTGTCACCA CCTTCGGCGA GAGCCACGGG GTGGCGCTCG GCTGCGTGGT GGACGGGTGC CCGCCCGGCC TCGCGCTGGA GGCGGACGAG ATCCAGGCGG AGCTCGACCG GCGCAAGCCC GGCCAGTCGC GCTTCACCAC GCAGCGGCGC GAGCCGGATC AGGTGAAGAT CCTGTCCGGC GTGTTCAGCG ACGACCGCAC CGGCGGGCGC CAGCTCACCA CCGGCACGCC GATCGCGCTG ATGATCGAGA ACACCGATCA GCGCTCAAAA GACTATTCCG AGATCCGCGA CAGCTACCGC CCCGGCCACG CCGACTTCAC CTACGACGCC AAGTACGGCA TCCGCGACTA TCGCGGCGGC GGACGCTCCT CCGCCCGCGA GACCGCCGCG CGGGTCGCGG CCGGCGCGGT GGCGCGCAAG GTCATCCCCG GCATCACCAT CCGCGCCGCC CTGGTGCAGA TGGGGCCGCA CGCCATCGAC CGCACGAACT GGGATTGGGA GCAGGTCGGC CAAAATCCGT TCTTCTGCCC TGACGCGAAG GCGGCGGCGC TCTACGAGAC CTATCTCGAC GAAATCCGAA AGGACGGCTC CTCTGTCGGC GCGGTGATCG AGGTGGTGGC CGAAGGCGTG CCGCCCGGCC TCGGCGCACC GATTTACGGC AAGCTCGACG CGGATCTGGC GGCGGCGATG ATGTCGATCA ACGCGGTCAA GGGCGTGGAG ATCGGCGACG GCTTCGCGGC CGCCGCGTTG CGCGGCGAGG ACAATGCCGA TGAGATGCGC GCCGGCAATG ACGGCCGCCC CCGCTTCCTC GCCAACCATG CCGGCGGCAT CCTGGGCGGC ATCTCGTCGG GCGAGCCGGT GGTTGTCCGG TTTGCCGTAA AGCCGACTTC CTCGATCCTG ACCCCGCGCC AGAGCGTCAA CCGCGACGGG GCTGAGATCG ACCTCATCAC CAAGGGCCGC CACGACCCCT GCGTCGGCAT CCGCGCCGTC CCCGTCGCCG AGGCGATGAT GGCCTGCGTG CTGGCCGACC ACACTCTTCG CCATCGCGGG CAGAACGGCG AGCGCCCGTG A
|
Protein sequence | MSHNTFGHLF RVTTFGESHG VALGCVVDGC PPGLALEADE IQAELDRRKP GQSRFTTQRR EPDQVKILSG VFSDDRTGGR QLTTGTPIAL MIENTDQRSK DYSEIRDSYR PGHADFTYDA KYGIRDYRGG GRSSARETAA RVAAGAVARK VIPGITIRAA LVQMGPHAID RTNWDWEQVG QNPFFCPDAK AAALYETYLD EIRKDGSSVG AVIEVVAEGV PPGLGAPIYG KLDADLAAAM MSINAVKGVE IGDGFAAAAL RGEDNADEMR AGNDGRPRFL ANHAGGILGG ISSGEPVVVR FAVKPTSSIL TPRQSVNRDG AEIDLITKGR HDPCVGIRAV PVAEAMMACV LADHTLRHRG QNGERP
|
| |
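The coordinate and length fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon; both are assumptions about how the record was built, not something the record states. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Protein length in aa, assuming the CDS ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C among A/C/G/T; spaces and other characters are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record above.
length_bp = gene_length(3339759, 3340859)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's Start bp and End bp, this reproduces the listed Gene Length (1101 bp) and Protein Length (366 aa). `gc_fraction` can be applied to the Gene sequence field as-is, because the spaces between the 10-base blocks are skipped.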