Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1022 |
Symbol | aroA |
ID | 5710538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1057354 |
End bp | 1058706 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641266933 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001532365 |
Protein GI | 159043571 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.321263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.615068 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCC ACGGTGACCC GATCCCCATG ACCGCCCATC CGTCCGGACC GCTCAGCGGC ACAGCGCAGG TGCCCGGGGA CAAGTCGATC TCGCACCGGT CGCTGATCCT CGGCGCGCTC GCGGTGGGGG AGACGAAGGT CACCGGGCTG CTCGAAGGGC AGGACGTGCT GGACACCGCG CGGGCGATGC AGGCGTTCGG GGCGGAGGTG ATCCAGCACG CGCCGGGGGC CTGGTCGGTG CATGGGGTCG GCACCGGCGG GTTTGCCGAG CCCGAGGATG TGATCGATTG CGGCAATTCG GGCACCGGCG TGCGGCTGAT CATGGGGGCC ATGGCCACGA CGCCGATCAC CGCGACCTTC ACCGGCGATG CGAGCTTGCG CAGCCGTCCC ATGGGCCGGA TCACCGACCC GCTGGCGGGG TTCGGGACCA CCGCCGTGGG CCGCCGGGGC GGGCGCTTGC CCATGACGCT GACCGGGGCC GCGGACCCGG TGCCGGTGCG CTACACCGTG CCGGTGCCGT CGGCGCAGGT GAAATCCGCC GTCCTGCTGG CGGGGCTGAA CGCGCCGGGG CAGACCGTGG TGATCGAGGC CGAGGCGACG CGGGACCATT CGGAGCGGAT GCTGCGCGGC TTTGGCGCCG AAATCAGCGT CGAGAGCGCG CCCGAGGGCA ATGTCATCAC CCTGACCGGC CAGCCGGAGC TGCGCCCCCA GACCATCGTG GTGCCGCGCG ATCCCTCTTC GGCCGCCTTC CCGGTGGCGG TGGGGCTGAT CGTGCCCGGC TCCGACGTGC TGGTGCCGGG TATCGGGCTG AACCCGACCC GCGCGGGGCT CTATACCACC TTGCAGGAGA TGGGCGCCGA GCTGAGCTTC GAGAATATGC GCGAGGAGGG CGGCGAGCCC GTCGCGGACC TGCGCGCGCG CTTTAGCGAC GCCATGCAAG GCATTGAAGT GCCACCTGAA CGCGCGCCTT CCATGATCGA CGAATACCCG ATCCTGAGCG TGATCGCCGC CTATGCCACC GGGCGCACGG TGATGCGCGG CGTCAAGGAG CTGCGCGTCA AGGAGAGCGA CCGGATCGAC GCCATGGCGC GCGGGCTGGA GGCCTGCGGT GTGCGGGTGG AGGAGGACGA GGATACCCTG ATCGTGCACG GGATGGGGCC GGGGGGTGTG CCGGGGGGGG CCACCTGCGC CAGCCATCTC GACCACCGGA TCGCCATGAG TTTCCTGTGC TGCGGGCTGG CCGCGCAGAC CCCCGTCTCG GTCGACGATG GCGGCCCGAT CGCCACCAGC TTCCCGATCT TCGAGCCGCT GATGACCGCG CTGGGCGCGA CCCTCACCCG CGACAGTACC TGA
|
Protein sequence | MSAHGDPIPM TAHPSGPLSG TAQVPGDKSI SHRSLILGAL AVGETKVTGL LEGQDVLDTA RAMQAFGAEV IQHAPGAWSV HGVGTGGFAE PEDVIDCGNS GTGVRLIMGA MATTPITATF TGDASLRSRP MGRITDPLAG FGTTAVGRRG GRLPMTLTGA ADPVPVRYTV PVPSAQVKSA VLLAGLNAPG QTVVIEAEAT RDHSERMLRG FGAEISVESA PEGNVITLTG QPELRPQTIV VPRDPSSAAF PVAVGLIVPG SDVLVPGIGL NPTRAGLYTT LQEMGAELSF ENMREEGGEP VADLRARFSD AMQGIEVPPE RAPSMIDEYP ILSVIAAYAT GRTVMRGVKE LRVKESDRID AMARGLEACG VRVEEDEDTL IVHGMGPGGV PGGATCASHL DHRIAMSFLC CGLAAQTPVS VDDGGPIATS FPIFEPLMTA LGATLTRDST
|
| |