Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3592 |
Symbol | aroA |
ID | 3722109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 684191 |
End bp | 685528 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640073258 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_355096 |
Protein GI | 77465593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.886783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGAC ACGGGCCCGC GCAACCGATG ACCGCTCGCC GCTCGGGGCC GCTGAAGGGG CGTGCCGAGA TCCCCGGCGA CAAGTCGATC AGCCACCGCG CGCTGATCCT CGGCGCCATG GCGGTGGGCG AGACGCGGAT CACGGGTCTT CTCGAGGGGC AGGACGTGCT CGACACCGCG AAGGCCATGC GCGCCTTCGG CGCCGAGGTG ATCCAGCACG GGCCGGGCGC CTGGTCGGTG CATGGGGTGG GCGTGGGGGG CTTCACCGAG CCCGCCGAGG TGATCGACTG CGGCAATTCC GGCACCGGCG TGCGGCTGGT GATGGGGGCG ATGGCTACCT CGCCGCTGAC CGCGACCTTC ACCGGCGATG CCTCGCTCCG CAAGCGGCCC ATGGGGCGGG TGACCGATCC GCTGGCTCTG TTCGGCACGC GCGCCTACGG GCGCAAGGGC GGGCGGCTGC CGATGACGCT CGTGGGCGCC GCCGATCCGG TGCCGGTGCG CTATACGGTG CCGGTGCCCT CGGCGCAGGT GAAATCGGCG GTTCTTCTGG CCGGGCTGAA CGCGCCCGGG CAGACGGTCG TGATCGAGCG CGAGGCGACG CGGGACCATT CCGAGCGGAT GCTGCGCGGC TTCGGCGCCG AACTCAGCGT CGAGACGGGG CCCGAGGGGC AGGTCATCAC CCTGACCGGG CAGCCGGAGC TTCGGCCGCA GACGGTGGCG GTTCCGCGCG ATCCGTCCTC GGCGGCCTTT CCGGTCTGCG CGGCGCTGAT CGTCGAGGGA TCCGAGATCC TGGTGCCGGG GGTGAGCCGC AATCCCACCC GGGACGGGCT CTATGTGACG CTGCTCGAGA TGGGCGCAGA CATCGCCTTC GAGAACGAGC GCGAGGAGGG CGGCGAGCCG GTGGCCGACC TGCGCGTGCG GGCCTCGGCC CTCAAGGGCG TGGAGGTGCC GCCCGAGCGC GCGCCCTCGA TGATCGACGA ATATCCGATC CTGTCGGTGG TGGCGGCCTT TGCCGAGGGC TTGACGATCA TGCGCGGCGT GAAGGAGCTG CGCGTGAAGG AGAGCGACCG GATCGACGCG ATGGCGCGGG GGCTTGAGGC CTGCGGCGTG CGGATCGAGG AGGACGAGGA TACGCTGATC GTCCATGGCA TGGGCCGGGT GCCGGGCGGG GCCACCTGCG CCACCCATCT CGATCACCGG ATCGCCATGA GCTTCCTCGT GCTCGGCATG GCGGCGGAGG CGCCGGTGAC CGTCGACGAC GGCTCGCCTA TCGCGACCTC CTTCCCTGCC TTCATCGATC TGATGGCGGG TCTCGGAGCG GATCTGGCGG CGGGCTGA
|
Protein sequence | MSGHGPAQPM TARRSGPLKG RAEIPGDKSI SHRALILGAM AVGETRITGL LEGQDVLDTA KAMRAFGAEV IQHGPGAWSV HGVGVGGFTE PAEVIDCGNS GTGVRLVMGA MATSPLTATF TGDASLRKRP MGRVTDPLAL FGTRAYGRKG GRLPMTLVGA ADPVPVRYTV PVPSAQVKSA VLLAGLNAPG QTVVIEREAT RDHSERMLRG FGAELSVETG PEGQVITLTG QPELRPQTVA VPRDPSSAAF PVCAALIVEG SEILVPGVSR NPTRDGLYVT LLEMGADIAF ENEREEGGEP VADLRVRASA LKGVEVPPER APSMIDEYPI LSVVAAFAEG LTIMRGVKEL RVKESDRIDA MARGLEACGV RIEEDEDTLI VHGMGRVPGG ATCATHLDHR IAMSFLVLGM AAEAPVTVDD GSPIATSFPA FIDLMAGLGA DLAAG
|
| |