Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_2004 |
Symbol | trpE |
ID | 3719337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 602636 |
End bp | 604147 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640070167 |
Product | anthranilate synthase component I |
Protein accession | YP_352055 |
Protein GI | 77462551 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.271307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCTCG TGACCTCCTT CGAAAGCTTC GAGCGCGGCT GGAAGGCCGG GCAGAACCAG ATCGTCTATG CCCGGCTGAC CGCGGATCTC GACACGCCGG TGTCGCTGAT GCTGAAGCTC GCCGAGGCGC GCACCGACAC GTTCATGCTG GAATCGGTGA CGGGCGGCGA GATCCGCGGC CGCTATTCGG TCGTGGGCAT GAAGCCCGAC CTGATCTGGC AGTGCCACGG GCAGGACAGC CGCATCAACC GCGAGGCGCG CTTCGACCGG CAGGCCTTCC AGCCGCTGGA AGGCCACCCG CTCGAGACGC TGCGGGCGCT GATCGCCGAG AGCCGGATCG AGATGCCGGC CGACCTGCCC CCGATCGCGG CGGGCCTCTT CGGCTATCTC GGCTATGACA TGATCCGGCT GGTCGAGCAT CTGCCGGGGA TCAACCCCGA TCCGCTCGGT CTGCCCGATG CGGTGCTGAT GCGGCCCTCG GTCGTGGCGG TGCTCGACGG GGTGAAGGGC GAGGTCACCG TGGTGGCGCC CGCATGGGTC TCGTCGGGCC TCTCGGCGCG GGCCGCCTAT GCGCAGGCGG CCGAGCGGGT GATGGATGCG CTGCGCGATC TCGACCGCGC GCCGCCCGCG CAGCGCGACT TCGGCGAGGT GGCGCAGGTG GGCGAGATGC GCTCGAACTT CACCCACGAG GGCTACAAGG CCGCGGTCGA GAAGGCCAAG GACTACATCC GCGCGGGCGA CATCTTCCAG GTGGTGCCGT CGCAACGCTG GGCGCAGGAC TTCCGTCTGC CGCCCTTCGC GCTCTACCGC TCCTTGCGCA AGACGAACCC CTCGCCCTTC ATGTTCTTCT TCAACTTCGG CGGCTTCCAG GTTGTGGGGG CCAGCCCCGA GATCCTCGTG CGGCTGCGCG ACCGCGAGGT GACGGTGCGT CCCATCGCCG GCACCCGCAA GCGCGGCGCG ACACCCGAGG AGGACCGCGC GCTGGAGGCC GACCTTCTGT CCGACAAGAA GGAACTGGCC GAGCATCTGA TGCTGCTCGA TCTCGGGCGA AACGACGTGG GCCGGGTGGC GAAGATCGGC ACCGTGCGCC CGACCGAGAA GTTCATCATC GAGCGCTATT CCCACGTCAT GCATATCGTC TCGAACGTGG TGGGCGAGAT CGCGGAGGGC GAGGATGCGC TCTCGGCGCT GCTGGCGGGC CTGCCGGCGG GCACCGTCTC GGGCGCGCCC AAGGTGCGGG CGATGGAGAT CATCGACGAG CTCGAGCCGG AAAAGCGCGG CGTCTATGGC GGCGGCGTGG GCTATTTCGC GGCCAACGGC GAGATGGATT TCTGCATTGC GCTGCGGACC GCGGTCCTGA AGGACGAGAC GCTCTACATC CAGTCGGGCG GCGGCGTCGT CTATGACAGC GACCCCGAGG CCGAATATCA GGAGACGGTC AACAAGGCCA GGGCGCTCCG CCGGGCCGCC GAGGATGCGG GCCTCTTCGC CCGCCGCGCC GGGAACGGCT GA
|
Protein sequence | MSLVTSFESF ERGWKAGQNQ IVYARLTADL DTPVSLMLKL AEARTDTFML ESVTGGEIRG RYSVVGMKPD LIWQCHGQDS RINREARFDR QAFQPLEGHP LETLRALIAE SRIEMPADLP PIAAGLFGYL GYDMIRLVEH LPGINPDPLG LPDAVLMRPS VVAVLDGVKG EVTVVAPAWV SSGLSARAAY AQAAERVMDA LRDLDRAPPA QRDFGEVAQV GEMRSNFTHE GYKAAVEKAK DYIRAGDIFQ VVPSQRWAQD FRLPPFALYR SLRKTNPSPF MFFFNFGGFQ VVGASPEILV RLRDREVTVR PIAGTRKRGA TPEEDRALEA DLLSDKKELA EHLMLLDLGR NDVGRVAKIG TVRPTEKFII ERYSHVMHIV SNVVGEIAEG EDALSALLAG LPAGTVSGAP KVRAMEIIDE LEPEKRGVYG GGVGYFAANG EMDFCIALRT AVLKDETLYI QSGGGVVYDS DPEAEYQETV NKARALRRAA EDAGLFARRA GNG
|
| |