Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1799 |
Symbol | trpD |
ID | 5712787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1871379 |
End bp | 1872398 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641267719 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001533142 |
Protein GI | 159044348 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000601677 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.211733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG CGCTGAAACC CCTGATCGGG GCCGCCGCCG ACCGTCCCCT GACCCGGGCG GAGGCCGAGG CGGCCTTCGA GATCCTGTTC GAGGGCGAGG CGACGCCGAG CCAGATGGGC GGGCTGCTGA TGGCGCTGCG TACGCGCGGC GAGACGGTGG CGGAATACGC AGCCGCGGCC GCGGTGATGC GCGCCAAATG CAATGCAGTG CGCGCGCCCG AAGGCGCGAT GGACATCGTC GGCACCGGGG GCGACGGCAA GGGGACGCTG AACATCTCCA CCGCGACGGC CTTCGTGGTG GCGGGCGCGG GCGTGCCCGT GGCCAAGCAT GGCAACCGCA ATCTCAGCTC GAAATCCGGG GCGGCGGATG CGCTGGCGCA GATGGGGATC GAGGTCATGG TCGGCCCGGC GGTGGTCGAG CGGGCGTTGT CGGAGATCGG CATCGCCTTC ATGATGGCGC CGATGCACCA CCCGGCGATC CGGCACGTCA TGCCCACGCG GGCCGAGCTG GGCACGCGGA CCATGTTCAA CATCCTCGGC CCCCTGACCA ACCCGGCAGG CGTCAAGCGG CAGCTGACCG GCGCGTTCTC CCGCGACCTG ATCCGGCCCA TGGCCGAGAC CCTGGCGGCG CTCGGCTCGG ACGTGGCCTG GCTGGTTCAC GGCTCGGACG GCACCGACGA GCTGACGATT ACGGGCGTGT CCTGGGTGGC GGGGCTGTCG GACGGGGCGG TGACCGAGTT CGAGGTCCAT CCCGAAGAGG CCGGCTTGCC GGTCCATCCG TTCGAGGCCA TCCTCGGGGG CACGCCCGAG GAAAACGGCG CCGCCTTCCG CGCGCTTCTG GCCGGGGAGG CGTCGGCCTA CCGAGACGCG GTGCTCTTGA ACGCCGCGGC GGCACTGAAG GTCGCGGGGC GCGTCACCGC CCTGCCCGAT GGCGTGATGC TGGCGGCGGA GGCGATCGAC AGCGGCGCGG CCCTGGCCAA GGTGCAGGGG CTGGCACAGA TCACATCGGA GGCGACATGA
|
Protein sequence | MSDALKPLIG AAADRPLTRA EAEAAFEILF EGEATPSQMG GLLMALRTRG ETVAEYAAAA AVMRAKCNAV RAPEGAMDIV GTGGDGKGTL NISTATAFVV AGAGVPVAKH GNRNLSSKSG AADALAQMGI EVMVGPAVVE RALSEIGIAF MMAPMHHPAI RHVMPTRAEL GTRTMFNILG PLTNPAGVKR QLTGAFSRDL IRPMAETLAA LGSDVAWLVH GSDGTDELTI TGVSWVAGLS DGAVTEFEVH PEEAGLPVHP FEAILGGTPE ENGAAFRALL AGEASAYRDA VLLNAAAALK VAGRVTALPD GVMLAAEAID SGAALAKVQG LAQITSEAT
|
| |