Gene Information |
Locus tag | Smed_3509 |
Symbol | |
ID | 5324397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3713117 |
End bp | 3714286 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640792461 |
Product | aromatic amino acid aminotransferase |
Protein accession | YP_001329162 |
Protein GI | 150398695 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1448] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
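The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (a common convention for such records, though not stated here) and that the CDS length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3713117, 3714286)
print(length)                  # 1170
print(protein_length(length))  # 389
```

Under these assumptions the values agree with the record: 1170 bp is 390 codons, of which 389 encode amino acids and one is the stop codon.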
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGACG CCCTCACCCG CCAAGCCGAC GACCCCTTGC TGGCCCTTAT CGGCCTGTTC AGGAAGGATG AGCGCCCAGG CAAGGTGGAC CTCGGCGTCG GCGTCTATCG TGACGAAGCC GGCCGCACGC CGATCTTCCG AGCGGTCAAG GCCGCGGAAA AGCGACTTCT CGAAACGCAG GACAGCAAGG CCTATGTCGG ACCCGAGGGA GACCTCGATT TTCTTGATCG CCTCTGGCAG CTCGTCGGCG GGGATATAGT CGAGCGCAGC CATGTGGCGG GCGTCCAGAC GCCCGGCGGC TCCGGCGCGC TCCGTCTGGC GGCGGACCTC ATCGCCCGCA TGGGCGGCCG GGGCATCTGG CTCGGGCTGC CGAGCTGGCC GAACCACGCG CCGATCTTCA AGGCGGCCGG GCTCAATATA GCCACCTACG ATTTTTTCGA CATTCCGTCG CAATCGGTAA TCTTCGATAA TCTCGTGAGC GCGCTGGAAG GCGCTGCACC CGGCGATGCG GTGCTGCTGC ATGCCAGCTG CCACAATCCG ACCGGCGGCG TGCTCAGCGA AGCGCAATGG ATGGAGATCG CCGCGCTGGT GGCCGAGCGC GGCCTGCTGC CGCTCGTCGA TCTCGCCTAT CAAGGGCTCG GCCGCGGCCT CGATCAGGAT GTCGCCGGCC TCAGACATCT TCTCGGCGTC GTGCCGGAAG CGCTCATGGC GGTTTCCTGC TCGAAATCCT TCGGGCTTTA TCGCGAGCGC ACGGGCGCGA TCTTCGCCCG CACCAGCTCA TCCGCATCGG CGGACAGGGT GCGCTCCAAT CTCGCGGGCC TCGCCCGCAC CAGCTATTCC ATGCCGCCGG ATCACGGCGC AGCCGTCGTG CGGACGATCC TTGGCGACCC GGAACTCAGA CGCGACTGGG CGGAGGAGCT GGAGACGATG CGGCTCAGGA TGACCGGCCT CCGGCGGTCG CTTGCCGAGG GGCTCCGCAC CCGCTGGCAG AGCCTCGGCG CGGTCGCCGA TCAGGAGGGC ATGTTCTCGA TGCTGCCGCT TTCCGAAGCG GAGGTGATGC GGCTCAGGAC CGAGCACGCC ATCTATATGC CCTCCTCCGG CCGCATCAAC ATCGCCGGGC TGAAGACCAC GGAAGCCGCC GAGGTTGCCG GCAAGTTCAC CAGTCTCTGA
|
Protein sequence | MFDALTRQAD DPLLALIGLF RKDERPGKVD LGVGVYRDEA GRTPIFRAVK AAEKRLLETQ DSKAYVGPEG DLDFLDRLWQ LVGGDIVERS HVAGVQTPGG SGALRLAADL IARMGGRGIW LGLPSWPNHA PIFKAAGLNI ATYDFFDIPS QSVIFDNLVS ALEGAAPGDA VLLHASCHNP TGGVLSEAQW MEIAALVAER GLLPLVDLAY QGLGRGLDQD VAGLRHLLGV VPEALMAVSC SKSFGLYRER TGAIFARTSS SASADRVRSN LAGLARTSYS MPPDHGAAVV RTILGDPELR RDWAEELETM RLRMTGLRRS LAEGLRTRWQ SLGAVADQEG MFSMLPLSEA EVMRLRTEHA IYMPSSGRIN IAGLKTTEAA EVAGKFTSL
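The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of how one might do that and then compute GC content or run a basic CDS sanity check. The helper names are hypothetical. The check only looks for an ATG start, as seen in this record; translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Drop the whitespace used for 10-character grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and it ends in a stop codon."""
    s = normalize(seq)
    return len(s) % 3 == 0 and s[:3] == "ATG" and s[-3:] in STOP_CODONS


# Toy input, not the gene above: paste the full "Gene sequence" value to check the record.
toy = "ATG GCC TGA"
print(looks_like_cds(toy))  # True
```

Applied to the full gene sequence, `gc_percent` can be compared against the reported GC content, and `len(normalize(...))` against the reported gene length.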
|
| |