Gene Information |
Locus tag | Smed_5171 |
Symbol | |
ID | 5319473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 124772 |
End bp | 125623 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640776949 |
Product | aminotransferase class IV |
Protein accession | YP_001313881 |
Protein GI | 150377286 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR01121] D-amino acid aminotransferase |
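The coordinate and length fields above are mutually consistent, and that relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions match the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose final codon is a stop codon
    (not translated into an amino acid).
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(124772, 125623)
print(length, protein_length_aa(length))  # prints: 852 283
```

The result agrees with the Gene Length (852 bp) and Protein Length (283 aa) fields.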
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.623088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.871234 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGGA TCGTCTACGT CCATGGTCAA TTCATACCCG AGGAGGAAGC CCGGATCGGG CTTTTTGACC GGGGTTTTCT GTTCGGCGAT GCCGTCTACG AGGTAACGGC GGTGATCGGC GGGCGGATGA TCGATAACGA TCTGCATCTC GGTCGGCTCT GCCGTTCGCT GAAGGAGCTC GCCATCCCGC TCGCGCTCTC CCTGGAGCAA ATTGCAGGCA TTCAAGCGGA ACTGATCGCC CGCAACGACT TGCGGGACGG CACGGTCTAT CTGCAGGTCT CGCGCGGCGA GGCGGATCGC GACTTTCTCT ATTCCGAGGC ACTGGCGCCG AAGCTTGTCG GCTTCACCCA GGCGAAGATG CTGAGCGGCA CGAGGGCCCA GCGGGACGGC ATCGCGGTCG ATCTGGCGGA CGATCCGCGC TGGAAACGGC GTGACATCAA AACCGCCATG CTTCTCGGCC AGGTCATGGC CAAACAAGCG GCACGCGCCC GCGGTTTTGA CGATGTCTGG CTGGTGGAGA ACGGGCTGGT GACCGAGGGC GCCAGCTCCA CGGCCCATGT CATAACGGCC GAGGGGCACA TCCTCACGCG CGCCGCTTCG CATGTGACGC TGCCGGGCTG TACGCAGCGT GCTCTTGCCA AGCTCTGCGC GGCCGAGGGG CTACAAATCG TGGAGCGTGC CTTCGCGCCC GGCGAGGCGC AGGCTGCCGC GGAGGCGTTT CAGACCTCCG CGTCCAGCCT GGTGACCCCG GTCGTGCGGA TCGGCGTGCG GCTTGTCGGC AACGGCAGGC CCGGACCGAT GACCCGGAAA CTCCAGGCTC TATATCTGGA AGCGGCCGGC ATTGCGCACT AA
|
Protein sequence | MGRIVYVHGQ FIPEEEARIG LFDRGFLFGD AVYEVTAVIG GRMIDNDLHL GRLCRSLKEL AIPLALSLEQ IAGIQAELIA RNDLRDGTVY LQVSRGEADR DFLYSEALAP KLVGFTQAKM LSGTRAQRDG IAVDLADDPR WKRRDIKTAM LLGQVMAKQA ARARGFDDVW LVENGLVTEG ASSTAHVITA EGHILTRAAS HVTLPGCTQR ALAKLCAAEG LQIVERAFAP GEAQAAAEAF QTSASSLVTP VVRIGVRLVG NGRPGPMTRK LQALYLEAAG IAH
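The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field could be cleaned and its GC content computed is shown here; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence in this record.
head = clean_sequence("ATGGGCAGGA TCGTCTACGT")
print(len(head), gc_percent(head))  # prints: 20 55.0
```

Applied to the whole gene sequence, the same functions could be used to compare against the Gene Length and GC content fields of the record.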