Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2578 |
Symbol | |
ID | 5323446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2674641 |
End bp | 2677697 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791521 |
Product | hypothetical protein |
Protein accession | YP_001328243 |
Protein GI | 150397776 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.54149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.14969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA TGCTGAAGAC GACACCCGAT TTCACGATCG AAGAGGCGCA AGCACTCCTT GCGCAGCATT TCGGACTGAA CGCGGCGCTC ACGCCGCTCG ACAGCGAGCG GGACCAGAAC TTCAAAGTCA GCGCCGGCGA CGGCCGCAGC TATATCCTGA AGATCATCAA TGCGGCCGAG CCCGAAATCG AAAGCGATTT TCAGACGGCC CTTCTGGCGC ATCTCGGCGC CAGGGCCGAC ACTCTTCCGG TGCCGCATCT GCAGCCTGCC TTATCCGGCG AAAGCCTGGC TACGACGAGC GCCAGGAGCG GCCTCGTTCA CCGGTTGCGC CTGGTCAGCT GGATCGAGGG CATGCCGCTT GCCCAGTCCG AAAGGACCGA CGCGGCGCTT CGGTCCCTCG GCCACATGCT CGGCCGCTTC GACGCCTCGC TCAAGGGCTT CATGCATCCT GGCGCACTTC GCGATCTCGA CTGGGACATC CGCAATGCCG GACGCTCTGC CGGGCGGCTC CTGCATGTCG CCGATCCCCA GGATCGTGCT CTTCTTCAAC GCTTTATCGA TCGGTTCGAA GAGCGCATCG CCCCGCGGCT GCCGATGCTG CGTTCGGCCG TCATCCATAA CGACGCGAAC GACTGGAACG TGCTCGTCGG TGAGGACGAT CGCAACCGCA TCTCCGGCAT CATTGACTTC GGCGATGCCC TCTATACGCC CGTCATGGCT GAAGTCGCCA TCGCGGCCGC CTATGCCGGG CTCGATCATC CCGATCCGAT CGGCGCGGCC GCTGCGATCG CCAATGGTTA TCACGCCGAA TACCCACTCC TCGAAGAGGA AGTCGACCTC CTCTTCGACC TGATCGCCAT GCGACTGGTG ACCTCGGTGA CGATCTCCGC CTCGCGTCGG GCGCATACCG GTGGCAACCC TTACCTCGCG ATCAGCGAAA GGCCGGCCTG GGCGCTCCTG CGCAAGCTCG ATGCGATGAA CCCGCGCTTC GCGACGGCAA TCCTGAGAAA GGCCTGCGGT TTCGAGGCTG TCGCCGGCGC CCGCGCCGTC GGCGCCTGGA TCGACCGCAA CCGCAAGAAC CTGCTGGCGC TTCTCGACCG TCCAGCCGCA GCCTATGCCG CCGACATTGT CCCCTATGGC GACCCTGCGC ATTCGATGAC CGTAAATTCG GCAGCGGCGC GGCCGCATGA GGCGCAATCG GTTTGGACGG AGCATTGCCG TGGTACCGGT GTCGAGCTCG GCATCGGTCC CTGGGGTGAG GCCCGCACCG TCTATTCGGG CGAAATGTTC GTCTCGCGCC TGCTCGAGAA GACCCGCCGC TCGCGCCATC TCGGCCTCGA CCTCTTCAAG GCTGCAGGCA CGAAGGTCTA TACGCCGCTC GCGGCGACGG TCGCGAGTGT CGAGATCGAG ACGGATCCGC TCGGCTATGG CTGCCTTGTC GCGCTGCGCC ATGAACCGGA GGGTTGCCCG CCCTTCCTGA CGCTCTGGGG ACATCTTGCC CATGAAGCTG TCGGTCGGCT GAAGGCCGGC GACACGCTGG AGGCCGGCGC GCTCGTCGGC GAAATGGGCG CTCCGGAGGA AAACGGCGGC TGGGCGCCGC ATCTGCATCT GCAGATCTGC ACGGACACGG GCCTTTCGGC GTCGGAAATC CTGGGCGTGG GCGAGGAACG CTATCTCGAC GTCTGGTCAG AACTTTTCCC CGACGCAAGC GCATTTGCCG GTGTTGCTCC GGAATTCTAC GAGCAGACCG GCCGCACCCA TGAGGAAATC GTCAGGCTTC GAAAGGACCT GCTGCTCTCG AACCTGTCGA TCTCCTACGA AAAGCCGATA AAGTTCGTGC GCGGCGAAGG CGTCTGGCTC ATCGACGATC GCGGTCGCGC CTATCTCGAC TGCTTCAACA ATGTCTGCCA CATCGGTCAC GCCCATCCGG CCGTGGTGGA AGCGATCGCG CGCCAGGCCG CAACGCTCAA TACCAACACG CGCTACCTCC ACGATAATAT CGTCGCCTAT GCCGAGCGGC TGACGTCGAC GTTGCCGAAG GAACTCGCGA TTGCCGCCTT CGCCAATAGC GGCTCCGAAG CCAACAGCCT CGCCCTGCGC CTGATGCGTG CGCACACGGG CCGCGAAAAC GCTCTCGTGC TCGACTGGGC CTATCACGGC ACGACGCAGG AACTGATCGA TCTCAGCGCT TACAAATTTC GCCGCAAGGG CGGAAGGGGT CCAAAATCGC ATGTGCACGT GGCAGCCGTC CCGGACAGTT ATCACGCCCC CGCCGATTGG CCGGCCGAGG AGCATGGCAA ACGCTTTGCA GAAGACATCG CCGAACTGAT CGCGGCCATG CGTGCCAGAG GCGAAGCGCC CGGCTTTTTC CTCGCCGAGT CCATTCCCAG CGTCGCAGGC CAGGTGTTTC TGCCGGACGG GTACCTCAAG GAGGTCTACC GCATGGTTCG GGACGCCGGG GGCGTCTGCA TCGCGGATGA GGTACAGGTC GGTTTCGGCC GGGTCGGCAG CCATTGGTGG GCCTTCGAAA CGCAAGGCGT CGTCCCCGAC GTCGTCACAA TGGGCAAGCC GATCGGCGCG GGTCATCCGC TCGCCGCCGT GGTCACCACG CGTGAGATCG CGGCCTCGTT CGACAACGGC ATGGAATATT TCAACACCTT CGGCGGCAAT CCCGTGTCCT GCGCCGCCGG CCTCGCCGTG CTCGACGTCA TCGAAGGCGA AGACTTGCGC CGCAACGCCC TTGAGATCGG CAATTATCTC CTTGCCGCCT TCCGCTCGAT GCAGGAGCGC TATGAGGTCA TCGGCGACAT CAGGGGTCTC GGCCTCTTTC TCGGCATAGA GCTCGTCAGC GACCGAAGCA CCAAGGCGCC GGCGACGGAG ATCGCCCGGG CCGTCTCGAA CGGAGCACGG CAGCGCGGCG TCCTGATGGG CACGGAGGGA CCGCATGACA ATGTCCTGAA GATGCGTCCG CCCATGATCT TTTCGAAGCG CGATGCCGAT CACCTGATCG CCGTGCTCGC GGAGACATTC GAGGCCGTGC TCGCGCGAGC CGGATAG
|
Protein sequence | MNDMLKTTPD FTIEEAQALL AQHFGLNAAL TPLDSERDQN FKVSAGDGRS YILKIINAAE PEIESDFQTA LLAHLGARAD TLPVPHLQPA LSGESLATTS ARSGLVHRLR LVSWIEGMPL AQSERTDAAL RSLGHMLGRF DASLKGFMHP GALRDLDWDI RNAGRSAGRL LHVADPQDRA LLQRFIDRFE ERIAPRLPML RSAVIHNDAN DWNVLVGEDD RNRISGIIDF GDALYTPVMA EVAIAAAYAG LDHPDPIGAA AAIANGYHAE YPLLEEEVDL LFDLIAMRLV TSVTISASRR AHTGGNPYLA ISERPAWALL RKLDAMNPRF ATAILRKACG FEAVAGARAV GAWIDRNRKN LLALLDRPAA AYAADIVPYG DPAHSMTVNS AAARPHEAQS VWTEHCRGTG VELGIGPWGE ARTVYSGEMF VSRLLEKTRR SRHLGLDLFK AAGTKVYTPL AATVASVEIE TDPLGYGCLV ALRHEPEGCP PFLTLWGHLA HEAVGRLKAG DTLEAGALVG EMGAPEENGG WAPHLHLQIC TDTGLSASEI LGVGEERYLD VWSELFPDAS AFAGVAPEFY EQTGRTHEEI VRLRKDLLLS NLSISYEKPI KFVRGEGVWL IDDRGRAYLD CFNNVCHIGH AHPAVVEAIA RQAATLNTNT RYLHDNIVAY AERLTSTLPK ELAIAAFANS GSEANSLALR LMRAHTGREN ALVLDWAYHG TTQELIDLSA YKFRRKGGRG PKSHVHVAAV PDSYHAPADW PAEEHGKRFA EDIAELIAAM RARGEAPGFF LAESIPSVAG QVFLPDGYLK EVYRMVRDAG GVCIADEVQV GFGRVGSHWW AFETQGVVPD VVTMGKPIGA GHPLAAVVTT REIAASFDNG MEYFNTFGGN PVSCAAGLAV LDVIEGEDLR RNALEIGNYL LAAFRSMQER YEVIGDIRGL GLFLGIELVS DRSTKAPATE IARAVSNGAR QRGVLMGTEG PHDNVLKMRP PMIFSKRDAD HLIAVLAETF EAVLARAG
|
| |