Gene Information |
Locus tag | Smed_2580 |
Symbol | |
ID | 5323448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2678871 |
End bp | 2681798 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640791523 |
Product | hypothetical protein |
Protein accession | YP_001328245 |
Protein GI | 150397778 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases; [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.458682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTAGACG AATTGCTGAA GCACCGGACG CAGCTTGCAA GACCGGACGT GTCGACCGAC GAAGCGCAGG CGGTCCTGGC CCAACACTAC GGTCTTTCCG GTGATCTTGC CGAACTCGGC AGCCAGCAGG ATCGCAACTT CCGCGTCGAC GCAGACGAAG GGCGTTTCGT TCTGAAGGTA ACGCGCGTCG AATATGCGCG GGTGGAGATC GAAGCCCAGA ATGCCGCATT GCGGCATGTC GGCGCAAAGC CCGGCGCGCC GAAAGTGCCG GAAGTGGTGC CCTCTCTCGG CGGTGAGGAC ATCGTCTCCG CAGCCGTTCG CGAGGAGACC TATCATGTCC GCTTGCTCAC CTATCTCGAG GGTAGCCCTC TGACGCGCCG CAGGCACCTG GGCGCTGAAT CGGTGGCGGC ACTCGGCGAT GTTGCGGGAC AACTCGCCGC CGCACTCAAG GATTTCGATC ATCCTGGGCT CGAACGAGAG CTGCAATGGG ACCTCAGAAG GGCGGGGCCG GTCGCGCTGC ACCTGCTTTC GGCCATGGCC GACGTGGACC TGCGCAAGCG CATTGCCGAG GCGATGATCG GTGCGATGCG CAAAGTTCAG CCCTTGATGC CGGAGCTGCG CCTGCAGGCG GTCCATCAGG ACGTTACCGA CGACAATGTC GTGAGCCGGG TGGACAGCGG CGGGCGGCTC ATCCCCGACG GTGTCATCGA TTTCGGCGAC GTCCTGAAGG GCTGGGTGGT CGCCGATCTC GCCGTCACCT GCGCATCGTT GCTGCACCAT GCGGGCGGCG ATCCCTTCAC TATCCTGCCG GCGGTGAAGG CGTTCCATGC GGCTTACCCG CTGACCGATG CCGAATTGAC GGCCCTCTGG CCGCTGATCG TCGCACGCGC CTGCGTTCTC GTCGCGAGCT CGGCGCACCA GCTCGAGGTC GATCCGGAAA ATGCCTATGC CGCAAGCAAT GCCGCGCATG AGCGGGAAAT CTTCGACGTT GCGGTTTCGG TGCCGAACGA GCTGATGGAG CATGCGATCA GAAAAGCTGT GGCAAGGGAA CGGCAGCCAG CGCCATCCGA GATGCACGGC CGCCTGTTGC CGGACCTCAA TGCCGCAAGT GTCGGCATCG TCGATCTCTC CGTGCTCGGG CCGCACCTTC CTGCCGATCG CTGGCACTAC GAGGACACCG AAGCGCTGCT CCTGCAGTCG GCGGCGCGCG CGGCCGGCGC CGCGGCAACC CGCTATGGCG AGTACCGGCT TACCGAAACG CGGCTCCTGC AGGCAAGCGC GCCACGAACA TTCGCGCTCC ACGTGGATTT GTGCCTACAC GGGCAGACCG CCGTGCATGC GCCTTTCGCG GGCCGGCTCC ACCAGGGCGG CGGCAAGCTG ATCCTCTCGG GAGAGGGTCT TCACCTCCAT CTTTACGGTG TCGAGGCGGA CGATCCTGCC GAAGGTACGT TGGAGCCGGG CGCGAGGATC GGATTGGTCC CCGGCGAACC GTCCGCATTG AGGTTCCTGC GCGTACAGCT TTGCACTGTG CTGGATATGG ATCCGCCCGC CTTCGCTGCC CCGCATCAGG CGGAGGCATG GGGCCGGCTC TGCCCATCGC CTGAGACGAT CTTGGGTTTC GGATGCGATG CGCCCTTGCC TGACGCAGCC GCGCTCCTGC AGCGCCGTCA CCGGCATTAT GCACGGCCGC AGAAGAACTA TTATCGCATG CCGCCGCAGA TCGAGCGCGG GTGGAAGGAG CACCTCTTCG ACCTCGAGGG GCGCGCCTAT CTCGACATGG TCAACAATGT CACGCTCGTC 
GGCCACGGGC ACTCCCGATT GTCCGCTGCG GTCGGGCGGC AATGGTCCTT GCTCAACACC AATTCGCGGT TTCACTATGC CGCCGTGGCG GAATTCTCGG AACGCCTGGC GGCACTGGCG CCAGAGGGGC TCGACACGGT CTTTCTCGTC AACAGCGGTT CGGAGGCGAA CGATCTCGCC CTCCGGCTCG CCTGGGCCGC TTCCGGCGCG CGCAATGTAG TCTCGCTACT CGAGGCCTAT CACGGCTGGA CGGTTGCAAG CGACGCCGTT TCCACCTCGA TCGCCGACAA CCCGCAAGCG CTGACGACGC GTCCGGACTG GGTGCATCCG GTCGTCTCAC CGAATACCTA TCGCGGTCCG TTCCGCGGGG AGGGATCGAC GGGCGACTAT GTGGATGCCG TCTCTCGAAA GCTCCGGGAA CTCGACGAGA AGGGCGGGAA GCTCGCCGGC TTCATCTCAG AGCCCGTCTA CGGCAATGCC GGAGGCATTC CGCTTCCGCC GGGCTATCTG GAAGCGGTCT ATGCCCTGGT GCGAGCGAGG GGCGGCGTCT GCATCGCCGA CGAGGTGCAG GTCGGCTACG GCCGGCTCGG CCATTATTTC TGGGGTTTCG AGCAACAGGG CGTGGTGCCC GATATCATCA CCGTCGCAAA GGGTATGGGC AACGGCCACC CGCTGGGCGC CGTGATCACC AGGCGCACAA TTGCCGATGC GCTGGAGGAG GAAGGCTATT TCTTCTCCTC GGCCGGCGGC AGCCCCGTGA GTTCGGTGGT CGGCCTGACC GTCCTCGACA TCCTTCACGA CGAGGCCCTG ACGGAGAATG CCCGGTCCGT GGGCGACTAC CTCAAGGGGC GCCTCGAGGC GCTCGTGGAG CGGTTTCCGC TCGCCGGCGC CGTTCACGGC ATGGGGCTCT ATCTGGGCGT CGAATTCGTC CGGGACCGCG AAACGCTCGA ACCCGCCACG GAAGAGACGG CCGCGATCTG CGACCGCCTT CTCGACCTCG GCGTTATCAT GCAGCCGACC GGAGACCATT TGAACGTCCT GAAGATCAAG CCGCCGCTCT GCCTCGCCCG GGAGAGCGCG GATTTCTTCG CCGACACGCT GGGCAGGGTG CTCGAAGAGG GGTGGTAA
Protein sequence | MVDELLKHRT QLARPDVSTD EAQAVLAQHY GLSGDLAELG SQQDRNFRVD ADEGRFVLKV TRVEYARVEI EAQNAALRHV GAKPGAPKVP EVVPSLGGED IVSAAVREET YHVRLLTYLE GSPLTRRRHL GAESVAALGD VAGQLAAALK DFDHPGLERE LQWDLRRAGP VALHLLSAMA DVDLRKRIAE AMIGAMRKVQ PLMPELRLQA VHQDVTDDNV VSRVDSGGRL IPDGVIDFGD VLKGWVVADL AVTCASLLHH AGGDPFTILP AVKAFHAAYP LTDAELTALW PLIVARACVL VASSAHQLEV DPENAYAASN AAHEREIFDV AVSVPNELME HAIRKAVARE RQPAPSEMHG RLLPDLNAAS VGIVDLSVLG PHLPADRWHY EDTEALLLQS AARAAGAAAT RYGEYRLTET RLLQASAPRT FALHVDLCLH GQTAVHAPFA GRLHQGGGKL ILSGEGLHLH LYGVEADDPA EGTLEPGARI GLVPGEPSAL RFLRVQLCTV LDMDPPAFAA PHQAEAWGRL CPSPETILGF GCDAPLPDAA ALLQRRHRHY ARPQKNYYRM PPQIERGWKE HLFDLEGRAY LDMVNNVTLV GHGHSRLSAA VGRQWSLLNT NSRFHYAAVA EFSERLAALA PEGLDTVFLV NSGSEANDLA LRLAWAASGA RNVVSLLEAY HGWTVASDAV STSIADNPQA LTTRPDWVHP VVSPNTYRGP FRGEGSTGDY VDAVSRKLRE LDEKGGKLAG FISEPVYGNA GGIPLPPGYL EAVYALVRAR GGVCIADEVQ VGYGRLGHYF WGFEQQGVVP DIITVAKGMG NGHPLGAVIT RRTIADALEE EGYFFSSAGG SPVSSVVGLT VLDILHDEAL TENARSVGDY LKGRLEALVE RFPLAGAVHG MGLYLGVEFV RDRETLEPAT EETAAICDRL LDLGVIMQPT GDHLNVLKIK PPLCLARESA DFFADTLGRV LEEGW
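The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions agree with the values listed above. The `gc_percent` helper shows one common way a GC percentage can be computed from a sequence written in space-separated blocks; it is not necessarily how the 65% figure in the record was derived.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as in the blocked sequence field) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length_bp = gene_length(2678871, 2681798)
print(length_bp)                  # 2928, matching "Gene Length"
print(protein_length(length_bp))  # 975, matching "Protein Length"
```

To apply `gc_percent` to this gene, pass the full text of the Gene sequence field as a single string; the spaces between the 10-base blocks are skipped automatically.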