Gene Smed_2580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2580 
Symbol 
ID5323448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2678871 
End bp2681798 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content65% 
IMG OID640791523 
Producthypothetical protein 
Protein accessionYP_001328245 
Protein GI150397778 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.458682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGACG AATTGCTGAA GCACCGGACG CAGCTTGCAA GACCGGACGT GTCGACCGAC 
GAAGCGCAGG CGGTCCTGGC CCAACACTAC GGTCTTTCCG GTGATCTTGC CGAACTCGGC
AGCCAGCAGG ATCGCAACTT CCGCGTCGAC GCAGACGAAG GGCGTTTCGT TCTGAAGGTA
ACGCGCGTCG AATATGCGCG GGTGGAGATC GAAGCCCAGA ATGCCGCATT GCGGCATGTC
GGCGCAAAGC CCGGCGCGCC GAAAGTGCCG GAAGTGGTGC CCTCTCTCGG CGGTGAGGAC
ATCGTCTCCG CAGCCGTTCG CGAGGAGACC TATCATGTCC GCTTGCTCAC CTATCTCGAG
GGTAGCCCTC TGACGCGCCG CAGGCACCTG GGCGCTGAAT CGGTGGCGGC ACTCGGCGAT
GTTGCGGGAC AACTCGCCGC CGCACTCAAG GATTTCGATC ATCCTGGGCT CGAACGAGAG
CTGCAATGGG ACCTCAGAAG GGCGGGGCCG GTCGCGCTGC ACCTGCTTTC GGCCATGGCC
GACGTGGACC TGCGCAAGCG CATTGCCGAG GCGATGATCG GTGCGATGCG CAAAGTTCAG
CCCTTGATGC CGGAGCTGCG CCTGCAGGCG GTCCATCAGG ACGTTACCGA CGACAATGTC
GTGAGCCGGG TGGACAGCGG CGGGCGGCTC ATCCCCGACG GTGTCATCGA TTTCGGCGAC
GTCCTGAAGG GCTGGGTGGT CGCCGATCTC GCCGTCACCT GCGCATCGTT GCTGCACCAT
GCGGGCGGCG ATCCCTTCAC TATCCTGCCG GCGGTGAAGG CGTTCCATGC GGCTTACCCG
CTGACCGATG CCGAATTGAC GGCCCTCTGG CCGCTGATCG TCGCACGCGC CTGCGTTCTC
GTCGCGAGCT CGGCGCACCA GCTCGAGGTC GATCCGGAAA ATGCCTATGC CGCAAGCAAT
GCCGCGCATG AGCGGGAAAT CTTCGACGTT GCGGTTTCGG TGCCGAACGA GCTGATGGAG
CATGCGATCA GAAAAGCTGT GGCAAGGGAA CGGCAGCCAG CGCCATCCGA GATGCACGGC
CGCCTGTTGC CGGACCTCAA TGCCGCAAGT GTCGGCATCG TCGATCTCTC CGTGCTCGGG
CCGCACCTTC CTGCCGATCG CTGGCACTAC GAGGACACCG AAGCGCTGCT CCTGCAGTCG
GCGGCGCGCG CGGCCGGCGC CGCGGCAACC CGCTATGGCG AGTACCGGCT TACCGAAACG
CGGCTCCTGC AGGCAAGCGC GCCACGAACA TTCGCGCTCC ACGTGGATTT GTGCCTACAC
GGGCAGACCG CCGTGCATGC GCCTTTCGCG GGCCGGCTCC ACCAGGGCGG CGGCAAGCTG
ATCCTCTCGG GAGAGGGTCT TCACCTCCAT CTTTACGGTG TCGAGGCGGA CGATCCTGCC
GAAGGTACGT TGGAGCCGGG CGCGAGGATC GGATTGGTCC CCGGCGAACC GTCCGCATTG
AGGTTCCTGC GCGTACAGCT TTGCACTGTG CTGGATATGG ATCCGCCCGC CTTCGCTGCC
CCGCATCAGG CGGAGGCATG GGGCCGGCTC TGCCCATCGC CTGAGACGAT CTTGGGTTTC
GGATGCGATG CGCCCTTGCC TGACGCAGCC GCGCTCCTGC AGCGCCGTCA CCGGCATTAT
GCACGGCCGC AGAAGAACTA TTATCGCATG CCGCCGCAGA TCGAGCGCGG GTGGAAGGAG
CACCTCTTCG ACCTCGAGGG GCGCGCCTAT CTCGACATGG TCAACAATGT CACGCTCGTC
GGCCACGGGC ACTCCCGATT GTCCGCTGCG GTCGGGCGGC AATGGTCCTT GCTCAACACC
AATTCGCGGT TTCACTATGC CGCCGTGGCG GAATTCTCGG AACGCCTGGC GGCACTGGCG
CCAGAGGGGC TCGACACGGT CTTTCTCGTC AACAGCGGTT CGGAGGCGAA CGATCTCGCC
CTCCGGCTCG CCTGGGCCGC TTCCGGCGCG CGCAATGTAG TCTCGCTACT CGAGGCCTAT
CACGGCTGGA CGGTTGCAAG CGACGCCGTT TCCACCTCGA TCGCCGACAA CCCGCAAGCG
CTGACGACGC GTCCGGACTG GGTGCATCCG GTCGTCTCAC CGAATACCTA TCGCGGTCCG
TTCCGCGGGG AGGGATCGAC GGGCGACTAT GTGGATGCCG TCTCTCGAAA GCTCCGGGAA
CTCGACGAGA AGGGCGGGAA GCTCGCCGGC TTCATCTCAG AGCCCGTCTA CGGCAATGCC
GGAGGCATTC CGCTTCCGCC GGGCTATCTG GAAGCGGTCT ATGCCCTGGT GCGAGCGAGG
GGCGGCGTCT GCATCGCCGA CGAGGTGCAG GTCGGCTACG GCCGGCTCGG CCATTATTTC
TGGGGTTTCG AGCAACAGGG CGTGGTGCCC GATATCATCA CCGTCGCAAA GGGTATGGGC
AACGGCCACC CGCTGGGCGC CGTGATCACC AGGCGCACAA TTGCCGATGC GCTGGAGGAG
GAAGGCTATT TCTTCTCCTC GGCCGGCGGC AGCCCCGTGA GTTCGGTGGT CGGCCTGACC
GTCCTCGACA TCCTTCACGA CGAGGCCCTG ACGGAGAATG CCCGGTCCGT GGGCGACTAC
CTCAAGGGGC GCCTCGAGGC GCTCGTGGAG CGGTTTCCGC TCGCCGGCGC CGTTCACGGC
ATGGGGCTCT ATCTGGGCGT CGAATTCGTC CGGGACCGCG AAACGCTCGA ACCCGCCACG
GAAGAGACGG CCGCGATCTG CGACCGCCTT CTCGACCTCG GCGTTATCAT GCAGCCGACC
GGAGACCATT TGAACGTCCT GAAGATCAAG CCGCCGCTCT GCCTCGCCCG GGAGAGCGCG
GATTTCTTCG CCGACACGCT GGGCAGGGTG CTCGAAGAGG GGTGGTAA
 
Protein sequence
MVDELLKHRT QLARPDVSTD EAQAVLAQHY GLSGDLAELG SQQDRNFRVD ADEGRFVLKV 
TRVEYARVEI EAQNAALRHV GAKPGAPKVP EVVPSLGGED IVSAAVREET YHVRLLTYLE
GSPLTRRRHL GAESVAALGD VAGQLAAALK DFDHPGLERE LQWDLRRAGP VALHLLSAMA
DVDLRKRIAE AMIGAMRKVQ PLMPELRLQA VHQDVTDDNV VSRVDSGGRL IPDGVIDFGD
VLKGWVVADL AVTCASLLHH AGGDPFTILP AVKAFHAAYP LTDAELTALW PLIVARACVL
VASSAHQLEV DPENAYAASN AAHEREIFDV AVSVPNELME HAIRKAVARE RQPAPSEMHG
RLLPDLNAAS VGIVDLSVLG PHLPADRWHY EDTEALLLQS AARAAGAAAT RYGEYRLTET
RLLQASAPRT FALHVDLCLH GQTAVHAPFA GRLHQGGGKL ILSGEGLHLH LYGVEADDPA
EGTLEPGARI GLVPGEPSAL RFLRVQLCTV LDMDPPAFAA PHQAEAWGRL CPSPETILGF
GCDAPLPDAA ALLQRRHRHY ARPQKNYYRM PPQIERGWKE HLFDLEGRAY LDMVNNVTLV
GHGHSRLSAA VGRQWSLLNT NSRFHYAAVA EFSERLAALA PEGLDTVFLV NSGSEANDLA
LRLAWAASGA RNVVSLLEAY HGWTVASDAV STSIADNPQA LTTRPDWVHP VVSPNTYRGP
FRGEGSTGDY VDAVSRKLRE LDEKGGKLAG FISEPVYGNA GGIPLPPGYL EAVYALVRAR
GGVCIADEVQ VGYGRLGHYF WGFEQQGVVP DIITVAKGMG NGHPLGAVIT RRTIADALEE
EGYFFSSAGG SPVSSVVGLT VLDILHDEAL TENARSVGDY LKGRLEALVE RFPLAGAVHG
MGLYLGVEFV RDRETLEPAT EETAAICDRL LDLGVIMQPT GDHLNVLKIK PPLCLARESA
DFFADTLGRV LEEGW