Gene Smed_2578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2578 
Symbol 
ID5323446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2674641 
End bp2677697 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content64% 
IMG OID640791521 
Producthypothetical protein 
Protein accessionYP_001328243 
Protein GI150397776 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.54149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.14969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA TGCTGAAGAC GACACCCGAT TTCACGATCG AAGAGGCGCA AGCACTCCTT 
GCGCAGCATT TCGGACTGAA CGCGGCGCTC ACGCCGCTCG ACAGCGAGCG GGACCAGAAC
TTCAAAGTCA GCGCCGGCGA CGGCCGCAGC TATATCCTGA AGATCATCAA TGCGGCCGAG
CCCGAAATCG AAAGCGATTT TCAGACGGCC CTTCTGGCGC ATCTCGGCGC CAGGGCCGAC
ACTCTTCCGG TGCCGCATCT GCAGCCTGCC TTATCCGGCG AAAGCCTGGC TACGACGAGC
GCCAGGAGCG GCCTCGTTCA CCGGTTGCGC CTGGTCAGCT GGATCGAGGG CATGCCGCTT
GCCCAGTCCG AAAGGACCGA CGCGGCGCTT CGGTCCCTCG GCCACATGCT CGGCCGCTTC
GACGCCTCGC TCAAGGGCTT CATGCATCCT GGCGCACTTC GCGATCTCGA CTGGGACATC
CGCAATGCCG GACGCTCTGC CGGGCGGCTC CTGCATGTCG CCGATCCCCA GGATCGTGCT
CTTCTTCAAC GCTTTATCGA TCGGTTCGAA GAGCGCATCG CCCCGCGGCT GCCGATGCTG
CGTTCGGCCG TCATCCATAA CGACGCGAAC GACTGGAACG TGCTCGTCGG TGAGGACGAT
CGCAACCGCA TCTCCGGCAT CATTGACTTC GGCGATGCCC TCTATACGCC CGTCATGGCT
GAAGTCGCCA TCGCGGCCGC CTATGCCGGG CTCGATCATC CCGATCCGAT CGGCGCGGCC
GCTGCGATCG CCAATGGTTA TCACGCCGAA TACCCACTCC TCGAAGAGGA AGTCGACCTC
CTCTTCGACC TGATCGCCAT GCGACTGGTG ACCTCGGTGA CGATCTCCGC CTCGCGTCGG
GCGCATACCG GTGGCAACCC TTACCTCGCG ATCAGCGAAA GGCCGGCCTG GGCGCTCCTG
CGCAAGCTCG ATGCGATGAA CCCGCGCTTC GCGACGGCAA TCCTGAGAAA GGCCTGCGGT
TTCGAGGCTG TCGCCGGCGC CCGCGCCGTC GGCGCCTGGA TCGACCGCAA CCGCAAGAAC
CTGCTGGCGC TTCTCGACCG TCCAGCCGCA GCCTATGCCG CCGACATTGT CCCCTATGGC
GACCCTGCGC ATTCGATGAC CGTAAATTCG GCAGCGGCGC GGCCGCATGA GGCGCAATCG
GTTTGGACGG AGCATTGCCG TGGTACCGGT GTCGAGCTCG GCATCGGTCC CTGGGGTGAG
GCCCGCACCG TCTATTCGGG CGAAATGTTC GTCTCGCGCC TGCTCGAGAA GACCCGCCGC
TCGCGCCATC TCGGCCTCGA CCTCTTCAAG GCTGCAGGCA CGAAGGTCTA TACGCCGCTC
GCGGCGACGG TCGCGAGTGT CGAGATCGAG ACGGATCCGC TCGGCTATGG CTGCCTTGTC
GCGCTGCGCC ATGAACCGGA GGGTTGCCCG CCCTTCCTGA CGCTCTGGGG ACATCTTGCC
CATGAAGCTG TCGGTCGGCT GAAGGCCGGC GACACGCTGG AGGCCGGCGC GCTCGTCGGC
GAAATGGGCG CTCCGGAGGA AAACGGCGGC TGGGCGCCGC ATCTGCATCT GCAGATCTGC
ACGGACACGG GCCTTTCGGC GTCGGAAATC CTGGGCGTGG GCGAGGAACG CTATCTCGAC
GTCTGGTCAG AACTTTTCCC CGACGCAAGC GCATTTGCCG GTGTTGCTCC GGAATTCTAC
GAGCAGACCG GCCGCACCCA TGAGGAAATC GTCAGGCTTC GAAAGGACCT GCTGCTCTCG
AACCTGTCGA TCTCCTACGA AAAGCCGATA AAGTTCGTGC GCGGCGAAGG CGTCTGGCTC
ATCGACGATC GCGGTCGCGC CTATCTCGAC TGCTTCAACA ATGTCTGCCA CATCGGTCAC
GCCCATCCGG CCGTGGTGGA AGCGATCGCG CGCCAGGCCG CAACGCTCAA TACCAACACG
CGCTACCTCC ACGATAATAT CGTCGCCTAT GCCGAGCGGC TGACGTCGAC GTTGCCGAAG
GAACTCGCGA TTGCCGCCTT CGCCAATAGC GGCTCCGAAG CCAACAGCCT CGCCCTGCGC
CTGATGCGTG CGCACACGGG CCGCGAAAAC GCTCTCGTGC TCGACTGGGC CTATCACGGC
ACGACGCAGG AACTGATCGA TCTCAGCGCT TACAAATTTC GCCGCAAGGG CGGAAGGGGT
CCAAAATCGC ATGTGCACGT GGCAGCCGTC CCGGACAGTT ATCACGCCCC CGCCGATTGG
CCGGCCGAGG AGCATGGCAA ACGCTTTGCA GAAGACATCG CCGAACTGAT CGCGGCCATG
CGTGCCAGAG GCGAAGCGCC CGGCTTTTTC CTCGCCGAGT CCATTCCCAG CGTCGCAGGC
CAGGTGTTTC TGCCGGACGG GTACCTCAAG GAGGTCTACC GCATGGTTCG GGACGCCGGG
GGCGTCTGCA TCGCGGATGA GGTACAGGTC GGTTTCGGCC GGGTCGGCAG CCATTGGTGG
GCCTTCGAAA CGCAAGGCGT CGTCCCCGAC GTCGTCACAA TGGGCAAGCC GATCGGCGCG
GGTCATCCGC TCGCCGCCGT GGTCACCACG CGTGAGATCG CGGCCTCGTT CGACAACGGC
ATGGAATATT TCAACACCTT CGGCGGCAAT CCCGTGTCCT GCGCCGCCGG CCTCGCCGTG
CTCGACGTCA TCGAAGGCGA AGACTTGCGC CGCAACGCCC TTGAGATCGG CAATTATCTC
CTTGCCGCCT TCCGCTCGAT GCAGGAGCGC TATGAGGTCA TCGGCGACAT CAGGGGTCTC
GGCCTCTTTC TCGGCATAGA GCTCGTCAGC GACCGAAGCA CCAAGGCGCC GGCGACGGAG
ATCGCCCGGG CCGTCTCGAA CGGAGCACGG CAGCGCGGCG TCCTGATGGG CACGGAGGGA
CCGCATGACA ATGTCCTGAA GATGCGTCCG CCCATGATCT TTTCGAAGCG CGATGCCGAT
CACCTGATCG CCGTGCTCGC GGAGACATTC GAGGCCGTGC TCGCGCGAGC CGGATAG
 
Protein sequence
MNDMLKTTPD FTIEEAQALL AQHFGLNAAL TPLDSERDQN FKVSAGDGRS YILKIINAAE 
PEIESDFQTA LLAHLGARAD TLPVPHLQPA LSGESLATTS ARSGLVHRLR LVSWIEGMPL
AQSERTDAAL RSLGHMLGRF DASLKGFMHP GALRDLDWDI RNAGRSAGRL LHVADPQDRA
LLQRFIDRFE ERIAPRLPML RSAVIHNDAN DWNVLVGEDD RNRISGIIDF GDALYTPVMA
EVAIAAAYAG LDHPDPIGAA AAIANGYHAE YPLLEEEVDL LFDLIAMRLV TSVTISASRR
AHTGGNPYLA ISERPAWALL RKLDAMNPRF ATAILRKACG FEAVAGARAV GAWIDRNRKN
LLALLDRPAA AYAADIVPYG DPAHSMTVNS AAARPHEAQS VWTEHCRGTG VELGIGPWGE
ARTVYSGEMF VSRLLEKTRR SRHLGLDLFK AAGTKVYTPL AATVASVEIE TDPLGYGCLV
ALRHEPEGCP PFLTLWGHLA HEAVGRLKAG DTLEAGALVG EMGAPEENGG WAPHLHLQIC
TDTGLSASEI LGVGEERYLD VWSELFPDAS AFAGVAPEFY EQTGRTHEEI VRLRKDLLLS
NLSISYEKPI KFVRGEGVWL IDDRGRAYLD CFNNVCHIGH AHPAVVEAIA RQAATLNTNT
RYLHDNIVAY AERLTSTLPK ELAIAAFANS GSEANSLALR LMRAHTGREN ALVLDWAYHG
TTQELIDLSA YKFRRKGGRG PKSHVHVAAV PDSYHAPADW PAEEHGKRFA EDIAELIAAM
RARGEAPGFF LAESIPSVAG QVFLPDGYLK EVYRMVRDAG GVCIADEVQV GFGRVGSHWW
AFETQGVVPD VVTMGKPIGA GHPLAAVVTT REIAASFDNG MEYFNTFGGN PVSCAAGLAV
LDVIEGEDLR RNALEIGNYL LAAFRSMQER YEVIGDIRGL GLFLGIELVS DRSTKAPATE
IARAVSNGAR QRGVLMGTEG PHDNVLKMRP PMIFSKRDAD HLIAVLAETF EAVLARAG