Gene Smed_1469 details

Gene Information

Locus tag: Smed_1469
Symbol:
ID: 5322327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1552027
End bp: 1553235
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 640790417
Product: aminotransferase class V
Protein accession: YP_001327149
Protein GI: 150396682
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
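
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1552027, 1553235)
residues = protein_length_aa(length)
```

With the values in this record, the inclusive span works out to 1209 bp, which is 403 codons, or 402 residues once the stop codon is excluded. Both figures agree with the Gene Length and Protein Length fields.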


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.747649
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0112306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGATCG ATGCGATCGG CCGAGGCTCT GGTAAAGGAT CGATGTCGGG ATCGCGCATC 
TATATGGACT GGAACGCCAC GGCGCCGCTT CTGCCCGAAG CGCGCGAGGC GCTCGTGTCC
GCGCTCGATC ATCTCGGCAA TCCTTCTTCT GTTCACGGCG AGGGGCGTGC TGTCCGGGCT
CTTGTCGAGA GCGCCCGGCG CGATATTGCC GCGCTCTGCG GCGCGCAAGC CGCGGCGGTC
GTCTTCACCA GTGGCGCCAC GGAAGCGGCG AATATGGTGC TGACGCCGGA GTTTCGCATG
GGGCGCACGC CGCTGAAGGC GGGGAAGCTC TATGTTTCGG CGATCGAGCA TCCGGCAGTT
CGAGAGGGCG GGCGCTTCTC GCGCGATAAT ACTTGTGAGA TAGCGGTGAC AAGCGCCGGG
ATCATCGACT GCGCAGCGCT CGAAACCCAG CTGCAGGCGC ATGACCGTGA GACCGGTCTA
CCCATGGTGG CGGTCATGCT TGCCAACAAT GAAACGGGCG TTGTCCAGCC GATCGCGGAC
GTTGCGGCCA TCGTGCGCGC GCATGGCGGC ATACTGGTCG TCGATGCGGT GCAGGCGGCC
GGGCGCCTCC CGCTTTCGAT CGAGGCGCTC GGTGCGGATT TCCTCATCCT GTCGTCGCAT
AAGATCGGCG GACCGAAGGG GGCCGGCGCC CTCGTCGCCC GAGGCGAAAT CATGATGCCG
TCTTCCTTGA TCCGCGGAGG CGGACAGGAA AAGGGACATC GTTCGGGGAC CGAGAATGCA
GCGGCACTGG CGGGCTTTGC GGCCGCGGCC CGCGCCGCCG CCAGGGATAT AGATGGGCGG
ATGGCCGCTG TCGCCGCAAT GCGCGACAGT CTCGAAATGA AAATGCGATC CAGCGCGCCT
GACGTGATCA TCCATGGGCA GAGCGTTTCG CGCCTCGCAA ATACGTGCTT CTTTACCCTT
CCAGGCCTCA AGGCGGAAAC GGGGCAGATC GCATTCGATC TCGAGGGCGT CGCCTTGTCG
GCAGGTTCGG CCTGTTCATC CGGGAAGGTC GGGCAAAGTC ACGTTCTGAC GGCTATGGGC
TACGATCCGC GACAGGGCGC GCTCAGGATT TCGATCGGCG AAGCGACGAC GCAGGTGGAA
ATCGAGCGCT GCGCCGCGAT ATTCGCGAAA GTGGCGGCGC GGCGACCCTC GACCGGACAG
GCGGCCTGA
 
Protein sequence
MPIDAIGRGS GKGSMSGSRI YMDWNATAPL LPEAREALVS ALDHLGNPSS VHGEGRAVRA 
LVESARRDIA ALCGAQAAAV VFTSGATEAA NMVLTPEFRM GRTPLKAGKL YVSAIEHPAV
REGGRFSRDN TCEIAVTSAG IIDCAALETQ LQAHDRETGL PMVAVMLANN ETGVVQPIAD
VAAIVRAHGG ILVVDAVQAA GRLPLSIEAL GADFLILSSH KIGGPKGAGA LVARGEIMMP
SSLIRGGGQE KGHRSGTENA AALAGFAAAA RAAARDIDGR MAAVAAMRDS LEMKMRSSAP
DVIIHGQSVS RLANTCFFTL PGLKAETGQI AFDLEGVALS AGSACSSGKV GQSHVLTAMG
YDPRQGALRI SIGEATTQVE IERCAAIFAK VAARRPSTGQ AA
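
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that clean-up plus two simple checks (GC fraction and start/stop codons). The helper names are hypothetical, and the start codon set lists only the three most common initiators of translation table 11, which permits others as well.

```python
# Common bacterial start codons (subset of translation table 11 initiators).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_valid_orf_ends(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First twenty bases and the last line of the gene sequence above.
fragment = clean_sequence("GTGCCGATCG ATGCGATCGG \nGCGGCCTGA")
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare against the GC content field. The gene begins with GTG rather than ATG, which is why the protein sequence still starts with M: an initiator codon is translated as methionine.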