Gene Smed_4215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4215 
Symbol 
ID5319225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp698195 
End bp699187 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content63% 
IMG OID640776020 
ProductNMT1/THI5-like domain-containing protein 
Protein accessionYP_001312953 
Protein GI150376357 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCT CCATTCGCGC CTTCGCATTG GGCAGCCTGG TCGCAATCGG CTTCTCCAGC 
CTTGCGGCAG CGGAAGCTCA AGAACCGGTG ACGCTTCGGT TTCTCGCGAG CCAGGGCAGC
TTATCGCCGC ACGAACTCGC CTATGAACTC GGCTATTTCG ATGGCCTCGG AATCAAGCTC
GAGAATGTCG GTTATGCGGG AGGCGGACCG GCGTCCCTGT TCGCGCTTGC TTCCGGCAGC
GTCGACATCG GCTCGGCGGC GACGGCAGCC GTGATCAACT CGATTGCCGG CGGCAACGAT
TTCGTCGCGG CCTTTCCGAC CAATGGCATC AACGAGCAGG TCAAGAGCAT CTTCTACGTG
CTCGAGGACA GCCCTATCAG GACGATCGAG GATATCGCGG GTAAGACGAT CGCCGTGAAT
ACGCTTGGGG CCCATCTCGA CTACGCGATA CGCGAGGCGC TGCATAGCAA CGGCCTGCCC
GAAAACGCAG CCAATCTCGT CGTCGTGCCC GGACCTCAGC TCGAACAGAC GCTCCGGTCG
AACCAGGTGG ACATCTCCGC CGTCGGATAC TGGCAGGCGA CGTTCAATGG GCAGCTCGTC
GCCAATGGCG GAGTACGCGC GGTCTTCGAC GACACGGATG TGCTCGGCGA GATTGCCGGA
GGTTTCGCGG TTCTGCGCCG CGATTTCGTG GAAAAGAATC CGGACGCAGC CAGGCGCTTC
GTAGAGCAGT CCGCGCGCGC CGCCGACTGG TCGCGTGAGC ATCCGGACGA AGCACGTGCC
TTGCTTGCCC GCATTCTGAC CGAGCGCGGC GAGAACGGCG ATCTCGCGAA GCATTGGACC
GGATTCGGCC TGCGGAAAGG CGCGAAAGCG ACCGAGCGGG ATCTGGATTT CTGGATCGGC
GTCCTCGAAC GCGAGGGCAG CCTGCCCCGA GGCAAATACA AGGCTTCCGA TCTTTTGTTC
CGGCCGGACG CCAAGTCCGC CGCGTCGAAT TGA
 
Protein sequence
MTISIRAFAL GSLVAIGFSS LAAAEAQEPV TLRFLASQGS LSPHELAYEL GYFDGLGIKL 
ENVGYAGGGP ASLFALASGS VDIGSAATAA VINSIAGGND FVAAFPTNGI NEQVKSIFYV
LEDSPIRTIE DIAGKTIAVN TLGAHLDYAI REALHSNGLP ENAANLVVVP GPQLEQTLRS
NQVDISAVGY WQATFNGQLV ANGGVRAVFD DTDVLGEIAG GFAVLRRDFV EKNPDAARRF
VEQSARAADW SREHPDEARA LLARILTERG ENGDLAKHWT GFGLRKGAKA TERDLDFWIG
VLEREGSLPR GKYKASDLLF RPDAKSAASN