Gene Smed_5414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5414 
Symbol 
ID5319716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp378285 
End bp379172 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content61% 
IMG OID640777180 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001314112 
Protein GI150377517 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.42118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.729214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTG CCATGAAAGC TGACATTCTA AAGGGGCTGA AAAAGCCTCC TCTTGGACCG 
GTTTTGACCG CGTCCAAGAT CCTGCTGGCC AGCTATATAC TGCTCGCCAT TGCGGCACCG
GCGATCGCTC CGCAAAATCC CTACGATCCC TTGCAGATAT ATGGATGGGA AGCATCCTCC
CCGCCGGGAA CCATGGGCGG CGGCGGTTTT CGCTACCTGC TCGGTACCGA CGGGCTCGGC
AGAGATATCG TCAGCACCAT TCTTTACGGC CTGCGGATCA GCCTCATGGT GTCGGTCACG
AGCGCCGCGA TCGCCGCCCT GATCGGGCTT ACCGCCGGTG TCAGTGCCGC CTATTTCGGC
AAGTGGGTCG ACGCCGCGAT CATGCGGCTC GTCGACCTTC AGCTCAGCCT GCCGACCATA
CTCATCGCCC TGATCGCCAT TGTGACGCTT GGGCCGGGCA TCGACCGGAT CATTCTGGCC
CTGATCATCG CGCAATGGGC AACCTATGCG CGGATCGCGC GCAGCGTGGC GCTAAGCGAG
ACGAACAAGT CCTATATCGA CGCCGCCCGA CTGATGCGGC TGCCAACCGC ACGCATCATC
TTTCGCCACC TGCTGCCAAA CAGCATCGCA CCGGCCGTGA CCTTGATTCC GATCGAAGTC
GGTCATGCCG TCGCACTCGA AGCGACCTTG TCCTTTCTCG GGCTTGGCGT GCCGATCGAC
AAGCCGTCGC TCGGCTCCGC GGTTGCAAAC GGCTTTCAGT ATCTGCTGAC CGGCCAGTAC
TGGATCAGCC TTTTTCCTGG CCTCGCTCTG TTCGGCCTGA TCGCAAGCAT CAATCTCGTC
GGCGAAGACG TCCGGCGCAG TCTCGATCCA AGGAACCATC CGGCATGA
 
Protein sequence
MKPAMKADIL KGLKKPPLGP VLTASKILLA SYILLAIAAP AIAPQNPYDP LQIYGWEASS 
PPGTMGGGGF RYLLGTDGLG RDIVSTILYG LRISLMVSVT SAAIAALIGL TAGVSAAYFG
KWVDAAIMRL VDLQLSLPTI LIALIAIVTL GPGIDRIILA LIIAQWATYA RIARSVALSE
TNKSYIDAAR LMRLPTARII FRHLLPNSIA PAVTLIPIEV GHAVALEATL SFLGLGVPID
KPSLGSAVAN GFQYLLTGQY WISLFPGLAL FGLIASINLV GEDVRRSLDP RNHPA