Gene Smed_5558 details


Gene Information

Locus tag: Smed_5558
Symbol:
ID: 5319860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 525396
End bp: 526571
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 61%
IMG OID: 640777307
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001314239
Protein GI: 150377644
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
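
The coordinate and length fields in this record are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon (the record lists the gene as not spliced).

```python
# Values copied from the Gene Information fields of this record.
START_BP = 525396
END_BP = 526571


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, "bp")                  # Gene Length field
    print(protein_length(n_bp), "aa")  # Protein Length field
```

Under these assumptions the computed values match the Gene Length (1176 bp) and Protein Length (391 aa) fields.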


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.203602
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0670857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATCA ATACACAGTC TGTTTTCGTC ATGGAAACCG GTCCTGCCGT CAGCCCCCTG 
AATTCGGGTG GCAAGGTGGT GACCTTGGAA GCCGCCGGCC AGTGGACGCT CATTTGCAAG
CGCTTTCTCC GTCACAAGAT AGCGGCCTTC GCCGCGGGCA TCATTCTGGT GCTCTATCTG
ATCGGTGCTT TCGCCGAGTT CTTAGCACCC GGTCTGCCGA ACGCGTCGAG GCCGCAATAT
ACATTTGCGC CGCCTCAGAG GATCTCGCTG TTTACACATA CGGACAATGC CGGTACGCGA
TTCCTGCCCC ACGTCAAGGG TTACCGGATC GAGATCGAGC CCAAGGCCCT TCGCCGCAGC
TTTGTCGTAG ATGAGGCCAA AATCATTCCG ATCGGCTTCT TTGTCAAAGG CGCCCCATAC
AAGCTGTGGG GCGTCATCCC GATGGAAACC CATCTTGTCG GACCCCTAGA CGCACGCGAG
CCGATGTATC TGCTGGGCTC CGATCGCCTC GGGCGGGATC TCATGAGCCG CCTGATCTAC
GGAACCCGCG TCTCCATGTC GATCGGCCTG GTGGGGGTGG CGATCTCGCT CACGCTCGGC
GTCGTCATGG GAGCAGTTTC GGGTTATTAC GGCGGCTGGG TCGACACCCT GATCCAGAGA
GTCATCGAGA TCGTCAGCGC CATGCCGACC ATCCCTCTCT GGCTGGGACT GGCAGCCGCC
ATCCCGCTCA GCTGGTCTCC GCTGTCGGTG TACTTCATGG TCACGCTCAT CGTTTCGCTG
CTTAGCTGGA CGGGACTGGC GCGAGAGGTG CGCGGACGAT TTTTCGCGTT GCGCGGCGAT
GATTTCGTCA CGGCCGCAAG GCTTGACGGA GCCGGTGAGC GCCGGATCAT TTTCCGTCAC
ATCCTGCCGT CGCTCACCAG CCACATACTC GCCGTCGTGA CGCTCGCAAT CCCGACGATG
ATCGTGGCGG AAACCTCGCT GTCATTCCTC GGCGTCGGCC TGAAGCCGCC GGTCGTCAGC
TGGGGCGTGC TGCTGCAGGA TGCCCAGAAC GTGCGCACCG TCGCAACCGC GCCCTGGCTG
CTGATCTGGC CGAGCCTTGC CGTGGTTCTT GCTGTGCTCT CCTTCAATTT CTTCGGCGAC
GGACTGCGCG ATGCAGCCGA TCCCTACGAT ACTTGA
 
Protein sequence
MSINTQSVFV METGPAVSPL NSGGKVVTLE AAGQWTLICK RFLRHKIAAF AAGIILVLYL 
IGAFAEFLAP GLPNASRPQY TFAPPQRISL FTHTDNAGTR FLPHVKGYRI EIEPKALRRS
FVVDEAKIIP IGFFVKGAPY KLWGVIPMET HLVGPLDARE PMYLLGSDRL GRDLMSRLIY
GTRVSMSIGL VGVAISLTLG VVMGAVSGYY GGWVDTLIQR VIEIVSAMPT IPLWLGLAAA
IPLSWSPLSV YFMVTLIVSL LSWTGLAREV RGRFFALRGD DFVTAARLDG AGERRIIFRH
ILPSLTSHIL AVVTLAIPTM IVAETSLSFL GVGLKPPVVS WGVLLQDAQN VRTVATAPWL
LIWPSLAVVL AVLSFNFFGD GLRDAADPYD T
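
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that mapping for a fragment of the record. It is an illustration, not the database's own pipeline: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in this
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGTCGATCA ATACACAGTC TGTTTTCGTC"))
    # Final line of the gene sequence above (codon-aligned).
    print(translate("GGACTGCGCG ATGCAGCCGA TCCCTACGAT ACTTGA"))
```

The first call yields MSINTQSVFV, the opening ten residues of the protein sequence, and the second yields GLRDAADPYDT, its closing residues before the TGA stop codon.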