Gene Smed_5065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5065 
Symbol 
ID5319367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp11416 
End bp12384 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content65% 
IMG OID640776845 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001313777 
Protein GI150377182 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCT CCGATTTCAC GGCGTCACTC ACGCCACCCA TCCCTGCCAC CACGAGCGCG 
CCACGCAGCG CGATCTCGGA ACTCCTGCAC GACAAGGCCG CCGCCATCGG CCTTGCCTTC
ATCCTACTCA TTGTGTTCCT GGCCCTGTTC GCTCCCCTTG TCGCACCGTA CGATCCGGCC
GCGCAGTCGA TCATGGCCCG GCTGAAGCCG CCCGTCTGGA TGGCGCGCGG CACGTGGGAA
CACCTGCTCG GAACTGATAA TCTCGGCCGC GACGTCCTGT CCCGCATCAT CTGGGGCGCA
AGGGCGACGC TGACGATCGG CGCCGTCACC TGTCTTCTGG CGGCGACGCT CGGGACAGTC
GTCGGCCTAT GGGCCGGATT CATCGGTGGG CGCACGGATT CGGTCCTGAT GCGTCTGGTC
GACATCCAGG TCAGCTTCCC CGGAATCCTT CTCATCCTGC TCGTCGTCGC GGTTCTCGGG
CCCGGCGTCT GGACGCTTGT TGCGGTCCTG TCGGTGACGA ACTGGATGGT CTATGCCCGG
CTGGTGCGCG GCATTGTCTC GTCGACCCGT CAGACCCCTT ATGTCGAGGC CGCTGAAGTG
ATCGGCTGTC GCCCCGCACG GGTGATCTTC AGGCATATCC TGCCGAACAT CGTCTCTCCG
CTTTTGACGC TTGCGATCCT GGAGTTCACC AATATCGTGC TGGCGGAAGC GGCTGTGTCG
TTCCTCGGCT TCGGCGTTCA GCCACCGGCG ACCTCGTGGG GCCTCGACGT CGCCTCGGGA
CGCGATTACC TGTTCATCGC GTGGTGGCTC GTGACTTTTC CCGGCCTTGC GATCGTCGTG
ACAGTGCTGT CCATCAATCT TTTTGCCAAC TGGCTGAGGG TGACGACCGA TCCCGAGGAA
CGCGAGAAGC GTTTTGCGCG CGCCGAGACG GCGAAGCGGC GCCGCGCCCG GCGGAGGGTG
GGTGCATGA
 
Protein sequence
MAASDFTASL TPPIPATTSA PRSAISELLH DKAAAIGLAF ILLIVFLALF APLVAPYDPA 
AQSIMARLKP PVWMARGTWE HLLGTDNLGR DVLSRIIWGA RATLTIGAVT CLLAATLGTV
VGLWAGFIGG RTDSVLMRLV DIQVSFPGIL LILLVVAVLG PGVWTLVAVL SVTNWMVYAR
LVRGIVSSTR QTPYVEAAEV IGCRPARVIF RHILPNIVSP LLTLAILEFT NIVLAEAAVS
FLGFGVQPPA TSWGLDVASG RDYLFIAWWL VTFPGLAIVV TVLSINLFAN WLRVTTDPEE
REKRFARAET AKRRRARRRV GA