Gene Smed_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1406 
Symbol 
ID5322257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1485961 
End bp1487607 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content63% 
IMG OID640790348 
Productsodium-dependent inorganic phosphate (Pi) transporter 
Protein accessionYP_001327087 
Protein GI150396620 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter
[TIGR01013] Phosphate:Na+ Symporter (PNaS) Family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.677902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.933566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCCA CCATCGTGAT GGTCAATCTC TTCGGCGCAG TCGCGCTGTT GCTGTTCGGC 
CTGTCGCTCG TCAAGGAAGG GGTCACGCGT GCGTTCGGAG CGCGGCTTCG ATCAGGCCTT
GCGCGCGGCA CGGACGGTCC CTTACGGGCT TTTGCGACGG GGCTCGTCGC AACCCTCGGC
CTGCAGAGTT CGACGGCGAC GGCCCTCATG ACGGCCTCTT TCGTGGAGAA GGGGCTGATC
CGCTCGCAGA TGGCGCAGAT CGTGTTGCTC GGCGCCAATG TCGGCACGGC GATGACAGCG
TGGATCGTCG CCCTCGGCAT CGGCTGGCTA TCGCCGCTGC TCATCCTCGC GGGCGTTGCG
ACACACCGGC TGAGCCGGTC GAGCGCCCGC CAGGGCGCGG GCTACGCGCT TGTGGGCATT
GGCGTGATGC TTCTTTCGCT CGAGTTGCTC GGCGCCGCGA CCGAGCCGAT GCGCCAGTCC
CAGGCTCTGG CAGCGTTTCT CGGCATGCTC GATAGCGCGT TGCCCGTGGC GATGATCATA
TCGGCCGGTC TCGCATTTGC GTCATCCTCC AGTCTTGCCG TCGTAATGCT GGTGCTGTCG
CTTTCCTCTG CCGGGGTGCT TTCACCCGGA CTGACCGTCG CTCTGGTGCT CGGGGCCAAT
CTCGGGGGGG CGATCCCGCC GGTTATCACC ACGCTCGGTT CGGCGCCTGC CGCTCGACGA
GTCACGCTCG GCAATCTCGT GGTGCGCGGA ATCGGCTGCC TGGTCGCCCT TCCGGCCGTT
AGCGTCAGTG CCGATTTTCT GCAGAGTCTG CCGCTGCCGC GACAGAACCT GCCCGTCGAC
ATTCACCTGC TTTTCAATCT CCTTTTGGCC ATCGTCGCCT GGCCATTCGC CGGCTTGCTC
GCCCGCACCA TGAATCGGTT CGTCCCGGAC GACGAAGCGG GGGAGTCCGG GCCGCACTTT
CTCGACGAAG CGACGCTCGA TACGCCGGTG ATGGCCCTTT CCGGCGCGAC GCGAGAGGTC
CTGAGAGTGG GCGATCTGAT CGAGGTCATG CTGATGCGTG CCTCCGAGGC ATTCAGCAGG
AATGACGTCA ACGATCTCGG CGACATTGAG AAATACGAAC GGCAGGTCGA CCTGTTGCAG
CAGGAAGTGA AAATCTTCCT TTCGCGGCTG GGGCGCGATG GTCTCAATGA AGAGGATGGC
CGCCGTTCGA TCGTGATCAT CGACTATGCC ATCAACCTGG AGCACATGGG CGACATCATC
GAAAAAGGCC TTTGCGAGCA GATCGCCAAA AAGGTGAGGG GGGGGCTGCG ATTCTCCGAG
GAAGGCTATC AGGAGCTGAA GGGCCTTTTC GATCTCACCA TCGATAATCT GCGCATCGCC
CAGACCATAT TCGTCACGCG CAATGCCGAC CTTGCGCGCC ATCTCATCGA GGTAAAGGTA
AACGTGCGTC ATATGGAGAA GCGCTCGGCC GAGCGGCATC TGGAGCGGTT GCGCGGCGGT
CTTCTCGAAA GTCTTCAGAC CAGTTCGCTG CACCTCGACA TGCTGCGTGA CCTGAAGCGC
ATCAACGCGC ATATCGCATC CGTCGCCTAT CCCATCCTCG AGGAAAGCGG CCTTCTGACG
GAAAGCCGGC TGCGGCCGCC GGGTTGA
 
Protein sequence
MQSTIVMVNL FGAVALLLFG LSLVKEGVTR AFGARLRSGL ARGTDGPLRA FATGLVATLG 
LQSSTATALM TASFVEKGLI RSQMAQIVLL GANVGTAMTA WIVALGIGWL SPLLILAGVA
THRLSRSSAR QGAGYALVGI GVMLLSLELL GAATEPMRQS QALAAFLGML DSALPVAMII
SAGLAFASSS SLAVVMLVLS LSSAGVLSPG LTVALVLGAN LGGAIPPVIT TLGSAPAARR
VTLGNLVVRG IGCLVALPAV SVSADFLQSL PLPRQNLPVD IHLLFNLLLA IVAWPFAGLL
ARTMNRFVPD DEAGESGPHF LDEATLDTPV MALSGATREV LRVGDLIEVM LMRASEAFSR
NDVNDLGDIE KYERQVDLLQ QEVKIFLSRL GRDGLNEEDG RRSIVIIDYA INLEHMGDII
EKGLCEQIAK KVRGGLRFSE EGYQELKGLF DLTIDNLRIA QTIFVTRNAD LARHLIEVKV
NVRHMEKRSA ERHLERLRGG LLESLQTSSL HLDMLRDLKR INAHIASVAY PILEESGLLT
ESRLRPPG