Gene Smed_4592 details

Gene Information

Locus tag: Smed_4592
Symbol:
ID: 5319008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1089560
End bp: 1091449
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 60%
IMG OID: 640776393
Product: ABC transporter related
Protein accession: YP_001313325
Protein GI: 150376729
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
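
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1089560, 1091449)
print(length, protein_length(length))  # prints: 1890 629
```

Both values agree with the Gene Length and Protein Length fields of this record.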


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.390249
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.807513
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCAG GAGCGGACCT GCTGCGAGTT GAGGGTCTGC GGATCACATT CTCAGTGCTG 
GGCGGCGGGG TCGAAGCCGT CCGCGGAGCG AGTTTCAGAA TTCTGCCCGG ACGGGTGACC
GCGCTCGTCG GTGAATCCGG GTCCGGCAAG TCGGCGATCA GCCAGGCGAT CATGGGCATC
CTGCCGAGCG TAGCAAGCGT CACGGGCCGG GTGGTGTTCA ACGATCCGAC CGGGAACAAG
CCCGTCGACC TTCTTTCGCT GGACAGCGGC GGGCGCGAAA TCCAAGATAT CCGCGGCGCG
CGCATCAGCA AAATCTTTCA GGAGCCGATG ACATCTCTGT CGCCGCTCCA CACGATCGGT
AACCAGATTT CCGAAGTGCT GAAGATTCAT ACGGATATCG ACAAGGCTGG CCGCCGGGAG
CGGACGGAGC AACTGCTCGG CTATGTCGGC TTCTCCGACC CGAAGCGGGC ATATGACATG
TACCCGTTCG AACTCTCAGG CGGCATGCGT CAGCGCGCGA TGATTGCCAT GGCGCTGATC
TGCAGTCCGG CGCTGCTCAT CGCGGATGAA CCGACAACTG CGCTCGATGT TACGGTTCAG
GCGCAGATAT TACAGCTCCT GCGCGGGCTT CAAAGCAAGC TCAACATGGC CATGCTTTTG
ATCACCCACG ATCTTGGTGT CGTAGCCAAC ATGGCCGACG AGGTGGTCGT TATCTATCAT
GGCGAGATCG TCGAAGCAGG GCCCGTGGAT GCAATCTTCC GCAATCCCCA GCATCCTTAT
CTCAAGGGTT TAATGGCGGC CGTGCCGCAC TTCGACATGA AACCCGGCGA ACGGCTCAAA
GCTCTGCGCG AGGTGCCGGT CCAGGCCGGC GGGATTATCG GCGATCGCGA TGTCAGGAAA
GCCGGAGGGC CAAATGTGCT CGTGTCCGTC CGCAACCTGT CGAAGACGTT CTCCACGCGC
AGTTCAGGCT GGTTCGGCAG CGGCTCCGCT TTTCGGCACC GTGCAGTGGA CAATGTCAGC
TTCGACATTC GCCGCGGTGA ATGCCTTGGG CTTGTCGGTG AAAGCGGATG TGGCAAGACC
ACTATAAGCA AAATCCTGAT GCGAGCCGTA ACGCCGGATA AGGGAACGAT CACGTTCGAC
GATGGTGAGG GCGCGCTCGA CGTCCTGAAG CTCGATGGCG CGGAACTCAA GGCGCTGCGC
GCGAAGATCC AGATGGTCTT CCAGGACCCT GTTTCTTCGC TGTCGCCGCG CATGACGGTC
AAGAATATTC TCAGCGAGCC GCTGGAAATC CATGGCCGCG GAACGCCGAA ATCGCGCGTC
GAAACCGTTC GCTCGCTCCT GCAGGCGGTA GGCCTCGACC AACGCTTCAT CAATCGTTAT
CCCCACAGCT TCTCCGGCGG CCAAAGACAG CGAATCGGCA TCGCCCGGGC ACTGGCGCTC
GTGCCGCAGC TGCTGATCTG CGACGAACCC GTTTCCGCGC TCGATGTTTC GGTACAGGCC
CAGATCCTCA ATCTTCTCAA GGATCTGCAA AAAGAACTCG GCCTTACGAT GCTGTTCATC
TCGCACAATC TCGCGGTCGT CGACTACATG GCCGACCGGA TCGCCGTCAT GTATGCGGGC
CGGATCGTGG AGCTGGCACC GCGTGAAGTC CTGATGAGCG ACCCGATCCA CCCCTATACC
AAATCGCTCC TCGCTGCCGT TCCCTATCCT GATCTCGATC GGAAGCTCGA TTTTGATTTG
CTTCAGGTGA GCGGCGGGTC GGACCAGCAG CGCTGGGGCG TGCAGTTCGC CGATGGTGGC
GATGACGAGG CTCTTGTTCC CGCCGATCTC GGCGGCGGCC ATTTTGTCCT TGCCCGCAAA
TCGGTGGATG CCAGGGAGTT ACGCCCATGA
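
A GC content figure like the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here, in whitespace-separated blocks of ten bases; `gc_content` is a hypothetical helper name, and only the first line of the gene sequence is used as sample input.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste all lines for the full gene.
first_line = "ATGACGTCAG GAGCGGACCT GCTGCGAGTT GAGGGTCTGC GGATCACATT CTCAGTGCTG"
print(f"{gc_content(first_line):.1%}")
```

Running the function over the complete 1890 bp sequence gives the gene-wide value to compare with the GC content field.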
 
Protein sequence
MTSGADLLRV EGLRITFSVL GGGVEAVRGA SFRILPGRVT ALVGESGSGK SAISQAIMGI 
LPSVASVTGR VVFNDPTGNK PVDLLSLDSG GREIQDIRGA RISKIFQEPM TSLSPLHTIG
NQISEVLKIH TDIDKAGRRE RTEQLLGYVG FSDPKRAYDM YPFELSGGMR QRAMIAMALI
CSPALLIADE PTTALDVTVQ AQILQLLRGL QSKLNMAMLL ITHDLGVVAN MADEVVVIYH
GEIVEAGPVD AIFRNPQHPY LKGLMAAVPH FDMKPGERLK ALREVPVQAG GIIGDRDVRK
AGGPNVLVSV RNLSKTFSTR SSGWFGSGSA FRHRAVDNVS FDIRRGECLG LVGESGCGKT
TISKILMRAV TPDKGTITFD DGEGALDVLK LDGAELKALR AKIQMVFQDP VSSLSPRMTV
KNILSEPLEI HGRGTPKSRV ETVRSLLQAV GLDQRFINRY PHSFSGGQRQ RIGIARALAL
VPQLLICDEP VSALDVSVQA QILNLLKDLQ KELGLTMLFI SHNLAVVDYM ADRIAVMYAG
RIVELAPREV LMSDPIHPYT KSLLAAVPYP DLDRKLDFDL LQVSGGSDQQ RWGVQFADGG
DDEALVPADL GGGHFVLARK SVDARELRP
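
The protein sequence can be reproduced from the gene sequence by translating codons in frame. The sketch below builds the codon table from the compact NCBI amino-acid string; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. This simplified version does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGACGTCAG GAGCGGACCT GCTGCGAGTT GAGGGTCTGC GGATCACATT CTCAGTGCTG"
print(translate(first_line))  # prints: MTSGADLLRVEGLRITFSVL
```

The output matches the first 20 residues of the protein sequence above. For production use, an established library such as Biopython is a safer choice than a hand-built table.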