Gene Smed_5040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5040 
Symbol 
ID5319089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1560111 
End bp1561784 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content63% 
IMG OID640776821 
ProductABC transporter related 
Protein accessionYP_001313753 
Protein GI150377157 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTGG AACACGTGAC GACACCCCTG CTGCGGGTGC AGAACCTTGG CCTTCGCCAT 
GTCTCGGGCT CGGCCACAAC ACCAATCCTG TCCGACGTGA GTTTCGAACT CGGGCGAGGC
GAGATCCTCG GCATCATCGG CGAGTCCGGC GCCGGCAAAT CGACCGTCGG CAATGCCATC
CTCGGGCTGC TTTCGCCCGA GTTCCAGCAA ACCTCCGGAA CAATCGAGTT CGATGGCAAG
GCGATCGACG GCATGACTGC CGACGAGCGC CAGACGCTTC GAGGACGGCG GATATCGGCG
ATCTTCCAGG ACCACACCGC CTCGCTCGAT CCGCTGATGA GCGTCGGCGC CCAAGTCGAA
GAAACCTGCC TTACGCTCGA CAGCTCGCTT TCCAAGCGCG AGGCGCGTGC TCGCGCGATT
GATTTGCTTG CCCGTGTCGG CATTCCCGAA CCGGAGCGGC GATACCGCAA TTACCCGCAC
CAGTTCTCCG GCGGGCAGAG GCAACGCGTC GTCATCGCCA TTGCGCTTGC CGGCTCTCCC
GACATCATCA TCGCCGACGA GCCGACGTCG GCGCTCGATG CGACGGTTCA GAAGCAGGTT
CTGGAACTTC TAAAGATACT TGTCGACGAA ACCGGTGTCT CGATCATCCT CGTCACCCAC
GACATGGGCG TGATCTCCGA AATCACCAAC CGGGTTCTCG TCATGCGAAA GGGCCAGGTG
GTCGAAGCGG ACCGCACAGC CACCATTCTC GATCAGCCCC GCCACGACTA TACGAAGAAG
CTGCTTGCCG CCGTCCCAAG GCTTCGCATC CCGACACGCA CTGTCAAGGC GGAAGATGAT
GGCGGCCAAG CAGGCTCTTC CGCCATAGAC GGCGACCAGA ACCCGCTTCT GGTCGCGGAA
GGCCTTTCGA AACAGTTCGC GCCGCAGGGT TTTGCTTGGG GCATCGGCCG GGGCAAGCCG
AGATTCGGCC TGCGCGACGT CGGCATCCGG CTGCCGCGCG GGTCGATCAC CGGGATCGTC
GGCGAGAGCG GCAGTGGAAA GACGACCTTC GGCCGCATCC TCGCGGGCCT CGATACGGCA
CCGACCGGCA GGATCACGAT CGGGGAACAC GCCTTCAACG TTTCGCGAAG TGGCCGCCGC
AGCGGTCTTC TCGGCCGCGT CCAGATGATC TTCCAGGATC CATCCGTCTC CCTCAACCCC
CGCATGACCA TCGGCGAGAC GCTCGATGAA AGCATCCGCT TCGGCGGGAG GACCGGCTCC
AGCAAAGAGC CGGCGGACCT GGCCGCAATG ATGGATCGGC TTGGTCTGCC GCGAAGCCTG
CTCGCCCGCC ATCCGCATCA GCTCTCCGGT GGCCAAAAGC AGCGTGTCTG TATCGCTCGC
GCCCTGTTGG CACGGCCTGA GATCATCGTC GCGGACGAGC CGACGTCCGC GCTCGACGTC
TCGGTTCAGG CCGAGATCGT CGGTCTCCTG AAGGACACGA TCGCGGAGCG GGCGATGACA
ATGGTCTTCA TCTCCCATGA CCTCGCGATC GTTCAAGCGA TGTGCAGCTC CGTCTTCATC
TTCAAGGACG GCCGGATCGA GGATTTCGGG CCCACCGAGT TCATCTTCTC GCGATCCGAC
AATCCCTACA CCCGCGCCCT CATCAACGCC CGACCCCAGC GGTTCACATG CTGA
 
Protein sequence
MTVEHVTTPL LRVQNLGLRH VSGSATTPIL SDVSFELGRG EILGIIGESG AGKSTVGNAI 
LGLLSPEFQQ TSGTIEFDGK AIDGMTADER QTLRGRRISA IFQDHTASLD PLMSVGAQVE
ETCLTLDSSL SKREARARAI DLLARVGIPE PERRYRNYPH QFSGGQRQRV VIAIALAGSP
DIIIADEPTS ALDATVQKQV LELLKILVDE TGVSIILVTH DMGVISEITN RVLVMRKGQV
VEADRTATIL DQPRHDYTKK LLAAVPRLRI PTRTVKAEDD GGQAGSSAID GDQNPLLVAE
GLSKQFAPQG FAWGIGRGKP RFGLRDVGIR LPRGSITGIV GESGSGKTTF GRILAGLDTA
PTGRITIGEH AFNVSRSGRR SGLLGRVQMI FQDPSVSLNP RMTIGETLDE SIRFGGRTGS
SKEPADLAAM MDRLGLPRSL LARHPHQLSG GQKQRVCIAR ALLARPEIIV ADEPTSALDV
SVQAEIVGLL KDTIAERAMT MVFISHDLAI VQAMCSSVFI FKDGRIEDFG PTEFIFSRSD
NPYTRALINA RPQRFTC