Gene Smed_2117 details


Gene Information

Locus tag: Smed_2117
Symbol:
ID: 5322977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2180219
End bp: 2181868
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 61%
IMG OID: 640791055
Product: putative ABC transporter ATP-binding protein
Protein accession: YP_001327785
Protein GI: 150397318
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
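
The listed gene and protein lengths follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (which is consistent with the listed 1650 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(2180219, 2181868)
print(length, protein_length_aa(length))
```

With the values in this record the two functions give 1650 bp and 549 aa, matching the Gene Length and Protein Length fields.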


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.129596
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.211481
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTC AGTTCATCTA TCACATGGCC GGGCTCAACA AGGCCTATGG CAACAAGAAG 
GTCCTCGAGA ACATCCATCT CTCGTTCTAT CCGGAAGCGA AGATCGGCAT TCTCGGCCCG
AACGGGGCCG GTAAGTCGAC CGTGCTCCGG ATCATGGCGG GCCTCGATAC CGAATATACC
GGCGAGGCAT GGGTCGCCGA AGGTGCGAGG GTCGGCTATC TCGCACAGGA GCCTCAGCTC
GACGCTCAGA AGAACGTGCT CGAGAACGTG ATGGAAGGGG TCGCCGCCAA GAAGGCGATC
CTCGATCGCT ACAACGAGCT GATGATGAAT TATTCCGACG AGACCGCGGA CGAAGGCGCA
AGGCTCCAGG ATGTCATCGA TAGCCAGAAC CTATGGGATC TCGACAGCCA GGTGGAGATG
GCGATGGAAG CCTTGCGCTG CCCGCCGGCG GACGCGGATG TCGCCAATCT GTCCGGTGGC
GAAAAGCGCC GTGTCGCTCT TTGCAAGCTC CTCCTGTCGC AGCCCGAACT GCTTCTGCTC
GACGAACCGA CCAACCATCT CGATGCGGAA ACGATCGCCT GGCTCGAGAA GCATCTGCGC
GAATATCCGG GTGCCGTGCT GATGGTCACT CACGACCGCT ACTTCCTCGA CAACGTCACG
GGGTGGATTC TCGAGCTCGA CCGCGGCCGG GGAATTCCCT ACGAGGGCAA CTATTCCGCC
TATCTGCAGT CCAAATCCAA GCGCATGGCC CAGGAAGGGC GCGAAGAGGC TGCCCGCCAG
AAAGCGATCA GCCGCGAGCA GGAGTGGATC TCATCGAGCC CGAAGGCTCG CCAGGCGAAG
TCGAAGGCGC GTGTGCGCGC CTATGACGAG CTGGTCAAAG CGGCCGCGGA CCGGCGTCCC
GGGGACGCGC AGATCATCAT TCCCGTCGGC GAGCGCCTGG GACAGGTCGT CATCGAGGCG
GAGAATATTT CCAAGGGCTA CGACGACCAG TTGCTGATCG ACGGCCTGAC CTTCAAGCTG
CCGCCAGGCG GCATCGTCGG CGTCATCGGC CCGAACGGCG CTGGCAAGAC GACGCTCTTC
CGCATGATCA CCGGCCAGGA GCAGCCGGAC GGCGGTTCCA TCCGCATCGG CGACAGCGTG
CAGCTCGCCT ATGTCGACCA GAGCCGCGAT GCGCTCGATG CGAATAAGAC TGTCTTCGAA
GAAATTTCAG GCGGCAACGA CGTCATCAAG CTCGGCAAGC ACGAGGTCAA TGCGCGCGCC
TACTGCTCGG CCTTCAACTT CAAAGGCGGC GATCAGCAGC AGAAAGTCGG CACGCTTTCC
GGTGGCCAGC GCAACCGCGT GCACCTTGCA AAGATGCTGA AGTCCGGCGG TAACGTCGTG
CTGCTCGACG AACCGACCAA CGACCTCGAC ACGGAGACTC TGGCGGCGCT CGAGGATGCT
CTCGAGAACT TTGCGGGTTG CGCAGTGATC ATCAGCCACG ATCGCATGTT CCTCGACCGT
CTCGCCACCC ATATCCTCGC CTTCGAGGGC GACAGTCACG TCGAGTGGTT CGAAGGCAAC
TTCGAGGATT ACGAAAAGGA CAAGATCCGC CGTCTCGGTC CGGACTCGGT CAATCCCAAG
CGGGTAACCT ACAAGCGCCT GACGCGTTAA
 
Protein sequence
MARQFIYHMA GLNKAYGNKK VLENIHLSFY PEAKIGILGP NGAGKSTVLR IMAGLDTEYT 
GEAWVAEGAR VGYLAQEPQL DAQKNVLENV MEGVAAKKAI LDRYNELMMN YSDETADEGA
RLQDVIDSQN LWDLDSQVEM AMEALRCPPA DADVANLSGG EKRRVALCKL LLSQPELLLL
DEPTNHLDAE TIAWLEKHLR EYPGAVLMVT HDRYFLDNVT GWILELDRGR GIPYEGNYSA
YLQSKSKRMA QEGREEAARQ KAISREQEWI SSSPKARQAK SKARVRAYDE LVKAAADRRP
GDAQIIIPVG ERLGQVVIEA ENISKGYDDQ LLIDGLTFKL PPGGIVGVIG PNGAGKTTLF
RMITGQEQPD GGSIRIGDSV QLAYVDQSRD ALDANKTVFE EISGGNDVIK LGKHEVNARA
YCSAFNFKGG DQQQKVGTLS GGQRNRVHLA KMLKSGGNVV LLDEPTNDLD TETLAALEDA
LENFAGCAVI ISHDRMFLDR LATHILAFEG DSHVEWFEGN FEDYEKDKIR RLGPDSVNPK
RVTYKRLTR
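
The protein sequence is the translation of the gene sequence, and the GC content field is the fraction of G and C bases in it. A minimal sketch of both calculations follows, using only the standard library. It assumes the sequence is pasted in the space-separated 10-base blocks shown above. The amino-acid assignments of translation table 11 are the same as those of the standard code, so the standard codon table is used here; alternative start codons are not handled. All function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard codon table in TCAG order; translation table 11 uses the same
# amino-acid assignments (it differs only in the permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
first_line = "ATGGCACGTC AGTTCATCTA TCACATGGCC GGGCTCAACA AGGCCTATGG CAACAAGAAG"
last_line = "CGGGTAACCT ACAAGCGCCT GACGCGTTAA"
print(translate(first_line))
print(translate(last_line))
```

The two printed fragments correspond to the first 20 and the last 9 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` can be used to check the GC content and Protein sequence fields.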