Gene Smed_4541 details

Gene Information

Locus tag: Smed_4541
Symbol:
ID: 5319042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1028728
End bp: 1029831
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 63%
IMG OID: 640776342
Product: sulfate ABC transporter, ATPase subunit
Protein accession: YP_001313274
Protein GI: 150376678
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component
TIGRFAM ID: [TIGR00968] sulfate ABC transporter, ATP-binding protein
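
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1028728, 1029831)
print(length, protein_length(length))  # prints: 1104 367
```

Both values agree with the Gene Length and Protein Length fields of the record.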


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.68861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGTCC GCGTCCAAAA CATTCGCAAG GAATTCGCCC GCTTTCCCGC GCTCGATAAC 
GTGTCCCTCG ACATCCGTTC CGGCGAATTG ATAGCCCTGC TCGGTCCCTC AGGCTCCGGA
AAGACCACAC TGCTCAGGCT GGTCGCCGGG CTCGAAAGCC CGACGGGAGG CACGATATTC
TTCGGTGATG ACGACGCTTC GAAAAAGACC GTGCAGGAGC GGAACATCGG CTTCGTCTTC
CAGCATTACG CACTTTTCCG GCACATGACA GTGCTCGACA ACGTCGCTTT CGGGCTGAAG
GTCCGCACGG CCAAACGGCG TCCCCCGGCC GCTGAAATCC GCCGCAGGGC GCTCGATCTG
CTCGATCTCG TGCAGCTTTC CGGCTTGGAG AAACGCTATC CGGCCCAGCT CTCCGGCGGT
CAGCGTCAGC GCGTGGCACT CGCCCGGGCC ATGGCTGTCG AACCCAATGT TCTGCTTCTC
GACGAGCCCT TCGGCGCTCT CGATGCGCAG GTGCGCAAGG AATTGCGGCG CTGGTTGCGG
GAAATCCACG ACCGCACCGG TCACACCACG ATCTTCGTCA CCCACGACCA GGAGGAAGCA
CTCGAGCTTG CCGACCGCGT CGTGGTGATG AGCAAGGGGA CGATCGAGCA GGTCGGCTCG
CCCGACGAGA TCTATGACCA TCCTGTCTCG CCTTTCGTCT ATGGTTTCAT CGGACAGTCC
AATTGCCTCG ATGTCACGCT TGCCAATGGC GAGATCTGGC TCGAGGACCG GCCGATCGGC
CTGCGCGCCG CGAACGAACC GGACGGTCCA GCAACCCTCT ATTTCCGGCC GCACGACGTC
GAACTTATTG ACGGCTGCGG CGGCTGCCTC GCCGGGCTGG TCACGGCCAG CCGACGCGTG
GCAGGCACGC GGCATCTCGA ACTCGAACTC GGACGCACGC ATCCGCGGGT GGAGATCGAA
CTTCCGCCGG AACGCGCCGC CTTTGCCGAC CACACGCGAA TTGCCTTCCG GCCAACGCGA
TGGAAGCTGT TCCAAAAGGG GGAACGCCGG ATAACGGCAC GGGAAGAAGT GGTGGTGCCC
GAGCTTGAAG CTACCGGCAC CTGA
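
A GC percentage such as the one reported in the record can be recomputed from a sequence listed in 10-base blocks like the one above. This is a minimal Python sketch under stated assumptions: `gc_content` is a hypothetical helper, whitespace between blocks is ignored, and only the letters G and C are counted as GC.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in seq; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example input: the first two 10-base blocks of the gene sequence.
print(gc_content("ATGGAAGTCC GCGTCCAAAA"))
```

Passing the full gene sequence instead of the two-block example gives a value that can be compared with the GC content field.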
 
Protein sequence
MEVRVQNIRK EFARFPALDN VSLDIRSGEL IALLGPSGSG KTTLLRLVAG LESPTGGTIF 
FGDDDASKKT VQERNIGFVF QHYALFRHMT VLDNVAFGLK VRTAKRRPPA AEIRRRALDL
LDLVQLSGLE KRYPAQLSGG QRQRVALARA MAVEPNVLLL DEPFGALDAQ VRKELRRWLR
EIHDRTGHTT IFVTHDQEEA LELADRVVVM SKGTIEQVGS PDEIYDHPVS PFVYGFIGQS
NCLDVTLANG EIWLEDRPIG LRAANEPDGP ATLYFRPHDV ELIDGCGGCL AGLVTASRRV
AGTRHLELEL GRTHPRVEIE LPPERAAFAD HTRIAFRPTR WKLFQKGERR ITAREEVVVP
ELEATGT
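
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship in Python. It is an assumption-laden example rather than the database's own method: `translate` and `CODON_TABLE` are hypothetical names, the input is assumed to be in frame and to start with ATG, and the alternative start codons allowed by table 11 are not handled (its codon-to-amino-acid assignments are the same as in the standard code).

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 24 bases of the gene sequence in this record.
print(translate("ATGGAAGTCC GCGTCCAAAA C"))     # prints: MEVRVQN
print(translate("GAGCTTGAAG CTACCGGCAC CTGA"))  # prints: ELEATGT
```

The two outputs match the first and last seven residues of the protein sequence listed in the record.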