Gene Smed_5336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5336 
Symbol 
ID5319638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp297247 
End bp298773 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content64% 
IMG OID640777109 
ProductABC transporter related 
Protein accessionYP_001314041 
Protein GI150377446 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.403197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAG CCGTTCTCGA CATCCGCAAC GTCAGCAAGC GCTTCGGCGA CAATCTCGCC 
AATGACGACA TATCGCTGAG TCTTGGCAAG GGCGAGATCG TGGCACTCCT CGGCGAAAAC
GGCGCCGGCA AGACCACCCT GATGAGCATT CTGTTCGGCC ATTACGTGCC GGATACCGGC
AAGGTGCTGG TCGGAGGTCG GGAACTGCCG TCGGGCAAAC CGCGCGCGGC GATCCGTGCC
GGGATCGGCA TGGTGCACCA GCATTTCTCC CTCGCCCCCA ACCTGACGGT GCTGGAAAAC
GTCATGGCCG GCACGGAGAG CCTTTGGCAT CTCCGATCCG GCACACCCGC GGCACGGCGA
AAGCTCAACG GCATTTGCCA GCGGTTCGGC CTGACGGTGG AGCCCGATGC GCGGGTCGGC
GACCTGTCAG TCGGCGAGCA GCAGCGCGTC GAGATCCTGA AGGCACTCTA CAACGATGCG
CGTATCCTGG TGCTCGACGA ACCCACGGCG GTGCTGACCA ATCTGGAGGC CGAGCGGCTG
TTCGCAACGC TGAAGGAGAT GGCACGCGAA GGTCTTTCGC TGATTTTCAT TTCGCACAAG
CTGGACGAGG TCATGGCCGC CGCCGACCGA ATTGTCGTGC TGCGCGGCGG TCGCAAGGTC
GCCGAGCGAC TGGCGGAAAA GACGAACAAG GCCGAGCTTG CCGAACTGAT GGTGGGCCGC
AAGTTATCCC GACCGGTGCG CGAGCCTTCC ACGCCCGGCG AGGAGGTGCT TAAGGCTGCC
GGCGTCAGCG TCATCATCGA TGGAGTGGAA AGACTGAAAT CGGTAGATTT CAGCCTTCGG
GCAGGAGAAG TCCTCGGAAT TATCGGTGTT TCCGGCAACG GCCAGACGAC ACTGGCGCAT
CTCCTGTCGG GTACGGTCGG GCGTGACAGG GGCGAGCTGC TGCTGTTCGG CCAGCCTGTC
GGCGACCTTA CTGTGGATGA GGCCGTCGGG GCCGGAATTG GCCGCATTCC GGAAGATCGC
AACGAGGAGG GAGCGATCGG CGAAATGGCC ATCTGGGAGA ACGCCGTTCT CGAGCGCCTG
CCGCGATTTT CACGCCATGG CCTCGTCGAT CGGCAGGCAG CCCAGGCATT TGCCGGAGAG
ATCATCGATG CCTTCGACGT GCGCGGCGGC AGGCCGCCGA CGCGGACGCG CCTGCTCTCG
GGCGGCAACA TGCAGAAACT CATCCTCGGG CGCAATCTGA TCGACCGGCC GCGCATCCTC
ATTGCAGCGC AGCCGGCGCG CGGGCTCGAC GAAGGAGCTG TCGTCGCGGT CCATGCGCGC
CTGCTCGAAG CGCGGCGCGC GGGAACCGCG GTGCTCCTGA TCTCGGAGGA CCTCGAAGAA
GTGATGGCGC TCGCCGATCG CATTCAGGCC ATTGTCAACG GCCGGCTTTC CCTTCCGATC
GCTGCCGAGA GTGCCGATGC CACGAAACTC GGCCTGATGA TGGCCGGCGA ATGGAATGAC
GAGCACGAGG TTCCCCATGC GCTTTGA
 
Protein sequence
MTEAVLDIRN VSKRFGDNLA NDDISLSLGK GEIVALLGEN GAGKTTLMSI LFGHYVPDTG 
KVLVGGRELP SGKPRAAIRA GIGMVHQHFS LAPNLTVLEN VMAGTESLWH LRSGTPAARR
KLNGICQRFG LTVEPDARVG DLSVGEQQRV EILKALYNDA RILVLDEPTA VLTNLEAERL
FATLKEMARE GLSLIFISHK LDEVMAAADR IVVLRGGRKV AERLAEKTNK AELAELMVGR
KLSRPVREPS TPGEEVLKAA GVSVIIDGVE RLKSVDFSLR AGEVLGIIGV SGNGQTTLAH
LLSGTVGRDR GELLLFGQPV GDLTVDEAVG AGIGRIPEDR NEEGAIGEMA IWENAVLERL
PRFSRHGLVD RQAAQAFAGE IIDAFDVRGG RPPTRTRLLS GGNMQKLILG RNLIDRPRIL
IAAQPARGLD EGAVVAVHAR LLEARRAGTA VLLISEDLEE VMALADRIQA IVNGRLSLPI
AAESADATKL GLMMAGEWND EHEVPHAL