Gene Smed_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4011 
Symbol 
ID5318291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp466766 
End bp468292 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content61% 
IMG OID640775819 
ProductABC transporter related 
Protein accessionYP_001312752 
Protein GI150376156 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGAA GGCAGCTGCT TCTAATGGAT GGCATTCACA AGCGTTTCGG GCCGCTCATC 
GTAAGCGCCG GCGTCACCCT CGATATCGAG GAGAACGAAA TTCACGCCCT TCTCGGTGAA
AACGGTGCCG GCAAGACCGT TTTGATGAGC ATACTCTGCG GCGTTCTCTC GCCCGATGAA
GGGCAGATCA TTTTCAAGGG CCAGGCACTC AAGCCCGGCT CCCCGCGCCA AGCCATCAAG
CATCGCATCG GCATGGTCCA CCAGCACTTC ATGCTGGTCC CCAACCTGAC GGTCGCCGAG
AACTACGTTC TGGGACAGGG CTCGCCGTTC AGCGTGATCC GCGACATGAA GACCGTTCAT
GCCCGCATCG TCGAGCTCAG CGACCGCTAC GGTCTCGGCG TGAAGCCCGA CGCGCTCATC
TCCGACCTTT CGGTGGGCGA ACAGCAGCGC GTCGAAATTC TCAAGGTGCT CTACCACGGC
GTTGAACTGC TGATCCTCGA CGAGCCGACG GCGGTCCTGA CGCCTCAGGA GACCGATCGG
CTCCTGCATT TGCTTCGCCG CCTCGTCGAG GACGGGAAGA CGGTCATCTT CATTTCGCAC
AAACTGGACG AGGTGATACG CGTCAGCGAC CGCGTCAGCG TGATGCGCGA CGCCAAGGTT
GTCCGCACCG TGAAAACCTC CGAAACCAAT GCGCGCGAAC TCGCGCGCAT GATGGTCGGC
AGAGACGTTC TGATGGATTT GCCGCGGGCA CCGATCGAGC CCGGTCGCGT CGTGCTCTCG
GTCGAGCATC TGAACTGCGA CGGTGAATCC GGGCTTCCGG CACTCCACGG GGTGTCCTTC
CAGGTCCGGG CCAACGAAAT CGTCGGAATC GCGGGCGTCT CCGGCAATGG CCAGAGTGAG
CTGGCTCTGG CCTTGACGGG GCTCCTTCCG ATTGCTTCGG GCAGCGTCAC GCTCGAAGGG
GCCGAACTCG TGGGCCGATC GTCCCACGAC ATCAATCAGA TGCCCATTGC GCATATCCCC
GAAGATCGTC ACCGGATGGG TATCGTGCTG CCGCTTCCGC TGACGGAAAA CGTGATACTT
CAGCGCTTCG ACAAGCCGCC GTTCAGCTCT CGAGGCATGC TCGACATGAG CGAAATCACG
AGGCAAACCC GTGATCTCAT GCGGCGCTTC CAGGTCAAGG CCTCCGGCCC GGGCGACCGC
ATCCAGAATC TGTCCGGCGG CAACCAGCAA AAGCTCGTCG TCGGACGCGA GCTGGACCGC
CGCTTCGATT TCCTGCTGAT CAACCAGCTT ACGCGCGGAA TCGACATCGG CGCGACCGAG
CTGGTGATGC ACAAGATCCT CGAGCAACGC AGCGCCGGCA AGGCCATTCT GCTGATATCC
ACGGAGCTTG AAGAGCTGTT TACGCTCTGC GATCGAATTC TCGTCATGTA TGAGGGCCGT
CTCGTCGGTG AAATGCCACC CGATCGCGGC CGCCTGGAGG AGATCGGACT GCTGATGGCC
GGCAAATCGC CGATGTCGGC GGCGTGA
 
Protein sequence
MSGRQLLLMD GIHKRFGPLI VSAGVTLDIE ENEIHALLGE NGAGKTVLMS ILCGVLSPDE 
GQIIFKGQAL KPGSPRQAIK HRIGMVHQHF MLVPNLTVAE NYVLGQGSPF SVIRDMKTVH
ARIVELSDRY GLGVKPDALI SDLSVGEQQR VEILKVLYHG VELLILDEPT AVLTPQETDR
LLHLLRRLVE DGKTVIFISH KLDEVIRVSD RVSVMRDAKV VRTVKTSETN ARELARMMVG
RDVLMDLPRA PIEPGRVVLS VEHLNCDGES GLPALHGVSF QVRANEIVGI AGVSGNGQSE
LALALTGLLP IASGSVTLEG AELVGRSSHD INQMPIAHIP EDRHRMGIVL PLPLTENVIL
QRFDKPPFSS RGMLDMSEIT RQTRDLMRRF QVKASGPGDR IQNLSGGNQQ KLVVGRELDR
RFDFLLINQL TRGIDIGATE LVMHKILEQR SAGKAILLIS TELEELFTLC DRILVMYEGR
LVGEMPPDRG RLEEIGLLMA GKSPMSAA