Gene Smed_5320 details

Gene Information

Locus tag: Smed_5320
Symbol:
ID: 5319622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 277907
End bp: 278923
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 61%
IMG OID: 640777094
Product: putative ABC transporter
Protein accession: YP_001314026
Protein GI: 150377431
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
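The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 277907
end_bp = 278923

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1017
print(protein_length)  # 338
```

Both results match the reported Gene Length (1017 bp) and Protein Length (338 aa).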


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.493735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATTC GTAAGATGCT TCTGGCGTCT GTCGCCGTCG CATGTGCCGC AATGCCAGTC 
TCGGCACTTG CCGACACATC CGGCAAGAAG ATCGCCCTTT CCAACAACTA TGCCGGCAAC
TCCTGGCGGC AGGCCATGTT GACAAGCTGG GACAAGGTGA CGGGCGAGGC GGTCAAGGCC
GGTGTCGTCG CCGCCGCGGA CGCCTTCACC ACAGCCGAGA ACCAGGCAAC CGAGCAGGCA
GCCCAGATTC AGAACATGAT TCTGCAGGGC TATGACGCCA TCGTTCTCAA TGCCGCTTCT
CCGACAGCCC TGAACGGTGC TGTAAAAGAG GCGTGCGATG CGGGGATTAC GGTCGTCTCG
TTCGACGGCA TCGTCACCGA GCCCTGCGCC TGGCGCATCG CCGTCGATTT CAAGGAGATG
GGCCGCAGCC AGGTCGAATA TCTTTCGAAG AAGCTGCCTC AGGGTGGAAA CCTGCTGGAG
ATCCGCGGCC TTGCCGGCGT CTTTGTCGAT GACGAAATCT CGGCCGGCAT CCACGAGGGC
GTCAAGCAGT TCCCGCAGTT CAAGATTGCC GGATCGGTCC ATGGCGACTG GGCGCAGGAC
GTTGCCCAGA AGGCCGTCGC CGGCATTCTG CCGAGCCTGC CGGAAATAGC CGGCGTCGTC
ACGCAGGGCG GCGACGGGTA CGGTGCGGCA CAGGCAATCG CGGCAGCAAA GCGGCCAATG
CCTATCATCG TCATGGGAAA CCGCGAAGAT GAGCTGAAAT GGTGGCAGCA GCAGAAGGAA
GCCAATGGCT ATGAGACGAT GTCGGTCTCC ATTGCCCCCG GCGTGTCGAC GCTGGCCTTC
TGGGTCGCCC AGCAGATACT GGACGGCAAG GAAGTGAAAA AAGATCTGGT GGTGCCGTTC
CTCCGCATCG ATCAGGACAA TCTCGAGCAG AACCTCGCCA ATACCCAGGC CGGCGGTGTC
GCGAACGTTG AATATGCGCA GGAAGACGCG ATCAAGGTCA TCGAAGCGGC CAAGTAA
 
Protein sequence
MTIRKMLLAS VAVACAAMPV SALADTSGKK IALSNNYAGN SWRQAMLTSW DKVTGEAVKA 
GVVAAADAFT TAENQATEQA AQIQNMILQG YDAIVLNAAS PTALNGAVKE ACDAGITVVS
FDGIVTEPCA WRIAVDFKEM GRSQVEYLSK KLPQGGNLLE IRGLAGVFVD DEISAGIHEG
VKQFPQFKIA GSVHGDWAQD VAQKAVAGIL PSLPEIAGVV TQGGDGYGAA QAIAAAKRPM
PIIVMGNRED ELKWWQQQKE ANGYETMSVS IAPGVSTLAF WVAQQILDGK EVKKDLVVPF
LRIDQDNLEQ NLANTQAGGV ANVEYAQEDA IKVIEAAK
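
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
# Assumption: amino acid assignments of translation table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final line of the gene sequence, copied from the record.
first_bases = "ATGACAATTC GTAAGATGCT TCTGGCGTCT"
last_line = "GCGAACGTTG AATATGCGCA GGAAGACGCG ATCAAGGTCA TCGAAGCGGC CAAGTAA"

print(translate(first_bases))  # MTIRKMLLAS
print(translate(last_line))    # ANVEYAQEDAIKVIEAAK
```

The two outputs match the start and end of the listed protein sequence. The final line can be translated on its own because each full sequence line holds 60 bases, a multiple of three, so the reading frame is preserved across lines.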