Gene Smed_4217 details

Gene Information

Locus tag: Smed_4217
Symbol:
ID: 5319227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 700245
End bp: 701417
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 62%
IMG OID: 640776022
Product: extracellular ligand-binding receptor
Protein accession: YP_001312955
Protein GI: 150376359
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
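
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming inclusive 1-based coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(700245, 701417)
print(length_bp)                  # 1173
print(protein_length(length_bp))  # 390
```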


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGGA TCATTCTGGC CGCGCTCGCG GCCCTGGTAA CGAGCGCGGC GGCGCATGCC 
GATGCGATCA AGGTCGGCGT CGTTGGGCCA TTTTCAGGGC CTTTTGCGCT GCAGGGAAAG
AACTTCAAGG CGGGAATCGA CGCCTATATG GCGCTGAGCG GGGTCAAGGT CGGCGAAGAC
GATATCGAGA TCATCTATCG CGACGTACCG CAGGCGGATC CGGCCCAGTC CAAGGCGCTG
GCACAGGAAC TGGTTGTCAA GGAAGGCGTG CAGTATCTCG CCGGGTTCTA TTTCACGCCG
GATGCGATGG CCGTCACGCC GTTGCTCGAG CAGGCGAACG TGCCGCTGGT GATCATGAAT
GCCGCAACAT CGGCGATCGT CACCAAGAGC CCCCTTGTGG TGCGCACTTC CTTCACGCTT
TGGCAGACAT CCACGCCGAT CGCCAAGGTG GCGAGGGAAG CCGGCGTCTC GAAGATTATC
TCGGTCGTCA GCGACTACGG CCCGGGCATC GACGCGGAGA ACGCATTCAA GACGGGCTTC
GAAGCCGTGG GCGGCCAGGT CGTCGAGGCG ATCCGCATGC CGCTGTCGAC CAACGACTTC
TCGCCGATCA TGCAGCGCAT CAAGGACTCC GGCGCAGAAG GGGTCTTTGC CTTCCTGCCG
TCCGGCCCGA CGACGCTCGG CTTCGTCAAG GCTTTCAACG AAAACGGGCT AAAGGACGGC
GGCGTCAAAT TCTTCGCCCC CGGCGATCTC ACACAGGAGT CCGACCTGCC GGCGCTGGGT
GATGCCGCGC TCGGCCTGCA AACGACTTTC CACTACTCCG TCTCGCATGA TTCTCCCGAG
AACAAGGCGT TCGTCGAGGC GGCCGCCAAG GCGATCGGCA ATCCGACTGA GCTGTCCTTC
CCCTCGGTCG GCGCCTATGA CGGCATGCAT GTCATCTATA AGATGATCGA GGCGACAGGA
GGCGAGCAGA ACGCTCAGAA GGCGGTCGAC GCGGTCAAGG GGCTCTCCTG GACGAGCCCG
CGCGGACCGG TTTCGATCGA TCCCGAATCG CGCCATATCA CGCAGAACAT CTATCTGCGC
GAGGTCTCAA AGGCCGACGA TGGCACCTAT TACAACAAGG AAATCCAGAC CTTCGAGAAA
CAGGGCGATC CCGGCCTCGC GGCGCTGAAG TGA
 
Protein sequence
MRRIILAALA ALVTSAAAHA DAIKVGVVGP FSGPFALQGK NFKAGIDAYM ALSGVKVGED 
DIEIIYRDVP QADPAQSKAL AQELVVKEGV QYLAGFYFTP DAMAVTPLLE QANVPLVIMN
AATSAIVTKS PLVVRTSFTL WQTSTPIAKV AREAGVSKII SVVSDYGPGI DAENAFKTGF
EAVGGQVVEA IRMPLSTNDF SPIMQRIKDS GAEGVFAFLP SGPTTLGFVK AFNENGLKDG
GVKFFAPGDL TQESDLPALG DAALGLQTTF HYSVSHDSPE NKAFVEAAAK AIGNPTELSF
PSVGAYDGMH VIYKMIEATG GEQNAQKAVD AVKGLSWTSP RGPVSIDPES RHITQNIYLR
EVSKADDGTY YNKEIQTFEK QGDPGLAALK
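
Both sequences are printed in space-separated blocks of ten characters, so the layout has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first and last printed lines of the gene sequence are used here for illustration, so the GC value it yields is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last printed lines of the gene sequence, for illustration only.
first_line = "ATGAGAAGGA TCATTCTGGC CGCGCTCGCG GCCCTGGTAA CGAGCGCGGC GGCGCATGCC"
last_line = "CAGGGCGATC CCGGCCTCGC GGCGCTGAAG TGA"

dna = clean_sequence(first_line)
print(len(dna))                        # 60
print(dna[:3])                         # ATG (start codon)
print(clean_sequence(last_line)[-3:])  # TGA (stop codon)
print(round(gc_percent(dna), 1))       # GC percentage of the first line only
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would allow a comparison with the GC content listed under Gene Information.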