Gene Smed_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1948 
Symbol 
ID5322807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2002028 
End bp2003008 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content63% 
IMG OID640790886 
Productextracellular solute-binding protein 
Protein accessionYP_001327617 
Protein GI150397150 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.383853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGC TTCTCCCTGC CGTCGCCGCA GCGCTCGTCG CCGGCCTCCT GTCCACTGCC 
GCTCTCGCCG AAAGCCTCGT GCTTTATACC AGTCAGCCGA ACGAGGACGC GCAGGCGACG
GTGGATGCAT TCGAGGCGGC CAACCCCGGC GTCGAGGTGG AATGGGTGCG GGAGGGAACA
ACAAAAATCA TGGCTAAGCT GATGGCCGAG ATCGAGGCGG GCAACCCCGT GGCGGATGTG
CTTCTGATCG CCGATACGGT GACGATGCAG CGGTTGAAGG AGGCGGGCCA GCTGATGCCT
TACAAATCTC CGGAAGCCTC AGCTTTCGAG GCCTCTCTCT TCGACCCGGA CGGCACCTAT
TATTCGACGA AGATGATCAC CACCGGGATC ATCTACAACA CTTCCGCCGC GATGAAGCCG
GCCGGCTGGG AGGATCTTGC GAAACCCGAG GCCAAGGGGC TCGTCACCAT GCCCAGCCCG
CTCACGTCAG GCGCGGCGCT GATCCATGCC CAGACGCTTG CCGGCATCGG TGCGCTCGGT
TGGGACTACT ACGAGGCGCT CGCGGAAAAC GGCGCGACGG CCGCCGGCGG CAATGGCGGC
GTGTTGAAGT CCGTCGCAAC GGGCGAGAAG GCCTATGGGA TGGTGGTGGA TTTCATGGCG
ATCCGCGAGA AGGCAAAGGG CGCGCCGGTG GAGTTCGTCT TTCCGGCGGA GGGCGTTTCG
GCCGTTACCG AGCCGGTCGC CATCCTGAGA ACCGCAAAAA ACCCGGATGC AGCAAAGAAA
TTCGTCGATT TCCTCCTTTC GGAAGAGGGG CAGCAGGTGG CAGTGACGAT GGGCTACATT
CCGGCCCGCA ACGGGCTTGC CTTGCCCGAG GGATTTCCCG CCCGCGAGGT TGTTAAGGTG
CTGCCGGTCG ACGCCGCCGC AGCCGTGAAG AATTCCGACG CGGATCTGAA AACCTTCTCG
GGGATTTTCG GCACCAACTG A
 
Protein sequence
MKTLLPAVAA ALVAGLLSTA ALAESLVLYT SQPNEDAQAT VDAFEAANPG VEVEWVREGT 
TKIMAKLMAE IEAGNPVADV LLIADTVTMQ RLKEAGQLMP YKSPEASAFE ASLFDPDGTY
YSTKMITTGI IYNTSAAMKP AGWEDLAKPE AKGLVTMPSP LTSGAALIHA QTLAGIGALG
WDYYEALAEN GATAAGGNGG VLKSVATGEK AYGMVVDFMA IREKAKGAPV EFVFPAEGVS
AVTEPVAILR TAKNPDAAKK FVDFLLSEEG QQVAVTMGYI PARNGLALPE GFPAREVVKV
LPVDAAAAVK NSDADLKTFS GIFGTN