Gene Smed_1413 details


Gene Information

Locus tag: Smed_1413
Symbol:
ID: 5322264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1494057
End bp: 1495349
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 58%
IMG OID: 640790355
Product: hypothetical protein
Protein accession: YP_001327094
Protein GI: 150396627
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
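
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1494057
end_bp = 1495349
gene_length_bp = 1293
protein_length_aa = 430

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one 3-bp stop codon that is not translated.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(coding_codons == protein_length_aa)  # True
```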


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0999393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.104956
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAAC TTTTCGTCAT TTTGCTTCTG CTTTTGATCA ACGCCTTTTT TGCGTTGTCG 
GAAATGGCAC TCGTGTCGGC AAGCAAACCG CTCCTTCGTC AGATGGTCAA GCAGGGAATC
CCGCGCGCAG AAGCCGCACT CAGGCTTGCG GAAGATCCAG GCAAATTCCT GTCGACGGTG
CAGGTCGGGA TCACCCTGGT CGGTATCCTG GCTGGCGCCT ATGGAGGCGC GACAATCGCC
GCCAACATCG CGCCGTTCCT GAACGACATC GCCTGGATCA GCCCTTACGG CGATACGGTC
GCGGTCGCCC TCGTCGTCAC CCTGATCACG TTTCTGTCGG TGGTCATCGG CGAGCTCATA
CCGAAGCAGT TGGCGCTTCG AAACTCGGAA GCGCTGGCGA TGTTCGTCGC CCGTCCGATG
GCGCTGCTTT CGCGTATCGT CGCCCCGGTA GTCTATCTGT TCGAAGGCGC GGCCAACCTT
TCGATGCGTA TCATGGGAAT GAGGCCCGAG GACGCGGATC ACGTGACCGA AGAGGAAGTT
CAGGCGATCA TGGCGGAAGG CGTCGAAAGC GGCGCCATCG AAAAGAGCGA ACACGAGATG
CTGCGGCGGA TCATTCGCCT TGGCGACCGC AATGTAAAAA CGATTATGAC GCATCGCACC
GAGGTGAGCT TCATCGACAT CCAGGACGAT CTGGAGACGA TCGGACACAA GATCCGGCAG
TCCGGCCACT CGCGCTATCC GGTGGTCGAC GGGCCTGCGG GCGATGTGAT CGGGGCAGTC
CTTGCAAAGG AGATATTGAA TGTTTCGCAA ACCGGAAAAT TCAATATCCG CGATTATGTC
CGTGACATTC TCACACTGCC GGAGACGGCC TCCTGCTTAA AGGCGCTCGA AGCCTTCAAG
ACGTCCAGCA TCAATATGGC CATGATCGTC GACGAATATG GGAGCACAGA GGGGATCATC
ACCACCGCCG ATATCCTCGA GGCGATCGTG GGCATCATTC CATCAAACTA TGACGATTCC
GAACATGCCC TCATTCACCT GCGCGACGAC GGCAGCTATC TCGTAGACGG ACGTACGCCA
ATCGATGAGA TCCACCTTCA GATCGGCATC GAGGGCATTG ACGCCGACAG CGATTTCGAA
ACCATCGCGG GCTTTCTGGT GCAGCAATTG CGCAAGTCGC CGGAAGAGGG CGACACGGCC
GAGGCTCACG GCTATCGATT CGAGGTGATC GATATGGACG GCCGCCGTAT CGACAAAATC
CTGGTCAGCC GAGCCGGTGA GGCACTTTCC TGA
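
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base blocks used on this page; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short example.
first_line = "ATGGCCGAAC TTTTCGTCAT TTTGCTTCTG CTTTTGATCA ACGCCTTTTT TGCGTTGTCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running `gc_percent` on the full 1293 bp sequence would be expected to give a value close to the 58% listed in the record, assuming that figure was computed over the same span.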
 
Protein sequence
MAELFVILLL LLINAFFALS EMALVSASKP LLRQMVKQGI PRAEAALRLA EDPGKFLSTV 
QVGITLVGIL AGAYGGATIA ANIAPFLNDI AWISPYGDTV AVALVVTLIT FLSVVIGELI
PKQLALRNSE ALAMFVARPM ALLSRIVAPV VYLFEGAANL SMRIMGMRPE DADHVTEEEV
QAIMAEGVES GAIEKSEHEM LRRIIRLGDR NVKTIMTHRT EVSFIDIQDD LETIGHKIRQ
SGHSRYPVVD GPAGDVIGAV LAKEILNVSQ TGKFNIRDYV RDILTLPETA SCLKALEAFK
TSSINMAMIV DEYGSTEGII TTADILEAIV GIIPSNYDDS EHALIHLRDD GSYLVDGRTP
IDEIHLQIGI EGIDADSDFE TIAGFLVQQL RKSPEEGDTA EAHGYRFEVI DMDGRRIDKI
LVSRAGEALS
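
The protein sequence corresponds to a translation of the gene sequence with translation table 11, as listed in the record. The following is a minimal Python sketch, assuming the standard NCBI codon ordering (bases T, C, A, G), in which tables 1 and 11 share the same amino-acid assignments. `translate` is a hypothetical helper, and the alternative start codons that table 11 permits are not handled.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCCGAAC TTTTCGTCAT TTTGCTTCTG CTTTTGATCA ACGCCTTTTT TGCGTTGTCG"
print(translate(first_line))  # MAELFVILLLLLINAFFALS
```

The output matches the first 20 residues of the protein sequence shown in this record.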