Gene Smed_5060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5060 
Symbol 
ID5319362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp6553 
End bp7743 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID640776840 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001313772 
Protein GI150377177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCA ATGGAACTAG CGTCGCCAAC AAGCTCATCG GCACGAACCT TGCGGACACG 
CTCCGGGGTT ACGATGGCAA TGACGTCATT TGGGGCAATG CTGGTGACGA CTATCTCGAT
GGAGGGCTCG GCAACGACAC GCTCTATGGC GGCCTCGGCG ACGACTTTTT CAAGGTCGGT
GGGGGAGATG ACTACGCCGA AGGCGGCGAC GGCAACGACA GCTTCGATGG CGGCGCTGGC
GCCGATACGT TGATCGGCGG GCTCGGCAAC GACATCTTCT CCGGCGAGGA CGGCAATGAC
ATCATCGACG GCGGCGATGG TCATGACCAT ATCCTGGGGG GCGCCGGCAA CGACGAGATT
TATGGTGGCG CAGGTGAGGA ATACATCGAC GCCGGATTGG GAGACGACAT CATCTATGCC
GGCTCGGGAA ATGACGGCTT CAACAACCGC ATCGACCCCG CGACAGGCAA GCTGACGCAG
CAGGCGGTCG GTGGTGGCGC CGGCAACGAT ACGATCTATG GCGAAGAGGG CAGTGACGCC
CTGAAGGGCC AATCCGGGCA CGACCGGGTC TATGGCGGCA TAGGCGACGA CATTGTCGAC
GGCGGCGACG GGAACAACTA CCTCGACGGC GGTGATGGGA ATGACGTGCT CGATTCCGAG
GGCGGCATCG ACGAAGCCCG TGGCGGTATC GGCAACGATC GGATTGCCGT GGGCGGCGGC
AATGATCTGG CATACGGAGA TGCCGGCGAC GATATCCTGT CGGGAGCAGC GGGCGATGAC
ATCCTTGATG GCGGCCTCGG CAACGACCTT GTCACTGCCG GGGACGGAAA CGACACGCTA
CGCGGTGACG CTGGGAAAGA TACCCTTCTC GCCGAGGCGG GAAGCGACAT CCTCTGGGGT
GGGGCCGACG CCGATCGCTT CGTGTTCAAG GGCGCCGGTT CGCTCGTCGG ACGGGATTCC
GTTATGGACT TTCAGAACGG CGTCGATCTG TTTGTTCTGG AGAACCTTGG AATCAAGCAA
TATTCGAGTT CCGGCGCGGC GGGCACGATT TACGCGTACA ATGACGCGAG CGGAGCCGTG
ATGTTGAAAG GCTATGATTC TGCGGGCAAC GCGGTGACGA TACATGTCGA CGATCCGGCG
AATAGCCTCG CTGCCTCTCA CTTCAGCAGC GCGGATTTTC TCTTCGCTTG A
 
Protein sequence
MNINGTSVAN KLIGTNLADT LRGYDGNDVI WGNAGDDYLD GGLGNDTLYG GLGDDFFKVG 
GGDDYAEGGD GNDSFDGGAG ADTLIGGLGN DIFSGEDGND IIDGGDGHDH ILGGAGNDEI
YGGAGEEYID AGLGDDIIYA GSGNDGFNNR IDPATGKLTQ QAVGGGAGND TIYGEEGSDA
LKGQSGHDRV YGGIGDDIVD GGDGNNYLDG GDGNDVLDSE GGIDEARGGI GNDRIAVGGG
NDLAYGDAGD DILSGAAGDD ILDGGLGNDL VTAGDGNDTL RGDAGKDTLL AEAGSDILWG
GADADRFVFK GAGSLVGRDS VMDFQNGVDL FVLENLGIKQ YSSSGAAGTI YAYNDASGAV
MLKGYDSAGN AVTIHVDDPA NSLAASHFSS ADFLFA