Gene Smed_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0043 
Symbol 
ID5320870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp44129 
End bp45304 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content64% 
IMG OID640788974 
ProductCBS domain-containing protein 
Protein accessionYP_001325738 
Protein GI150395271 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0313302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000114145 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGACT TCAAGACAGA GCCGGCCGCT GTCGCGACCG AGGAGGCTGA AGCCTCCGGC 
GACGCCGAGG CCGGTAGTAG TACCGCCGCC CGATCCGAGG GCAGCAAATC CACATCATCC
TTCTGGAGCC GTGCCGCGCG CCTGTTGCGC GGTGTGAGTC CATCAAGCCT GCGCGAGGAT
CTTGCCGACG CGCTGATGAC CGACACCGGA GGCAATGCGG CCTTCTCGCC CGAAGAGCGG
GCGATGCTCA ACAACATCCT CCGCTTTCGC GAGGTTCGTG TTGAAGACGT GATGGTTCCG
CGAGCCGACA TAGAGGCGGT GGACCAGAAC ATCACCATCG GCGAACTCAT GGCCCTCTTC
GAGGAGTCTG GTCGCTCGCG TATGCCGGTC TACAGCGAAG GGCTCGACGA TCCCCGCGGC
ATGGTCCACA TCCGCGATCT CCTTGCCTAT GTGGCCAAGC AGGCGCGCAA CCGGCGCCGC
AACGGCAAAG CTCCAACGGC GCCGACGACC GCAACGACCA CGAATGGCGA CAAGCCCGAA
AAGGCCCCCC GACAGCAGAA GCCGGGTTTC GATCTCTCTC GCGTTGACCT CGACAAGACG
GTCGAGGAGG CGGGAATCAT CCGTCAGCTG CTGTTCGTGC CGCCGTCGAT GCTTGCCTCG
GATCTCATGC AGCGCATGCG CGCTGCGCGC ATTCAGATGG CTCTCGTCAT CGACGAATAC
GGCGGGACGG ACGGTCTTGT ATCGCTCGAG GACATCGTCG AGATGGTGGT CGGCGATATC
GAGGATGAGC ACGACGACGA GGAGGTGATG TTCGCGCGCA GCTCCGACGA CGTCTTCATC
GCCGACGCTC GTGTGGAGCT GGAGGAAATC GCCGAGGCGG TCGGGCCGGA CTTCGATGTA
CGCGAGCAAC TCGAGGACGT CGATACGCTT GGCGGTCTCG TTTTCGCATC GCTCGGCCGG
ATTCCCGTTC GAGGCGAGGT GGTGCAGGCG ATTCCCGGTT TCGAGTTCCA GATACTCGAT
GCGGATCCAC GTCGCGTCAA ACGCGTCAGG ATCATGCGCA AGCGCCCGTC TTCGCGCCGC
CGCCCGCCGA AGGTCGAGAA GGAGCCGCTG CCAGAGGCGT TTGCCACGAC CGGCGCCACG
GGCGCCGGTG TCCGGCCTCC GGCTTCGTTG GAATAG
 
Protein sequence
MSDFKTEPAA VATEEAEASG DAEAGSSTAA RSEGSKSTSS FWSRAARLLR GVSPSSLRED 
LADALMTDTG GNAAFSPEER AMLNNILRFR EVRVEDVMVP RADIEAVDQN ITIGELMALF
EESGRSRMPV YSEGLDDPRG MVHIRDLLAY VAKQARNRRR NGKAPTAPTT ATTTNGDKPE
KAPRQQKPGF DLSRVDLDKT VEEAGIIRQL LFVPPSMLAS DLMQRMRAAR IQMALVIDEY
GGTDGLVSLE DIVEMVVGDI EDEHDDEEVM FARSSDDVFI ADARVELEEI AEAVGPDFDV
REQLEDVDTL GGLVFASLGR IPVRGEVVQA IPGFEFQILD ADPRRVKRVR IMRKRPSSRR
RPPKVEKEPL PEAFATTGAT GAGVRPPASL E