Gene Smed_4137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4137 
Symbol 
ID5319279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp606754 
End bp607833 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content61% 
IMG OID640775942 
Productregulatory protein LacI 
Protein accessionYP_001312875 
Protein GI150376279 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACAA TCAAGGAAAT CGCATCGGCC GTCGGCGTGT CGTCAGCAAC CGTCTCCCGG 
GTGCTCAACT ATGATCCGAC ACTGTCCATT TCCACCAGGA AACGCCAGGC GATCATCGAA
ACGGCCGAGG CTTTGAACTA TGCGACGCCG CGCAACCGCA ACCGTGCCGC GGCTCAGGCA
GTCGGCAGCG GATTGAAGAT CGCGCTCGTG CACTTCCTAG ACCCGGCCCA GGAACTCGCC
GACCCCTATT ATGTCGGCGT CCGGCTCGGC ATCGAAAGCC GCTGTCAGGC CCTGAACAGC
GATGTGGTCA AGGTCTTTCT CACCGGCAAC ACTCCTGAAG CGACGATCCT TGAAGGCGCC
TCGGGCGTGG TGGCCGTCGG TCACTATTAC GGCGACGAGC TCGAATGGCT GCGCCGCCAC
AGCCGCCATC TCGTGTTTGC CGATTATGCG CCCGCCGGAG ACATGGAAGA CACGGTACTC
AGCGACGTCT CCCAGGCGAT GATCCGGCTC CTGGAGGCGG TGCATGCCAT GGGCTATCGC
CGCATCGGAT TCATCGGTTG GATCGACGCT TTCTACGGGC CGGACAACAT TCATTCGGAG
CGTCGCTGCC ACACCTATAT CGACTGGATG ACCAAAACCG GGCTCTTTGA TCCGGAATTG
TGCCTGGTCG ATCCGATGAC TCCGGACAGC GGCTACAGGC TTGCCAAGGC GATGCTGTCG
AAGCCCAATC CGCCGAAGAT CCTCATCACC TGCAACGACA ATATGGCGCT CGGCGCCTAT
AGGGCGATCA ACGAGATGGG GCTCAGGATT CCTGATGATG TCGCAGTCGC AAGCTTCAAC
GACATTCCGG TCGCGCAGTT TCTCGGGCCG CCGCTTTCCA CGGTTAAGAT CCCGGCGGAA
CTGATCGGCG AAACCGCCGT CGACCTGCTG GTCGAACGCC TGTCCGGCCG CGAGGTCGCC
AAGAAGGTGG TCTTTGGTAC CGAAATCATC TGGCGCGCAA GCACACCGGC ACCAACCGGG
GCTGCAAACC CGGCAGAGCA TATGGTGCCC GCAAGTTCCG CCTCAGAAGT CCCAGGGTGA
 
Protein sequence
MVTIKEIASA VGVSSATVSR VLNYDPTLSI STRKRQAIIE TAEALNYATP RNRNRAAAQA 
VGSGLKIALV HFLDPAQELA DPYYVGVRLG IESRCQALNS DVVKVFLTGN TPEATILEGA
SGVVAVGHYY GDELEWLRRH SRHLVFADYA PAGDMEDTVL SDVSQAMIRL LEAVHAMGYR
RIGFIGWIDA FYGPDNIHSE RRCHTYIDWM TKTGLFDPEL CLVDPMTPDS GYRLAKAMLS
KPNPPKILIT CNDNMALGAY RAINEMGLRI PDDVAVASFN DIPVAQFLGP PLSTVKIPAE
LIGETAVDLL VERLSGREVA KKVVFGTEII WRASTPAPTG AANPAEHMVP ASSASEVPG