Gene Smed_5729 details

Gene Information

Locus tag: Smed_5729
Symbol:
ID: 5320031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 698757
End bp: 700283
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 60%
IMG OID: 640777447
Product: transcriptional regulator domain-containing protein
Protein accession: YP_001314379
Protein GI: 150377784
COG category: [S] Function unknown
COG ID: [COG5616] Predicted integral membrane protein
TIGRFAM ID:
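
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon; neither is stated in the record, but both are consistent with the listed lengths. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 698757
end_bp = 700283
gene_length_bp = 1527
protein_length_aa = 508


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)
print(computed_length, computed_protein)  # 1527 508
```

Both computed values agree with the Gene Length and Protein Length fields.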


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.816557
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGAAC CGCTATATAC TTTCGGCGAC TTCGTGCTGG ATCCCCGCAG CGGGCTACTG 
CGGCGGAAGG GAGAGTCGGT GGAAATCGGC TCCCGCGGTC TCGCGCTTCT CCAAGCATTG
CTGGAAGCCA GGGGCGGGAT CGTATCCAAG GCCGAACTGA TCGAACGAGG TTGGCCGAAC
ACGATTGTCG AGGACAGCAA CCTTACCGTA CAGATCGCAA GTTTGCGAAA GGCGCTTGGA
CCGTCTCCCG ACGGGCGTGA GTGGATCACG ACAATACCTC GGGCTGGCTA TCGCCTAGTC
ACGGCTTATG CGGGAGCTTC CAGTCTGACG TCAACCAGGC CCGCACTGGC GGTTCTGCCA
TTTATCAATC TTGATGGCGG TTCTGACCAG GGTTATTTCG CGGACGGCGT TGTCAATGAG
ATCATCACCG CGCTCAGCCG CTTCAGAAGC TTCGCCGTGG TCGCGCGCAA CGGATCCTTT
TTCAACAATT TGCGATTTCC TGACGTACGA TTGGTCGCCA AGGAACTCGG TGTCGGCTAT
ATGCTCCAAG GCAACATAAG GCGTCCGGGC AGCCGATTGA GAATCTCGGT ACAGCTCGTT
GACGGCAGTG GCACACATCT CTGGGCGCAT AGCTTCGATG AGGAACTTGA TGATGTGTCC
GTTTTCCAGG ACCGAATAGC CGAGAGCGTC GTGTCACTGG TTGAGCCGCA TATTCAAGCG
GCCGAGATCG AACGCTCACG TCGAGACCGG CCGGGGAGCA CTGCCTCTTA CGATATCTAT
CTGCAAGCGC TGGCAAAAAT CTCAACGGAG TCTGAACTTG ACAATGCGGA GGCTTACGCC
CTCCTCATGA GAGGCATTGA GGCCGAGCCC GACAATGCTC TTCTGCTCGC CCATGCCGCG
TGGGCGCTCG AGCATCGGCA CACGATGGGT TGGCCATCGC TTGGTGAGGA CGATGTAGGA
GAGTGTGTGG CGCTGGCGCG GCGCGGGCTC GAACATGCTG CGGGAGATGC GATGGTGATG
GCGCATTGCG GCGTCGCCCT GCTGCAAACC GCAAAGGACT ACGATTGGGC CTTGGCGGTC
CTGCAATCTG CGGCGGAGGC TAATCCAAAC AACCTGATGG TCGTAGTCCG GGCCGGCCTT
GCGCACCTGC ATTGCGGGAG CCTTGACGAG GCCCTGACCC ACTTTCAAAG GGCAAGCCGG
TTGAGCCCGG GGGACCGCGG CGCTCACTTC TCGCTCTGCG GTATTGCAGA CGTGCACTTG
ATCCACGGCA ACTATGCCGA GGCGATTACC TGGGCCGCTC GCGCGCTCGC GAGCAATCCG
AATTTCGATC CGAACCTCTG GGTGCTGATC GCAGCGAATG CCCATCTTGG CCGAATGGAA
GAGGCGGATC GGTATCTGCG TGAGCTAAGG CGACGTGTTC CGGAGGTCAC AATCGCCCGC
ATCAAGGCCG GACAACCTCG CAAGGACCCG ACGAGGACCG CCGCCTTGCT CGAAGGGTTA
CGCAAGGTCG GCCTGAAGGA AGGCTGA
 
Protein sequence
MVEPLYTFGD FVLDPRSGLL RRKGESVEIG SRGLALLQAL LEARGGIVSK AELIERGWPN 
TIVEDSNLTV QIASLRKALG PSPDGREWIT TIPRAGYRLV TAYAGASSLT STRPALAVLP
FINLDGGSDQ GYFADGVVNE IITALSRFRS FAVVARNGSF FNNLRFPDVR LVAKELGVGY
MLQGNIRRPG SRLRISVQLV DGSGTHLWAH SFDEELDDVS VFQDRIAESV VSLVEPHIQA
AEIERSRRDR PGSTASYDIY LQALAKISTE SELDNAEAYA LLMRGIEAEP DNALLLAHAA
WALEHRHTMG WPSLGEDDVG ECVALARRGL EHAAGDAMVM AHCGVALLQT AKDYDWALAV
LQSAAEANPN NLMVVVRAGL AHLHCGSLDE ALTHFQRASR LSPGDRGAHF SLCGIADVHL
IHGNYAEAIT WAARALASNP NFDPNLWVLI AANAHLGRME EADRYLRELR RRVPEVTIAR
IKAGQPRKDP TRTAALLEGL RKVGLKEG
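
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the codon-to-residue assignments of NCBI translation table 11, which for elongation codons are the same as the standard code; alternative start codons (for example GTG or TTG read as M) are not handled, which does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any tool used to build this record. Only the first and last rows of the gene sequence are used as input, to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence shown above.
first_row = "ATGGTGGAAC CGCTATATAC TTTCGGCGAC TTCGTGCTGG ATCCCCGCAG CGGGCTACTG"
last_row = "CGCAAGGTCG GCCTGAAGGA AGGCTGA"

print(translate(first_row))  # MVEPLYTFGDFVLDPRSGLL
print(translate(last_row))   # RKVGLKEG
```

The last row can be translated on its own because the 1500 bases before it are a multiple of three, so it begins in frame. The two outputs match the first 20 and last 8 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give the value to compare with the GC content field.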