Gene Smed_5098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5098 
Symbol 
ID5319400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp46329 
End bp47372 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content64% 
IMG OID640776876 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001313808 
Protein GI150377213 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.109281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.521536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGCC GGATGATCGC TCCATGTTTC ATGGACGACA CGCTCGAATG CCTTAGGCGC 
CATGGAGTCG ACGCCGGGCC CCTTCTTGCG CAAGTCGGCC TCCCGCCAGT CATCACCGGC
CCTGTGCCGG CCGAGCAATA CGGCGCATTG TGGCATGCGG TGGCTCAGGT CATGGATGAC
GAGTTTTTCG GCGAAGGCGC ACGGCCGATG CGCGCCGGCA GCTTCGCATT GCTGTGCCAT
GCCATTCTTT CGACGGTGAC CCTCGAACAC GCGCTGCGGC GAGCCCTGCG CTTTCTAAGG
ATCGTCCTCG ACGATCCCCA TGGCGAACTG GTCGTCGAGG ACGGACTGGC GCAGATCGTC
CTCGAGGATG CCGGCGCGAC ACGCTCCGCC TTTGCCTACC GCACCTTCTG GATCATCCTG
CATGGCGTCA ATTGCTGGCT GATCGGACGC CGGTTACCGA TCCGCTGGGT CGACTTCCGC
TGCAGCGCGC CGCCGGCCGG AACCGATTAC CGGCTCTTCT TCGGTGCGCC CGTCCGCTTC
GACCAGCCCC GGACGCGGCT CGTCTTCGAT GCCGAGTACC TGAAACTGCC GCCCATTCGA
GACGAGCGGG CGCTGAAACA CTTCCTCCGG CATGCGCCGG CGAACATTCT CGTGCGCTAT
CGCCACGATG CGGGCTTGTC CACGGCAATA CGCAGGCGGC TCCAGGCCTT GGATCCGTCC
GCCTGGCCCG GTTTCGAGAC GCTCGCGGCG CGGATGCGGA TTGCAGCGCC GACCCTTCGC
CGGCGATTGA AGCAGGAAGG TCAGACCTAC CGATCAATCA AGGAAGACCT GCGCCGGACA
CTTGCCATGG AAGCGCTTGC CGAGCGCGGA CAGAACGTGG CGCAGCTCGC CGTCGAACTC
GGCTTTTCCG AACCAAGCGC CTTTCATCGC GCATTCCGCA AATGGACGGG GAAATCCCCG
GCGCAATTCC GGCGCAGTGC CAGCGAAACA GGTCTGGCGG AAAGCGGCTC GTTGAAGAAA
CGCACTCAAC TCCAGGAGAG CTGA
 
Protein sequence
MERRMIAPCF MDDTLECLRR HGVDAGPLLA QVGLPPVITG PVPAEQYGAL WHAVAQVMDD 
EFFGEGARPM RAGSFALLCH AILSTVTLEH ALRRALRFLR IVLDDPHGEL VVEDGLAQIV
LEDAGATRSA FAYRTFWIIL HGVNCWLIGR RLPIRWVDFR CSAPPAGTDY RLFFGAPVRF
DQPRTRLVFD AEYLKLPPIR DERALKHFLR HAPANILVRY RHDAGLSTAI RRRLQALDPS
AWPGFETLAA RMRIAAPTLR RRLKQEGQTY RSIKEDLRRT LAMEALAERG QNVAQLAVEL
GFSEPSAFHR AFRKWTGKSP AQFRRSASET GLAESGSLKK RTQLQES