Gene Smed_4168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4168 
Symbol 
ID5319197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp642350 
End bp643339 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content66% 
IMG OID640775973 
Producthelix-turn-helix domain-containing protein 
Protein accessionYP_001312906 
Protein GI150376310 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.565249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.28666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGACC ATTCGTCCAA AACCTCCTCT TCCGCTTCTG AGGATTTTCT GACCGAGCTG 
TTGCGCGGGC TCCGTCTCGA CGGGGTGGAT TACGTCCGCT GCGAATTGAC GGCACCCTGG
GGGATCTCAT TTCCGGCGCA GGAGACGGCG CGCTTCCATT TCATCTGCGG GGATTGCTGG
CTGCGCGTCG CCGACGGGGA CTGGATCGAG TTGAAGCGCG GCGATGCCGT GCTCCTGCCG
CGCGGCGGCG AGCACGCGCT GGCAAGCATG CCAGGCGAGA AACTCGCTCC GCTCGACGCC
TATTCGGTCC AGGAAGTATG CCATTGCGTC TACAATGTCT GCGGCGGCGG GCGCGGCGAG
ACCACCATTC TTTTCTGCGG CAGCCTCAGG TTCAACATGG ATTCCATGCA TCCGCTGCTG
CGCATGATGC CGGACGTGAT GCGAATCAAC GCACTGACCG CCAGCGAGCC GGCTATCCCG
CACATGCTCG ACGCCATGGC GCGGGAAGTC GGCGCCAGCC GCGTCGGTTC CGGCGGTGTC
CTGGCGCGGC TCGCCGACGT GCTCGCGGCC CTCATCATCC GTTCCTGGGT TGAACACGGA
TGCGGCAATA CCAGCGGCTG GGTGGCGGCG GTCCGCCACC CCGGCCTCGG CCGGGTCATC
GCGGCCATGC ACCTCGACCC GGAAAAGGCC TGGACCGTCG ACTCCCTCGC CAGGCTGATG
GGCGCCTCGC GTTCCGGCTT CGCTCAGCAA TTCGCCAGCG TGGTCGGCGA GACGCCGGCC
CGCTACCTTG CGCAAGTGCG TATGCACCAG GCACGTCAGT GGCTGACCCG CGACCGCATG
CGTATCTCGG TCGTGGCACG TCGCCTCGGC TATGATTCGG AAGCCTCCTT CAGCCGCGCC
TTCAAGCGCG TGATCGGCCA GCCGCCGAGT CATTATCGTG GCGCCGACCC GGCCGAGGTC
TCCACATTCG CTGGCGAGAG CAGACCCTGA
 
Protein sequence
MLDHSSKTSS SASEDFLTEL LRGLRLDGVD YVRCELTAPW GISFPAQETA RFHFICGDCW 
LRVADGDWIE LKRGDAVLLP RGGEHALASM PGEKLAPLDA YSVQEVCHCV YNVCGGGRGE
TTILFCGSLR FNMDSMHPLL RMMPDVMRIN ALTASEPAIP HMLDAMAREV GASRVGSGGV
LARLADVLAA LIIRSWVEHG CGNTSGWVAA VRHPGLGRVI AAMHLDPEKA WTVDSLARLM
GASRSGFAQQ FASVVGETPA RYLAQVRMHQ ARQWLTRDRM RISVVARRLG YDSEASFSRA
FKRVIGQPPS HYRGADPAEV STFAGESRP