Gene Smed_3666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3666 
Symbol 
ID5318063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp105757 
End bp106824 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content59% 
IMG OID640775479 
Producthypothetical protein 
Protein accessionYP_001312412 
Protein GI150375816 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00707046 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGGCAAAT CCGTGAACGA AAAGCAAAGC GCTCCACTCA TTCACGGAGC CAGCGATTTG 
CCGTCAGTGA CGGTCGACGG CTACAATCTG GAACTGCGTG ACGGCGACGG CTTTCTCGGC
GACAAGGCCA ACAAATCCGC GTTCCAGGAA AAACTCGACG ACTGGCGCAA ACGGGTGCGC
AAAGGCGGCG ACGACCCGCT GGGAGAGGCG CTCACCCAGG ACCTGTCCAA GAAACAGATC
GACGCTCTTC TTCGTGGTGA CGACAAGCAG GCGGCCGCCC TCATCATGGG AGCGATGGAC
GAATTCGCCG GCGAGCTGGC CGGCGTTCTT GAGAAATTCC TGCAGCAGAA GAGCTGGAAG
AACTCGGAAC GTGTCGTGAT CGGAGGCGGC TTCAGAGGAA GTGCCGTAGG TGAGTTCGCG
ATTGCACGGG CGATGGTGCT GATGAAGGCA AAGGGTATCA AGATCGAGCT CTCTCCCATC
GTTCATCATC CGGATGATGC CGGGCTCATC GGTGCCGCAC ATCTCATGCC AGCCTGGATG
CTCAAGGGAC ACAAGACCAT CCTCGCTATC GACATCGGCG GTACCAATAT CCGCGTAGGC
ATCGTCGAGC TGCGTCTGAA AGATGATACG GACCTTTCAA GGGCCAAAGT CTGGAAATCG
GACATCTGGC GGCATGCGGA CGACAAACCC AACAGAAGCG CCACGATCGA AGCACTCATC
GGGATGATCG AAAAGCTCAT AGCCAAGGCG GACAAGGCGG ATCTTGCACC GGCGCCGGTC
ATCGGCGTTG CCTGCCCCGG TGTAATCAAT GCGGATGGCT CGATCCTGCG CGGAGGCCAG
AACCTGCCCG GCGGGAACTG GGAAAGCGAG CATTTCAACC TGCCTGCCAC GCTCAAGGAC
GCCATTCCGC AGATCGGCGA TCATGAGACC TTCGTAATCA TGCACAACGA CGCCGTCGTC
CAGGGCCTGT CGCAAGTACC ATTCGTGCAG AATGCTTCGA GCTGGGGTAT CCTGACGATC
GGGACCGGTC TCGGCAATGC GCACTTCAGC AACAAAGCCG GAAATTGA
 
Protein sequence
MGKSVNEKQS APLIHGASDL PSVTVDGYNL ELRDGDGFLG DKANKSAFQE KLDDWRKRVR 
KGGDDPLGEA LTQDLSKKQI DALLRGDDKQ AAALIMGAMD EFAGELAGVL EKFLQQKSWK
NSERVVIGGG FRGSAVGEFA IARAMVLMKA KGIKIELSPI VHHPDDAGLI GAAHLMPAWM
LKGHKTILAI DIGGTNIRVG IVELRLKDDT DLSRAKVWKS DIWRHADDKP NRSATIEALI
GMIEKLIAKA DKADLAPAPV IGVACPGVIN ADGSILRGGQ NLPGGNWESE HFNLPATLKD
AIPQIGDHET FVIMHNDAVV QGLSQVPFVQ NASSWGILTI GTGLGNAHFS NKAGN