Gene Smed_3701 details


Gene Information

Locus tag: Smed_3701
Symbol:
ID: 5318310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 141684
End bp: 143120
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 64%
IMG OID: 640775514
Product: GntR family transcriptional regulator
Protein accession: YP_001312447
Protein GI: 150375851
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCACTT TTGCATTGTC AATTGAGCTG ATTTTGATCA CTGTTCAAAT CATGACAAAT 
TGGACGCCCG ACACGACACT TCTGCGCCGC CCTGCCTATC TTTCGCTGGC GGATCAATTC
GCCCGCGCCA TTCAGGAAGG AAGGCTCCAG AACGGTGCAA GGCTGCCGAC GCACCGCAAG
CTTGCCGACG ATCTGAAGCT CTCGGTTCAG ACGGTAAGCC GCGCCTATGA CGAGCTCATC
CGGCGCGGGC TGATCTCCGG TGAGGTCGGG CGTGGCAGTT TCGTGCAGAC GCGGCCGCGC
GAGCCGGAGC CGCCCTATCT GCCGGAGCGT CTGGGCGAAT TGATCGATCT TTCGATTCTC
AAGCCGGTCT GCGAGCAATA CCATCTGGAG AAGATGCGGC AGGGCTTTGC CTGGCTTGCC
GAAAACCTGC CTGCAAGTTC GGCGCTTTCC TTTCGTCCGA ACATGGTCTT CCCGCGGCAC
CGCAACATCG CGGCTGAATG GCTGTTACGC TGCGGCCTGG ACGTCTCTCC GCTCAACATC
AATCTGACCA ACGGTGCGAC TTCGGCCATG ACCGTGGCGC TGATGAGCGT GGCGCCGCCC
GGCTCGACCG TCGCAACCGA GGCGATCAGT CACCACACTC TGGTGCCGCT TTCGAGCTAT
CTCGGAATCC ACCTCAACGG CATCGCCATC GACCGCGACG GCATGATTCC GGATGCCCTG
GACGAGGCAT GCCGCAAGGG CGTGGTTCGC GCGGTCTTCC TGCAGCCTTC GGTGATCAAC
CCGACCGCGA CCCTGATGAG CGCGGAGCGG CGGGCGGCGC TGGCGGAGGT GGCGCGGCGG
CACGACATCG CCATCATCGA GAACGATATC CTCGGGCCCC TGGTCGAGGA GCGCTTGCCC
CCGGTCGCCG CATTGGCTCC GGAGCGTACG CTCTACGTGA CGAGCTTCAC CAAGATAACC
GTGCCCGGCC TCAGGATCGG CTACCTGGTG GCCCCCGACC GTTATGTGGC CGCGGTTGCC
AACAGGCATC TCGTTTCCAA CTGGATGGCG ACTCCGGCGA TTGCGGAGAT CGCGACGCAG
TGGGTGAGCG ACGGCACGGC GATCGAACTC GTGAACTGGC AGCGCCGCGC GCTCGCCTCG
CGCCATGTGA TCGCCAAGGA GGTGCTCGGA AGTCTTGCTC ATCACACGCA CCCGCAGAGC
CTGCATGTCT GGCTGCCGCT GCCCGAAGGT CATATGGAGG ACGCCTTCGT CTCTCAGGCA
CGCCTGCGCG GCGTGGCGAT CGCACCGGGT GCATCCTTCC GAACGGCGGA CAGCGGCTGG
CGTCCGGCCG TCAGAATATC GCTCGGCTCG ACGACCGAGC AGGAACTCAG GTCCGGGCTT
GGCATTGTCG CCTCCCTGGC GCTTGGCAAG CCGGAGGCAT TATTGCTCGT CATTTGA
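
The gene sequence above is printed in blocks of 10 bases, 60 per line, so it needs to be joined before its length or GC content can be checked against the record. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are illustrative and not part of any database tool, and only the first and last sequence lines are embedded to keep the example short.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "GTGCTCACTT TTGCATTGTC AATTGAGCTG ATTTTGATCA CTGTTCAAAT CATGACAAAT"
last_line = "GGCATTGTCG CCTCCCTGGC GCTTGGCAAG CCGGAGGCAT TATTGCTCGT CATTTGA"

print(len(clean_sequence(first_line)))  # 60
print(len(clean_sequence(last_line)))   # 57

# 23 full lines of 60 bases plus the 57-base last line give the
# "Gene Length" value from the record.
print(23 * 60 + len(clean_sequence(last_line)))  # 1437

# A 1437 bp CDS holds 479 codons; the final one is the stop codon,
# which leaves the 478 residues reported as "Protein Length".
print(1437 // 3 - 1)  # 478
```

Passing the full cleaned sequence to `gc_fraction` gives a value that can be compared with the "GC content" field; the result is not quoted here because it has not been recomputed for this record.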
 
Protein sequence
MLTFALSIEL ILITVQIMTN WTPDTTLLRR PAYLSLADQF ARAIQEGRLQ NGARLPTHRK 
LADDLKLSVQ TVSRAYDELI RRGLISGEVG RGSFVQTRPR EPEPPYLPER LGELIDLSIL
KPVCEQYHLE KMRQGFAWLA ENLPASSALS FRPNMVFPRH RNIAAEWLLR CGLDVSPLNI
NLTNGATSAM TVALMSVAPP GSTVATEAIS HHTLVPLSSY LGIHLNGIAI DRDGMIPDAL
DEACRKGVVR AVFLQPSVIN PTATLMSAER RAALAEVARR HDIAIIENDI LGPLVEERLP
PVAALAPERT LYVTSFTKIT VPGLRIGYLV APDRYVAAVA NRHLVSNWMA TPAIAEIATQ
WVSDGTAIEL VNWQRRALAS RHVIAKEVLG SLAHHTHPQS LHVWLPLPEG HMEDAFVSQA
RLRGVAIAPG ASFRTADSGW RPAVRISLGS TTEQELRSGL GIVASLALGK PEALLLVI
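
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below shows one way to check the two ends of the record by hand. It assumes the standard amino-acid assignments shared by tables 1 and 11; `translate_cds` is an illustrative helper, and `START_CODONS` lists only three common bacterial initiation codons, not the complete set for table 11.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino-acid string used for the standard and bacterial genetic codes.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of common bacterial start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a CDS fragment, reading a leading start codon as M
    and dropping a trailing stop codon if present."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiation codon is read as methionine
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCTCACTT TTGCATTGTC AATTGAGCTG"))  # MLTFALSIEL

# Last 27 bases of the gene sequence, which are in frame because
# 1437 - 27 = 1410 is a multiple of 3.
print(translate_cds("CCGGAGGCAT TATTGCTCGT CATTTGA"))  # PEALLLVI
```

Both outputs match the first and last residues of the protein sequence above. The same function can be applied to the full cleaned gene sequence to compare against all 478 residues.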