Gene Smed_2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2213 
Symbol 
ID5323073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2291664 
End bp2293502 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content63% 
IMG OID640791150 
ProductSARP family transcriptional regulator 
Protein accessionYP_001327880 
Protein GI150397413 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.226859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.3766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGTGC CGCATTTTCA TCTGCAGACA TTCGGGGAAT TGCGGCTGGT CGATTCCGGC 
GGCGTGGTCG TGGCGTGCCC GGAGCGGGGC CTGCTTATTC TGGCCTACCT CAGCATATCC
GGGCAGAGAA CGGTCTCCCG CGAAAAGCTC TCGATGCTGA TCTGGCCCGA GCGGGAGAAG
AGCGTCGCTT TCAAGAACCT GCGCTCTACC CTGTGGCGCA TGCGCCACCT TGCCGCGAAC
TCGGGGCCGC TCACGTCGGC GACGGAAATA GATGTGACGC TGGGGGAGAT TGCCTGCGAC
GCCGCCGGTT TCGAGGCGCT CGGGAGCTCG GAGGAAGCGT TTCGCGCCGG GCTCTCGCTC
TGGCGCGATA CGTTCCTAGC AGGCGCCGAT GTGCCTCCCG GGCGTTATGC GGACTGGGTT
GGAGAGCAGC GGAGCGTGTT TACGCAAAAG CTGCGCGCGG CACTTTTAGC CGGCGGAGAG
CGATTTGCCG ACAGGTCGAT GCTGCGGACG GCGTGCATGC ATCTCCTCGA ACTCGATCCG
GCGGATACTG CCGTGCGCGC AATCCTGGAG CGGTTGACCG GCGTGGGTCT CGTCATCGGG
AAAGGCTCGC CGGACGGGGC CTGGCTGCGG CAGGGGGCCT TTGCAGGGGA GTCGCCCCGG
GCGCCGGCAA GGGATGCGAT AACCCCGCCG GTTGCGCCCG CACCGCGCGT TGCGCTGCTT
CCGCCCGCGG CTTTCGGCAG TGATCCGATG CTTCACGCGG TCAGCATGGC CGTGGTCGAA
GACATCACGA TCGGATTATG CGCGCTGCGC TCGATGTCGG TTGTTGCCCC TTACACTGCC
GAGCGTATTC GGGATGCGGC GGACAAGGCT GCCTTCCTGG AAAAACATGC CGTCACTTAT
GCGCTCGACT GTCGCATGTC GGACCAGGGG CTGTTTACTC AACTCATCTT CCTGCCGTCC
GACGCGATCG TCTGGGCGGA GCGCTTTTCG ATCTCTCCGG TCGGGCTCCT GCAACAGCGT
CAGGAAATTG CCTTTCACGT GGCCAGCGCG GTAGCGGAAC AGGTGGAGAC CGGGCGTATC
GGGCACCTCG ATTACGTGAC CAATCCGGAC GCCTATTACG CCTATCTCGC CGGCCTGCGA
AACCTTTCGA ACGTCGGCCT TCCCGAAATC CGCAGGGCAC GCCGCGATTT CAAGACGGCG
CTCAGGCACA AGCCGGATTT TGCCCCGGCG CTAAGCGGCA TCGCACGCAC CTATGCGATC
GAGTGGGTCC TGACGGCTCG GGGCGACCAG GAGCTTTTGA CGGTAGCCGA GCGTCACGCC
AAGGAAGCGA TCGAGAGCGA TGAAGAGCTT CCCGGAGCAC ATCGCGAATT CGGCGTCGTC
AGACTTTATC AGGGCGACCT CGATGGAAGT CTTGCGGCGC TGGACCGCGC CGAAAATCTG
AGCCCGCATT ATGCCGACGT GCTGTACAGC CATGCGGACA CGCTCGTCCA TGCCTCCCGC
CCGCGTGAGG CTCTCGACAA GCTCGGCAAG GCGCTTTCGC TTAATCCCCT GGCGCCGGAC
ATGTATTTCT GGAGCGCTGC CGGAGCCAGC TACTTTCTCG AAGAGTATGA GGATGCGATC
GGCTACGTTC AGAAAATGAA GGACAAGTCG CCGGGCGACC GGCTGCTTGC AGCAAGCTGG
GCGATGCTCG GCGATCAGAA GAAAGCGCGG TCATACAAGG TGAAGGCCCT GAGGGCCAAT
CCGACATTCG ATGTCGACAA GTGGCTCGCC GTCGTTCCGA TGAAGGAACA ATGGCAGAAG
GACCTCTATC GCGAGGGGCT GAAAAGAGCG GGGTTTTGA
 
Protein sequence
MAVPHFHLQT FGELRLVDSG GVVVACPERG LLILAYLSIS GQRTVSREKL SMLIWPEREK 
SVAFKNLRST LWRMRHLAAN SGPLTSATEI DVTLGEIACD AAGFEALGSS EEAFRAGLSL
WRDTFLAGAD VPPGRYADWV GEQRSVFTQK LRAALLAGGE RFADRSMLRT ACMHLLELDP
ADTAVRAILE RLTGVGLVIG KGSPDGAWLR QGAFAGESPR APARDAITPP VAPAPRVALL
PPAAFGSDPM LHAVSMAVVE DITIGLCALR SMSVVAPYTA ERIRDAADKA AFLEKHAVTY
ALDCRMSDQG LFTQLIFLPS DAIVWAERFS ISPVGLLQQR QEIAFHVASA VAEQVETGRI
GHLDYVTNPD AYYAYLAGLR NLSNVGLPEI RRARRDFKTA LRHKPDFAPA LSGIARTYAI
EWVLTARGDQ ELLTVAERHA KEAIESDEEL PGAHREFGVV RLYQGDLDGS LAALDRAENL
SPHYADVLYS HADTLVHASR PREALDKLGK ALSLNPLAPD MYFWSAAGAS YFLEEYEDAI
GYVQKMKDKS PGDRLLAASW AMLGDQKKAR SYKVKALRAN PTFDVDKWLA VVPMKEQWQK
DLYREGLKRA GF