Gene Smed_0031 details

Gene Information

Locus tag: Smed_0031
Symbol:
ID: 5320858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 31282
End bp: 34131
Gene Length: 2850 bp
Protein Length: 949 aa
Translation table: 11
GC content: 62%
IMG OID: 640788962
Product: PII uridylyl-transferase
Protein accession: YP_001325726
Protein GI: 150395259
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2844] UTP:GlnB (protein PII) uridylyltransferase
TIGRFAM ID: [TIGR01693] [Protein-PII] uridylyltransferase
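
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 31282
end_bp = 34131
reported_gene_length = 2850      # bp
reported_protein_length = 949    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 2850 949
```

Under these assumptions the computed values agree with the reported gene length and protein length.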


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000404003
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCAGAC ACGAAACCTC CTTTCCCGAA ATCCTGGATG TGGCGGCGCT TCGGGCCAGG 
TGCGACTTCA TCGCCTCCGC TCATGCCGAA CAGCGCGAAC CAATGCGCCG GGCCTTGCTC
GCGGCCTTCA AGGAGGCAAA TATTGCGGGC CGCGCCAAGG CGCGCGAATT GCTCGCCGCT
GACGGCGCAG GAATAAAATG CGCTGAGCGC ATCTCGTGGC TTCAGGACCA ACTCATTACG
CTGTTGCACG ACTTCGTGCT GAACCAGGTA TTCGACGCCG CTAAAGCCCC TGAGACTTCC
CGGATAGCCG TCACGGCTGT CGGCGGTTAC GGGCGCGGAA CGCTTGCACC CGGTTCCGAC
ATCGACCTCC TCTTCCTCCT TCCTGCCAAG AAGGCGGTCT GGGCGGAGCC GGCGATCGAG
TTCATGCTGT ATATTCTCTG GGACCTTGGC TTCAAGGTCG GCCACGCGAC GCGCACGATC
GAGGATTGCA TTCGCCTGTC GCGGGCCGAC ATGACCATCC GGACAGCGAT CCTCGAATGC
CGCTATGTCT GCGGTTCCGT CGCCCTGGCA AGCGAGCTCG AAACGCGCTT CGACCATGAG
ATCGTCCGCA ATACCGGCCC GGAATTCATT GCCGCCAAGC TCGCCGAACG CGACGAGCGA
CATCGCAAGG CGGGCGACAC GCGCTACCTC GTCGAACCGA ACGTCAAGGA AGGCAAGGGC
GGCTTGCGCG ATCTGCACAC GCTCTTCTGG ATTTCGAAAT ATTTCTACCG GGTCAAGGAT
TCCGCCGATC TCGTCAAGCT CGGCGTGCTT TCGAGGCAGG AGTACAAGCT CTTCCAGAAG
GCGGAAGATT TTCTCTGGGC GGTGCGCTGC CATATGCACT TCCTGACCGG CAAGGCTGAG
GAACGCCTCT CCTTCGATAT CCAGCGCGAG ATAGCCGAAG CGCTCGGCTA CCACGATCAC
CCCGGCCTTT CGGCGGTCGA ACGTTTCATG AAGCATTACT TTCTCGTGGC GAAGGATGTC
GGCGACCTGA CGCGCATCTT CTGCTCGGCT CTGGAAGATC AGCAGGCCAA GGACGCCCCC
GGTATTTCCG GCGTGATCAG CCGCTTCCGC AATCGTGTCC GCAAAATCCC CGGCACGCTG
GATTTCGTCG ACGATGGAGG GCGCATCGCG CTCGCGAGCC CTGACGTTTT CAAGCGCGAC
CCTGTGAACC TGCTGCGCAT GTTTCACATC GCCGATATCA ACGGGCTCGA ATTTCACCCG
GCGGCGCTGA AGCAGGTGAC GCGCTCCCTC AGCCTGATCA CACCGCATTT GCGAGAGAAC
GAGGAAGCGA ACCGGCTTTT CCTGTCTATC CTGACCTCCC GCCGCAATCC GGAACTGATC
CTCCGCCGAA TGAACGAGGC GGCGGTTCTT GGCCGCTTCA TTCCGGAATT CGGCAAGATC
GTATCGATGA TGCAGTTCAA TATGTATCAC CACTATACCG TGGACGAACA TCTTCTGCGC
GCGGTCGACG TTCTCTCCCG CATAGAGCGG GGACTTGAGG AGGAGGCGCA TCCCCTGACG
GCGATGCTGA TGCCGGCCAT CGAGGATCGC GAAGCCCTTT ATGTCGCGGT GCTGCTGCAC
GACATCGCCA AGGGACGCCC GGAGGATCAT TCGGTGGCCG GCGCAAAGGT CGCCCGCAAG
CTTTGCCCGC GTTTCAGGCT TTCGCCGAAA CAGACCGAGA CGGTCGTCTG GCTGGTCGAG
GAACATCTGA CCATGTCGAT GGTCGCGCAG ACCCGGGATC TCAACGATCG CAAAACCATC
GTCGATTTCG CCGAGCGGGT TCAATCCCTC GAGCGGCTGA AGATGCTGCT CATCCTGACG
GTCTGCGATA TCCGCGCGGT CGGACCCGGC GTATGGAACG GCTGGAAGGG GCAGCTGCTG
CGGACGCTCT ATTACGAGAC AGAGCTCCTG CTCTCCGGCG GCTTTTCGGA ACTGTCGCGC
AAGGAGAGGG CGAAGCATGC CGCCGACATG CTGGAGGAGG CGCTCGCCGA CTGGCCCAAG
GAGGAGCGGC AGACCTATGT GCGACTGCAC TACCAGCCCT ACCTCCTGAC CGTAGCGCTC
GACGAGCAGG TACGTCATGC GGCCTTCATC CGCGAGGCGG ATGCTGCGGG CAGGACGCTC
GCGACCATGG TGCGCACCCA TGACTTCCAC GCCATAACCG AAATCACGGT GCTGTCGCCG
GACCATCCGC GCCTGCTGAC CGTCATCGCC GGCGCTTGCG CTGCTGCGGG TGCCAACATC
GTCGGCGCCC AGATCCACAC GACTTCGGAC GGCCGGGCGC TGGACACGAT TCTCGTCAAC
CGCGAGTTCT CGGTCGCCGA GGACGAGACG CGTCGTGCGG CGAGCATCGG CAAACTGATC
GAGGACGTGC TCTCCGGCCG CAAGAAACTG CCCGACGTGA TCGCAAGCCG GACGCGCTCG
AAGAAGCGCA GCAGGGCATT CACCGTGACG CCGGAGGTAA CGATCAGCAA CGCGCTGTCG
AACAAGTTCA CCGTCATCGA GGTCGAGGGC CTCGACCGGA CGGGTCTTCT CTCCGAAGTG
ACCGCGGTCC TGTCAGACCT GTCGCTCGAC ATTGCGTCGG CCCATATCAC CACCTTCGGC
GAAAAGGTGA TCGATACATT CTACGTCACC GACCTGGTCG GCTCCAAGAT CACCAGCGAA
AACCGGCAGA TGAACATCGC GGCTCGCCTC AAGGCGGTGC TGGCGGGCGA GGTGGACGAA
GCCCGCGAGC GCATGCCCTC GGGGATCATC GCGCCGACGC CTGTGCCACG CGCATCCCAT
GGTTCCAAAG CGACAAAAGC CGAAACATGA
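
A GC content figure such as the one reported for this gene can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is given as plain text in space-separated blocks of ten bases; `gc_content` is a hypothetical helper, and the example uses only the first row of the gene sequence, so its result describes that row and not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence, copied as printed.
first_row = "ATGGCCAGAC ACGAAACCTC CTTTCCCGAA ATCCTGGATG TGGCGGCGCT TCGGGCCAGG"
print(f"{gc_content(first_row):.0%}")
```

Passing the full 2850 bp sequence to the same function would give the gene-wide value.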
 
Protein sequence
MARHETSFPE ILDVAALRAR CDFIASAHAE QREPMRRALL AAFKEANIAG RAKARELLAA 
DGAGIKCAER ISWLQDQLIT LLHDFVLNQV FDAAKAPETS RIAVTAVGGY GRGTLAPGSD
IDLLFLLPAK KAVWAEPAIE FMLYILWDLG FKVGHATRTI EDCIRLSRAD MTIRTAILEC
RYVCGSVALA SELETRFDHE IVRNTGPEFI AAKLAERDER HRKAGDTRYL VEPNVKEGKG
GLRDLHTLFW ISKYFYRVKD SADLVKLGVL SRQEYKLFQK AEDFLWAVRC HMHFLTGKAE
ERLSFDIQRE IAEALGYHDH PGLSAVERFM KHYFLVAKDV GDLTRIFCSA LEDQQAKDAP
GISGVISRFR NRVRKIPGTL DFVDDGGRIA LASPDVFKRD PVNLLRMFHI ADINGLEFHP
AALKQVTRSL SLITPHLREN EEANRLFLSI LTSRRNPELI LRRMNEAAVL GRFIPEFGKI
VSMMQFNMYH HYTVDEHLLR AVDVLSRIER GLEEEAHPLT AMLMPAIEDR EALYVAVLLH
DIAKGRPEDH SVAGAKVARK LCPRFRLSPK QTETVVWLVE EHLTMSMVAQ TRDLNDRKTI
VDFAERVQSL ERLKMLLILT VCDIRAVGPG VWNGWKGQLL RTLYYETELL LSGGFSELSR
KERAKHAADM LEEALADWPK EERQTYVRLH YQPYLLTVAL DEQVRHAAFI READAAGRTL
ATMVRTHDFH AITEITVLSP DHPRLLTVIA GACAAAGANI VGAQIHTTSD GRALDTILVN
REFSVAEDET RRAASIGKLI EDVLSGRKKL PDVIASRTRS KKRSRAFTVT PEVTISNALS
NKFTVIEVEG LDRTGLLSEV TAVLSDLSLD IASAHITTFG EKVIDTFYVT DLVGSKITSE
NRQMNIAARL KAVLAGEVDE ARERMPSGII APTPVPRASH GSKATKAET
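
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons; table 11 also permits alternative start codons, which this sketch does not handle and which are not needed here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, then third base cycling over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as printed.
first_codons = "ATGGCCAGAC ACGAAACCTC CTTTCCCGAA"
print(translate(first_codons))  # MARHETSFPE
```

The result matches the first ten residues of the protein sequence listed above.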