Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0031 |
Symbol | |
ID | 5320858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 31282 |
End bp | 34131 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640788962 |
Product | PII uridylyl-transferase |
Protein accession | YP_001325726 |
Protein GI | 150395259 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000404003 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCAGAC ACGAAACCTC CTTTCCCGAA ATCCTGGATG TGGCGGCGCT TCGGGCCAGG TGCGACTTCA TCGCCTCCGC TCATGCCGAA CAGCGCGAAC CAATGCGCCG GGCCTTGCTC GCGGCCTTCA AGGAGGCAAA TATTGCGGGC CGCGCCAAGG CGCGCGAATT GCTCGCCGCT GACGGCGCAG GAATAAAATG CGCTGAGCGC ATCTCGTGGC TTCAGGACCA ACTCATTACG CTGTTGCACG ACTTCGTGCT GAACCAGGTA TTCGACGCCG CTAAAGCCCC TGAGACTTCC CGGATAGCCG TCACGGCTGT CGGCGGTTAC GGGCGCGGAA CGCTTGCACC CGGTTCCGAC ATCGACCTCC TCTTCCTCCT TCCTGCCAAG AAGGCGGTCT GGGCGGAGCC GGCGATCGAG TTCATGCTGT ATATTCTCTG GGACCTTGGC TTCAAGGTCG GCCACGCGAC GCGCACGATC GAGGATTGCA TTCGCCTGTC GCGGGCCGAC ATGACCATCC GGACAGCGAT CCTCGAATGC CGCTATGTCT GCGGTTCCGT CGCCCTGGCA AGCGAGCTCG AAACGCGCTT CGACCATGAG ATCGTCCGCA ATACCGGCCC GGAATTCATT GCCGCCAAGC TCGCCGAACG CGACGAGCGA CATCGCAAGG CGGGCGACAC GCGCTACCTC GTCGAACCGA ACGTCAAGGA AGGCAAGGGC GGCTTGCGCG ATCTGCACAC GCTCTTCTGG ATTTCGAAAT ATTTCTACCG GGTCAAGGAT TCCGCCGATC TCGTCAAGCT CGGCGTGCTT TCGAGGCAGG AGTACAAGCT CTTCCAGAAG GCGGAAGATT TTCTCTGGGC GGTGCGCTGC CATATGCACT TCCTGACCGG CAAGGCTGAG GAACGCCTCT CCTTCGATAT CCAGCGCGAG ATAGCCGAAG CGCTCGGCTA CCACGATCAC CCCGGCCTTT CGGCGGTCGA ACGTTTCATG AAGCATTACT TTCTCGTGGC GAAGGATGTC GGCGACCTGA CGCGCATCTT CTGCTCGGCT CTGGAAGATC AGCAGGCCAA GGACGCCCCC GGTATTTCCG GCGTGATCAG CCGCTTCCGC AATCGTGTCC GCAAAATCCC CGGCACGCTG GATTTCGTCG ACGATGGAGG GCGCATCGCG CTCGCGAGCC CTGACGTTTT CAAGCGCGAC CCTGTGAACC TGCTGCGCAT GTTTCACATC GCCGATATCA ACGGGCTCGA ATTTCACCCG GCGGCGCTGA AGCAGGTGAC GCGCTCCCTC AGCCTGATCA CACCGCATTT GCGAGAGAAC GAGGAAGCGA ACCGGCTTTT CCTGTCTATC CTGACCTCCC GCCGCAATCC GGAACTGATC CTCCGCCGAA TGAACGAGGC GGCGGTTCTT GGCCGCTTCA TTCCGGAATT CGGCAAGATC GTATCGATGA TGCAGTTCAA TATGTATCAC CACTATACCG TGGACGAACA TCTTCTGCGC GCGGTCGACG TTCTCTCCCG CATAGAGCGG GGACTTGAGG AGGAGGCGCA TCCCCTGACG GCGATGCTGA TGCCGGCCAT CGAGGATCGC GAAGCCCTTT ATGTCGCGGT GCTGCTGCAC GACATCGCCA AGGGACGCCC GGAGGATCAT TCGGTGGCCG GCGCAAAGGT CGCCCGCAAG CTTTGCCCGC GTTTCAGGCT TTCGCCGAAA CAGACCGAGA CGGTCGTCTG GCTGGTCGAG GAACATCTGA CCATGTCGAT GGTCGCGCAG ACCCGGGATC TCAACGATCG CAAAACCATC GTCGATTTCG CCGAGCGGGT TCAATCCCTC GAGCGGCTGA AGATGCTGCT CATCCTGACG GTCTGCGATA TCCGCGCGGT CGGACCCGGC GTATGGAACG GCTGGAAGGG GCAGCTGCTG CGGACGCTCT ATTACGAGAC AGAGCTCCTG CTCTCCGGCG GCTTTTCGGA ACTGTCGCGC AAGGAGAGGG CGAAGCATGC CGCCGACATG CTGGAGGAGG CGCTCGCCGA CTGGCCCAAG GAGGAGCGGC AGACCTATGT GCGACTGCAC TACCAGCCCT ACCTCCTGAC CGTAGCGCTC GACGAGCAGG TACGTCATGC GGCCTTCATC CGCGAGGCGG ATGCTGCGGG CAGGACGCTC GCGACCATGG TGCGCACCCA TGACTTCCAC GCCATAACCG AAATCACGGT GCTGTCGCCG GACCATCCGC GCCTGCTGAC CGTCATCGCC GGCGCTTGCG CTGCTGCGGG TGCCAACATC GTCGGCGCCC AGATCCACAC GACTTCGGAC GGCCGGGCGC TGGACACGAT TCTCGTCAAC CGCGAGTTCT CGGTCGCCGA GGACGAGACG CGTCGTGCGG CGAGCATCGG CAAACTGATC GAGGACGTGC TCTCCGGCCG CAAGAAACTG CCCGACGTGA TCGCAAGCCG GACGCGCTCG AAGAAGCGCA GCAGGGCATT CACCGTGACG CCGGAGGTAA CGATCAGCAA CGCGCTGTCG AACAAGTTCA CCGTCATCGA GGTCGAGGGC CTCGACCGGA CGGGTCTTCT CTCCGAAGTG ACCGCGGTCC TGTCAGACCT GTCGCTCGAC ATTGCGTCGG CCCATATCAC CACCTTCGGC GAAAAGGTGA TCGATACATT CTACGTCACC GACCTGGTCG GCTCCAAGAT CACCAGCGAA AACCGGCAGA TGAACATCGC GGCTCGCCTC AAGGCGGTGC TGGCGGGCGA GGTGGACGAA GCCCGCGAGC GCATGCCCTC GGGGATCATC GCGCCGACGC CTGTGCCACG CGCATCCCAT GGTTCCAAAG CGACAAAAGC CGAAACATGA
|
Protein sequence | MARHETSFPE ILDVAALRAR CDFIASAHAE QREPMRRALL AAFKEANIAG RAKARELLAA DGAGIKCAER ISWLQDQLIT LLHDFVLNQV FDAAKAPETS RIAVTAVGGY GRGTLAPGSD IDLLFLLPAK KAVWAEPAIE FMLYILWDLG FKVGHATRTI EDCIRLSRAD MTIRTAILEC RYVCGSVALA SELETRFDHE IVRNTGPEFI AAKLAERDER HRKAGDTRYL VEPNVKEGKG GLRDLHTLFW ISKYFYRVKD SADLVKLGVL SRQEYKLFQK AEDFLWAVRC HMHFLTGKAE ERLSFDIQRE IAEALGYHDH PGLSAVERFM KHYFLVAKDV GDLTRIFCSA LEDQQAKDAP GISGVISRFR NRVRKIPGTL DFVDDGGRIA LASPDVFKRD PVNLLRMFHI ADINGLEFHP AALKQVTRSL SLITPHLREN EEANRLFLSI LTSRRNPELI LRRMNEAAVL GRFIPEFGKI VSMMQFNMYH HYTVDEHLLR AVDVLSRIER GLEEEAHPLT AMLMPAIEDR EALYVAVLLH DIAKGRPEDH SVAGAKVARK LCPRFRLSPK QTETVVWLVE EHLTMSMVAQ TRDLNDRKTI VDFAERVQSL ERLKMLLILT VCDIRAVGPG VWNGWKGQLL RTLYYETELL LSGGFSELSR KERAKHAADM LEEALADWPK EERQTYVRLH YQPYLLTVAL DEQVRHAAFI READAAGRTL ATMVRTHDFH AITEITVLSP DHPRLLTVIA GACAAAGANI VGAQIHTTSD GRALDTILVN REFSVAEDET RRAASIGKLI EDVLSGRKKL PDVIASRTRS KKRSRAFTVT PEVTISNALS NKFTVIEVEG LDRTGLLSEV TAVLSDLSLD IASAHITTFG EKVIDTFYVT DLVGSKITSE NRQMNIAARL KAVLAGEVDE ARERMPSGII APTPVPRASH GSKATKAET
|
| |