Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4881 |
Symbol | |
ID | 5318043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1388942 |
End bp | 1390759 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776666 |
Product | hypothetical protein |
Protein accession | YP_001313598 |
Protein GI | 150377002 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.59857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.792267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACCG CTGTGACACA CGGATCCGGG GGCGCGGAGC CCCTGCCCTA TCCGTTGCCG GGGCCGGACG AGATTCGTGC GCAGCTTGCA CGGGTCATTT CCAGTCCTGA ATTTCCGAAA GCCGGGCGTG GCGCCGCGTT TCTCACCTTC GTCGTAGAAG AAGCGATAGC TGGCCGTGCA CATCGCCTGA AGGGCTATAC CGTCGCGATC GAGGTATTCA AGCGCAGCGA CACCTTCACC CAGGACGATC CAGTTGTCCG CATCGAAGCG GGGCGCCTGA GACGCACGCT CGAGCGCTAT TATCTGGTCG CCGGACAGGG CGATCCCATC AGGATAGATA TTCCGAAAGG CGGCTATGCT CCGTCTTTCG CATGGAACGA TGCGGCGGTC ACGGGTCGCA AGGAAGCCGC GGCAGAAAAC TCCGACGAGC GGCCGGCGCA AGGCAGGCTG CGGACCCCAT TACCGATCCT TGCCGGCCTC GTCGGTGTCG TAATTACTGC GGTGCTGACG TATTGGGCAA CCGACCGTTT CATAGCCGGT ACGAGACAGT CCGGCTTCAG CGCGGTGCCT GACGAGCCGA CCCTGGTAAT AGCGCCCTTC GCCAATCTGG GTGATGGCCC ACAGTCCGAG CTCTATACCG TCGGATTGAC GGAGGAACTG CTGACCGCGC TGCCCCGGTT CAAAGAAATC AGGGTGTTCG GGCGGGAAAC GTCGAAATCG CTTGCACCGG AAGTGCAGGC GTCCGAGGTG CGTGGCGGGC TCGGCGCTCG CTACCTGCTC GCTGGCGGCG TCAGGGTGTC AGGCCCACGG GTTCGGGTGA CGGCACGGCT TGTCGACACG TCTGACGGCG CGATACTCTG GTCGCAGAAT TACGATGACG ATCTGACCAC GCGAGAACTT TTCGCCATCC AGTCCGACGT TGCAAGCAAG GTCGCCACGG CCGTTGCCCA GCCCTACGGC ATCATTGCGC AGGCTGTGAC GGCCAAGCCT CCGCCGGACG ACCTTGGCGT CTACGACTGC ACCCTTCGAT TTTATGCCTA TCGCGCCGAA CTCAGTCCGG AAGCCCATCT GAGTGCACGC GCCTGCCTGG AAAGCGCCGT CGCCCGCTAC CCGGCCTATG CGACGGCATG GGCCATGTTG TCCATCGCCT ATCTCGATGA AGACCGCTTC CGATATAACC TGAACCCGGC TCAGCGAGAG CCAATGGAAC GGGCGCTCCA CGCTGCCCGC CGTGCAATCG AGCTTGAACC GGACAATACG CGCGCGCTTC AATCCCTAAT GACCGCGTTG TTCTTCAATC AGCAACTTGC GGAGGCGCTG GAGGCCGGCG AAGAGGCACT GGCGACCAAT CCGAACGATA CGGAACTGCT GGCAGAGTTC GGAACTCGGC TTGCACTGTG CGGGCAATGG AAGCGGGGCG CGGATTTGCT CGACAGGGCG CTTGCGCTGA ATCCTGGCGG TGCGGCCTAT TATCACGGGA CACGCGCGCT TGCCGCCTAC ATGCTTGACG ATCATGCAAA GGCGGTGAGC CTGATCCGAA AAGCCGACCT GCAGAAGTTT CCTCTGTTCC ATATCGTAGC GGCCGCAATC TATGCCGAGG CTGGACTGAT CGACGATGCC CGGCGCGAGG GAACGATCTT CACGAAGATA CGACCGGAAT ACATCCGCGA CATCATCAAC GAGAACCGGA AGCGCAACAT TCAGCCGAAA GACAGCATTC GCATGATCGC CTCGCTTCGC AAGGCTGGAG TGCCCGTGCC GGATGCGGCC GGCATCGAAG CCGAGCTTCT GAAATCCGCT GTGACCGACC GTCGCTGA
|
Protein sequence | MVTAVTHGSG GAEPLPYPLP GPDEIRAQLA RVISSPEFPK AGRGAAFLTF VVEEAIAGRA HRLKGYTVAI EVFKRSDTFT QDDPVVRIEA GRLRRTLERY YLVAGQGDPI RIDIPKGGYA PSFAWNDAAV TGRKEAAAEN SDERPAQGRL RTPLPILAGL VGVVITAVLT YWATDRFIAG TRQSGFSAVP DEPTLVIAPF ANLGDGPQSE LYTVGLTEEL LTALPRFKEI RVFGRETSKS LAPEVQASEV RGGLGARYLL AGGVRVSGPR VRVTARLVDT SDGAILWSQN YDDDLTTREL FAIQSDVASK VATAVAQPYG IIAQAVTAKP PPDDLGVYDC TLRFYAYRAE LSPEAHLSAR ACLESAVARY PAYATAWAML SIAYLDEDRF RYNLNPAQRE PMERALHAAR RAIELEPDNT RALQSLMTAL FFNQQLAEAL EAGEEALATN PNDTELLAEF GTRLALCGQW KRGADLLDRA LALNPGGAAY YHGTRALAAY MLDDHAKAVS LIRKADLQKF PLFHIVAAAI YAEAGLIDDA RREGTIFTKI RPEYIRDIIN ENRKRNIQPK DSIRMIASLR KAGVPVPDAA GIEAELLKSA VTDRR
|
| |