Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1575 |
Symbol | |
ID | 5322433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1672262 |
End bp | 1674157 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790519 |
Product | hypothetical protein |
Protein accession | YP_001327251 |
Protein GI | 150396784 |
COG category | [S] Function unknown |
COG ID | [COG2989] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00305451 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.645388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTTA TCAAGTCGGC GGCGGTTGTC GCGCTCATGA GCGGGTGTGC GGTTGCAGCC ATGGATGTGC AGCCGGCGTC CGCGCTCACC TTCATGGACT TCATCCGCGG AGGCAAGAAG CGCGACGTGC AGGAACAGGT GCAGCCCCGC CAGCAGCGTC CGGGCGTGGA TATGTTTGCT CCGCAACAAT TGGAGCAGAG GGCAGCAAAG CCGCCGCGCA TCACCGGTCC TCGGTACTAC ACCTATAAGG CGGATGCGCT CCGCCGGATT GCAACGGACC GGTTGCTCGA TCCCGTGGTC ACCGGCTCGG TGGCCAACGG AGCGCTGCCG CCGATGGTGC GTACGCCGCT TTCCGAGGTG CGCCAGTTTC TGCCGCAGAT CGATGTGCGC GCCCCGGGCG AGGTCGCAAA AGCGATCGAG ACCTTCTACG GCACACGAAC TGATTTCCTC TGGATCGATG GCGTGGGCGT CAACGGCCGC GCCAAGGCGG CGCTTGCTGC CCTGTCCGCC GCCGCGAAGG TCGGCCTGGA CCCGCGCGAT TATGCCGTCG CTGTCCCGCC GGAGGATTTC GATCGCGGCG ACATGATCGC CCGCGAAAAG GATCTGGTGC GGTTCGAGAT CGCGTTGTCC GCAGCCATGC TCACCTATGT TCAGGACACG GTGCGCGGCC GGATCGATCC GAACCGGATT TCCGGCTATC ACGACTTCAA ACGGAAGAGC GTCGATCTTC TTGCCTTTCT GGAGAAGATC GAGGCATCCG GTGACGTTGC CGCGCTCATC GAAAGCCGGA ATCCCAAGAG CGCTCAGTTC GAGGCATTGA GGCAGGAACT CGAACGACTT CGGACCCAGG TCGAAGCGAC GCCTCGGGTC GAGATCGCAC CGGGGACGCT GTTGAAGCCC GGCGAGAGCA ATCCGGAACT CGCCAATGTG ATCGCCGGGA TCAAGCTTAA AGCCTCGGAT GCGCTGAAGA CGGAGCATGC AGTCGTGCTC GCCGCCTATG GCGGGACGCC GGATTACACG CCCGAGCTTG TGCCTGTCGT CGAAGCCTTC CAGAAGGAAC ATGGCCTCAA GGCGGACGGC ATTGTCGGCC AGGCGTCGAT ACGCGCTATC ACCGGCGGTG ACACGATCGG GGAGAAGATC AGAAAAATCG AAATCGCGAT GGAGCAGGCG CGCTGGTTGC CCGACGGGCT CGGCGACCGC TATGTCTTCA TCAACCAGCC GGCCTTCACC GCCTCCTATA CGGAGCAGGG CGCCGAGCAG TTCTCCATGC GTGTCGTCGT CGGCTCCAAG GCCAATCAGA CCTACTTCTT CCAGGACGAG ATTCAGACGG TCGAAGTCAA TCCATATTGG GGCGTTCCGC AGTCGATCAT CGTCAATGAG ATGCTGCCGA AGCTCAGAAG CGACCCCGGT TACCTTGACC GCATGGGCTA CCAGGTGGAA GTCGGTGGCC GGGTCGTTTC TTCGACCGCT GTGAACTGGT ACGGCTCGAC GAACTCCATC GCGGTGCGCC AACCGCCGAG TAGCGACAAT GCATTGGGTG AACTCAAGAT TCTATTCCCC AACGCCCACG CGATCTACAT GCACGATACG CCGTCGAAGA GCTTTTTCAA GCGCGATCAG CGGGCTCTCA GCCATGGCTG CGTGCGCCTC GCCGACCCGC GCCGCATGGC GGCGGCCGTA CTTGGCGTGA GCGTCGACGA GGTCGGCGAA GAGATATCCG GCGGTCGCAA CAAGGCACTC CCGGTATCCG CCAAGGTCCC GGTCTATGTC TCCTACTTCA CCGCCTGGCC GAACAAGGAC GGAACCGTCG AGTATTTCAA TGACGTTTAC GAGCGCGACA TGTACGTGAA CCGCGCCTTC GAGGCGACGC GCAAGGCGCG GCACGCGGAA GGGTGA
|
Protein sequence | MRVIKSAAVV ALMSGCAVAA MDVQPASALT FMDFIRGGKK RDVQEQVQPR QQRPGVDMFA PQQLEQRAAK PPRITGPRYY TYKADALRRI ATDRLLDPVV TGSVANGALP PMVRTPLSEV RQFLPQIDVR APGEVAKAIE TFYGTRTDFL WIDGVGVNGR AKAALAALSA AAKVGLDPRD YAVAVPPEDF DRGDMIAREK DLVRFEIALS AAMLTYVQDT VRGRIDPNRI SGYHDFKRKS VDLLAFLEKI EASGDVAALI ESRNPKSAQF EALRQELERL RTQVEATPRV EIAPGTLLKP GESNPELANV IAGIKLKASD ALKTEHAVVL AAYGGTPDYT PELVPVVEAF QKEHGLKADG IVGQASIRAI TGGDTIGEKI RKIEIAMEQA RWLPDGLGDR YVFINQPAFT ASYTEQGAEQ FSMRVVVGSK ANQTYFFQDE IQTVEVNPYW GVPQSIIVNE MLPKLRSDPG YLDRMGYQVE VGGRVVSSTA VNWYGSTNSI AVRQPPSSDN ALGELKILFP NAHAIYMHDT PSKSFFKRDQ RALSHGCVRL ADPRRMAAAV LGVSVDEVGE EISGGRNKAL PVSAKVPVYV SYFTAWPNKD GTVEYFNDVY ERDMYVNRAF EATRKARHAE G
|
| |