Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3890 |
Symbol | |
ID | 5318684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 347980 |
End bp | 349110 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775702 |
Product | hypothetical protein |
Protein accession | YP_001312635 |
Protein GI | 150376039 |
COG category | [S] Function unknown |
COG ID | [COG4641] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0848246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTTC TTTTCTACAC CCACTCATTG ATTTCCGATT GGAACCACGG CAATGCGCAT TTTCTGCGCG GTGTCATGCG TGAAATCACA CGCCGCGGCC ATCTGGCCGT GGCGCTTGAG CCGGGCGATT CCTGGAGCCG CCGCAATCTG ATTGCCGATC AGGGCATCGG ACCGATCGCC GCCTTTCGCA AACACTTTCC CGATTTGCAG GTCGCGATCT ACGAGTCCGA TTTCGACCAT GAAGCGGCAG TCGCCGAGGC TGACATCGTC ATTGTCCATG AGTGGACCGA TCCCGCTCTG ATCGCCGAAC TCGGCCGCAT CCGCCTGAAG GGTGGACGCT TCACGCTGGC GTTCCACGAC ACACATCACC GCGCCGTCAG CGCCAAACGG GACATCGCCA GGCTGGACCT TTCCGGTTAC GACTTCGTTC TGGCTTTCGG TGAGGCGCTG CGCGAGCGCT ATCTGCAGGC GGGATGGGGA AGGCACGTCC ATACCTGGCA TGAGGCTGCC GACACTTCGC TGTTCCATCC GATGCCGGAG GTGGAAAAGC GTGGCGAGCT CATCTGGATC GGCAATTGGG GCGATGACGA ACGCAGCAGC GAAATCATGT CCTTCCTCGT CGAACCGGCA AAAAAGCTGA AACTGAGGGC GACGGTCCGA GGCGTAAGAT ATCCCGACAC GGCGCTCAGG GCCTTGCGCG CCGCCGAAAT CGACTATGGC GGCTGGCTCG CGAACGCAGC CGTCCCGCGG GCCTTCGCGG AGCACCGGGT CACCATGCAC ATTCCGCGCC GACCCTACGT GGAGGCACTC CCGGGCATAC CCACGATCCG CGTTTTCGAG GCTCTTTCCT GCGGGATTCC GCTGGTCTCG GCACCATGGA CGGATGCCGA AGGGCTATTC CGGCCCGGCA AGGATTTCTG CATCGCCAGG GACGGCAAAG AGATGGCGCG ACTTCTGCGT CAACTTCTCG CGGAACCAGC CTTCGCAACG GAGATGGCCG CTTCCGGACT GGAGACCGTC CGAGCACGCC ACACATGCGG CCATCGCGTC GACGAGCTTC TCTCCATTCT GGCCGCCTAC ATGCCGCACA GCAACGTGGA ACGCACAGTC ACCGAGGAGG TCCAGCTATG A
|
Protein sequence | MRFLFYTHSL ISDWNHGNAH FLRGVMREIT RRGHLAVALE PGDSWSRRNL IADQGIGPIA AFRKHFPDLQ VAIYESDFDH EAAVAEADIV IVHEWTDPAL IAELGRIRLK GGRFTLAFHD THHRAVSAKR DIARLDLSGY DFVLAFGEAL RERYLQAGWG RHVHTWHEAA DTSLFHPMPE VEKRGELIWI GNWGDDERSS EIMSFLVEPA KKLKLRATVR GVRYPDTALR ALRAAEIDYG GWLANAAVPR AFAEHRVTMH IPRRPYVEAL PGIPTIRVFE ALSCGIPLVS APWTDAEGLF RPGKDFCIAR DGKEMARLLR QLLAEPAFAT EMAASGLETV RARHTCGHRV DELLSILAAY MPHSNVERTV TEEVQL
|
| |