Gene Information |
Locus tag | Shewmr4_0829 |
Symbol | |
ID | 4251969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 973516 |
End bp | 975078 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 638117392 |
Product | rhomboid family protein |
Protein accession | YP_732966 |
Protein GI | 113969173 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
| |
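The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 973516
end_bp = 975078
gene_length_bp = 1563
protein_length_aa = 520

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks hold for this record: 975078 - 973516 + 1 = 1563 bp, which is 521 codons, giving 520 amino acids plus one stop codon.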
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.186643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.166239 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTGG CGTTAATCTT AAAAAAACTC CAGTTGATCT ATGTGCCCTT TATACTGCTC TGCCTGCTAT TTGTCTCAGG TTACAGCTTG CTCCATTGGT TGTTGATTAT CGAGTTCCAA CTACTGAGCA TTGACGAATC GATTATCAAT TTTTGGTTGC CGCTGATATT GCCTTGGCCC TTACTGTTTT TCTACTTAAG ACCTCGGCTC AAGTTATTCC GCTTTACCAA GAGTAATAGC CGTTTTATCT ATTTTGTGAT TGCCGTGCTG TTGCTGGCCG TGCCCACTAT CGTGGCGCAG GAATACCTTA ATAGCGCAAC GGGCAAGTTG ACTCAACTCG AGTCCATCAA TGCACTATTG CATCAGCCGC AGACCAAGTA TTACCAATTA AATCAGTTTT ATATCGACAA AAAACATATA GGGGTGCAGC GCAATATTGA GCCCATAGGC AAGGGCAACA GTGAGCTGCG CATGAGCCTG TATATTGCGA TGCCAATTTT CGCTAAACGT AACGAGAGTT GGCGGGTCGG TGCTAAGGCC TTGGCATGGT ATGGCAAAGT TTATGAGAAA ACGATCAGTA ATCGACTCGA ACAGAAAGAG AAAGAAGCGC TATTTCAAGA GTTTATTCAT CATAGCCAGC AAGAGTTTAA TGCCTTAAAC CCTGATGACT TTATCTATCT TGACCGTATC GGCCCGTCAA GCCGCTACAC TCAGTTACTC ACCGCAGCCC AAAAAAGCTC AGTCTACTTC GAGGGTTATC GAACCGTCTT GATGCCAGTG AATCAACCCT TCGAGGCGCG TAATGGTCAT AAACTGGCGT GGATTATCGC TTGGCTAAGT TTTGGGGCCA TTCTGTGGTT GTTGATGAGT CTGATGTTAA AGCTGGATGA GACGCAGGCG TCTAAGGCGG CACAGCTAAG CCTAGAAAAA CCTAAGGCGG AGCTCTATGG TTTTTTAGCC GAGTTTAGGC CTCGGCAGGG CTTTGTTATC ACGCCTATCT TGCTCTATAT CAACAGCTTG ATATTTGTAT TAATGGCCTT TGCCAGTCAA CACTTTATCG CTTTTCCAAA CAGCGTGTTA CTCGATTGGG GGGCAAACCT TCGTCAGTTG GTGCTTGAGC AACAAGTCTG GCGTTTACTC AGTAATGTGT TTTTGCATGG CGGCCTGATG CATTTGGTTT TTAATTTATA TGGGCTGTTT TTTGCGGGGA TGTTTTTGGA GCCGTTGCTG GGTAAATGGC GTTTACTCGG GGTCTATTTA TGTTGTGGCT TAGTGGCGAG CCTGGCCAGC ATAGGCTGGT ATGAGGCGAC AATCAGCATC GGGGCATCGG GAGCCATCAT GGGGCTATTT GGCGTGTTGA TTATCTGGAT ATGGTTGGGC TTGTTGCCCT TGGCGGACAA TATGCCACTC GCCCTCAATC TGGCCCTATT CGTCTCTGCC AGTCTTGTTA TGGGGCTCTT TGGCGGCGTG GACAATGCCG CGCACCTTGG CGGTTTAGGT TGTGGATTAT TGCTTGGGAG CTTATTACGG CCAACGCGAA AGGCGGCCAT ACAGCATAAA TAA
|
Protein sequence | MRVALILKKL QLIYVPFILL CLLFVSGYSL LHWLLIIEFQ LLSIDESIIN FWLPLILPWP LLFFYLRPRL KLFRFTKSNS RFIYFVIAVL LLAVPTIVAQ EYLNSATGKL TQLESINALL HQPQTKYYQL NQFYIDKKHI GVQRNIEPIG KGNSELRMSL YIAMPIFAKR NESWRVGAKA LAWYGKVYEK TISNRLEQKE KEALFQEFIH HSQQEFNALN PDDFIYLDRI GPSSRYTQLL TAAQKSSVYF EGYRTVLMPV NQPFEARNGH KLAWIIAWLS FGAILWLLMS LMLKLDETQA SKAAQLSLEK PKAELYGFLA EFRPRQGFVI TPILLYINSL IFVLMAFASQ HFIAFPNSVL LDWGANLRQL VLEQQVWRLL SNVFLHGGLM HLVFNLYGLF FAGMFLEPLL GKWRLLGVYL CCGLVASLAS IGWYEATISI GASGAIMGLF GVLIIWIWLG LLPLADNMPL ALNLALFVSA SLVMGLFGGV DNAAHLGGLG CGLLLGSLLR PTRKAAIQHK
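The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, and the example input is only the first two blocks of the gene sequence, so its result is not expected to equal the 47% reported for the whole gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCGTGTGG CGTTAATCTT"
seq = clean_sequence(first_blocks)

print(len(seq))         # number of bases after removing spaces
print(seq[:3])          # first codon
print(gc_percent(seq))  # GC percentage of this fragment only
```

Passing the full gene sequence to the same two functions would give a length to compare with the Gene Length field and a GC percentage to compare with the GC content field.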