Gene Shewmr4_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0829 
Symbol 
ID4251969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp973516 
End bp975078 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content47% 
IMG OID638117392 
Productrhomboid family protein 
Protein accessionYP_732966 
Protein GI113969173 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.186643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.166239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGTGG CGTTAATCTT AAAAAAACTC CAGTTGATCT ATGTGCCCTT TATACTGCTC 
TGCCTGCTAT TTGTCTCAGG TTACAGCTTG CTCCATTGGT TGTTGATTAT CGAGTTCCAA
CTACTGAGCA TTGACGAATC GATTATCAAT TTTTGGTTGC CGCTGATATT GCCTTGGCCC
TTACTGTTTT TCTACTTAAG ACCTCGGCTC AAGTTATTCC GCTTTACCAA GAGTAATAGC
CGTTTTATCT ATTTTGTGAT TGCCGTGCTG TTGCTGGCCG TGCCCACTAT CGTGGCGCAG
GAATACCTTA ATAGCGCAAC GGGCAAGTTG ACTCAACTCG AGTCCATCAA TGCACTATTG
CATCAGCCGC AGACCAAGTA TTACCAATTA AATCAGTTTT ATATCGACAA AAAACATATA
GGGGTGCAGC GCAATATTGA GCCCATAGGC AAGGGCAACA GTGAGCTGCG CATGAGCCTG
TATATTGCGA TGCCAATTTT CGCTAAACGT AACGAGAGTT GGCGGGTCGG TGCTAAGGCC
TTGGCATGGT ATGGCAAAGT TTATGAGAAA ACGATCAGTA ATCGACTCGA ACAGAAAGAG
AAAGAAGCGC TATTTCAAGA GTTTATTCAT CATAGCCAGC AAGAGTTTAA TGCCTTAAAC
CCTGATGACT TTATCTATCT TGACCGTATC GGCCCGTCAA GCCGCTACAC TCAGTTACTC
ACCGCAGCCC AAAAAAGCTC AGTCTACTTC GAGGGTTATC GAACCGTCTT GATGCCAGTG
AATCAACCCT TCGAGGCGCG TAATGGTCAT AAACTGGCGT GGATTATCGC TTGGCTAAGT
TTTGGGGCCA TTCTGTGGTT GTTGATGAGT CTGATGTTAA AGCTGGATGA GACGCAGGCG
TCTAAGGCGG CACAGCTAAG CCTAGAAAAA CCTAAGGCGG AGCTCTATGG TTTTTTAGCC
GAGTTTAGGC CTCGGCAGGG CTTTGTTATC ACGCCTATCT TGCTCTATAT CAACAGCTTG
ATATTTGTAT TAATGGCCTT TGCCAGTCAA CACTTTATCG CTTTTCCAAA CAGCGTGTTA
CTCGATTGGG GGGCAAACCT TCGTCAGTTG GTGCTTGAGC AACAAGTCTG GCGTTTACTC
AGTAATGTGT TTTTGCATGG CGGCCTGATG CATTTGGTTT TTAATTTATA TGGGCTGTTT
TTTGCGGGGA TGTTTTTGGA GCCGTTGCTG GGTAAATGGC GTTTACTCGG GGTCTATTTA
TGTTGTGGCT TAGTGGCGAG CCTGGCCAGC ATAGGCTGGT ATGAGGCGAC AATCAGCATC
GGGGCATCGG GAGCCATCAT GGGGCTATTT GGCGTGTTGA TTATCTGGAT ATGGTTGGGC
TTGTTGCCCT TGGCGGACAA TATGCCACTC GCCCTCAATC TGGCCCTATT CGTCTCTGCC
AGTCTTGTTA TGGGGCTCTT TGGCGGCGTG GACAATGCCG CGCACCTTGG CGGTTTAGGT
TGTGGATTAT TGCTTGGGAG CTTATTACGG CCAACGCGAA AGGCGGCCAT ACAGCATAAA
TAA
 
Protein sequence
MRVALILKKL QLIYVPFILL CLLFVSGYSL LHWLLIIEFQ LLSIDESIIN FWLPLILPWP 
LLFFYLRPRL KLFRFTKSNS RFIYFVIAVL LLAVPTIVAQ EYLNSATGKL TQLESINALL
HQPQTKYYQL NQFYIDKKHI GVQRNIEPIG KGNSELRMSL YIAMPIFAKR NESWRVGAKA
LAWYGKVYEK TISNRLEQKE KEALFQEFIH HSQQEFNALN PDDFIYLDRI GPSSRYTQLL
TAAQKSSVYF EGYRTVLMPV NQPFEARNGH KLAWIIAWLS FGAILWLLMS LMLKLDETQA
SKAAQLSLEK PKAELYGFLA EFRPRQGFVI TPILLYINSL IFVLMAFASQ HFIAFPNSVL
LDWGANLRQL VLEQQVWRLL SNVFLHGGLM HLVFNLYGLF FAGMFLEPLL GKWRLLGVYL
CCGLVASLAS IGWYEATISI GASGAIMGLF GVLIIWIWLG LLPLADNMPL ALNLALFVSA
SLVMGLFGGV DNAAHLGGLG CGLLLGSLLR PTRKAAIQHK