Gene Information |
Locus tag | Shewmr4_0929 |
Symbol | |
ID | 4251860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 1083641 |
End bp | 1086787 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638117492 |
Product | hypothetical protein |
Protein accession | YP_733066 |
Protein GI | 113969273 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
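The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1083641
END_BP = 1086787
GENE_LENGTH_BP = 3147
PROTEIN_LENGTH_AA = 1048


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons minus stop match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 3147 bp span corresponds to 1049 codons, of which 1048 encode residues.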
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTAA AGGATGACGT TGACCAACTC AAGGCAGAAT TGGCGCAGTT AAAACTGCAG CACCTGTCGC AGCAATCTTC ACTCAGCCGC CAACTGGCAG AATTCTCGGC CAAGCTCGAC AGCATCAGTC TTCAGCTTGA GCAGCAAGAC AATGGCCCTC TCACCACCCC ATCCACGAGT GTGGACGATA TCACGGCGCC ATTCGGCACG ACGATTGATT CAGGGGATGA GCGCGTTTCG CAGCAATCTG AAGAACTGGC CTCCCATATC CAGCCACAAA TGCCCGAAGT TAATTCTTGG CAAGACGATC CTTGGCAGCG CCACGCAAAG GCTCCCTCAA AAGCCATGCA TTCACCTGAG GATTCGCAGC AGGATGATTC GCAGAAAGAT AAGGCCTTTT CTCAGCAAGC CGGACTTGAA GTGGGTGTGC AGGCCGCAAG TCAGTTAGAG ACCTTATTAT CCCAAGGCGT TACCGCCATC ATGGCGCCAT TTGCGGCCAT CTCTGAACAG GTCAAAGGCT TCTATCAACA TTATCAAGCC AAGGGCTTAG GCCCAGTGTT TTTAATGACG GTTGCGGGCA TCATCACCCT GACCCTAGGC TTTGGTTATC TATTGCAATA TTCCATCAAT CATTGGTTTT CTGAACTCGG TAAAGCCCTG CTGGGTTTTG GCTGCGCCAA TGCGATTATT GCGGGCGGGA TTTTTATCCG CCAAAAAAGG GCGGGCATGG CCGACTTTGG CTCTGGAATT GTCGGATTAG GCCTCATCCT AAACTATCTC TGCGCCTATT TTATTGGCCC CTACTTTGAG ATTATCCCCA ATAGTGCCAG CTTTATCCTG CTGCTATTGA TTACTTTAGC GGGCTATGGC CTGTCGATGC GACTCGATGC CAAGGTCATT GCCGTCATTG CCTTAGTCGG CGGCTCGATG GCACCTATGA TGCTGCTATC CCAAAGCTAT GCGCCGTTAC TGTATATTCC CTATCTGCTA CTGATTGGTG CGGGCGCCCT CGCCCAAAGC CGTAAATTAC ACTGGCCATT ACTGCTGGAA ATCACCGCAT TCCTGCATAT CGCCTGCATC GAAGCCTTCA GCTATTTTGT CGATTTACCG CTCACCCAAT GGGGTGGCGG CGCAGTGCTA GCCCTAGTAA GCATTAACGC ATTGTTTTAT CTCTATGGTC TATATGGGTT TATGGTGAAT CCTTCAACGA CCTCACTCAG CCATAGAGCC CTAGTTCTAC CGATAGCCCT GCTGGCCTTT GTAATCTTCG AACTCACCCA ATTTACGCCA TTTGCCGGGG AGATTTTTAT CGCCAACGCA CTCATCTGTG CGGGCTTATA CCTTAAGCTC AAGTCTCGAC TCACCCCGTC CCAGAGCAGT CTCTTATTAG TCTTTGGCGG TAGTTTTGCC GCTTTTGCCG CCCTTTATCT ATTAAGCCAT GATTTCCTAG GATTAGTGCT TATCCTCGAA GCATTGCTCC TGCTCTGGAT TGGCCTTAAG GAGAAACTGA TTTCGGTGCG CGCCGAAGCC TATGTGTTAT TACTGACAGG GCTGATGCTC AATTTATATG CGGTGCTGAC AGGTTTTTCC TTTCTCGGAC TTGGCGAGCA AACGCCCATG GCGGCACTTG GGTTCCCGCT GCTCGCCTTA GCCTTAAGCG CGGCGGCCCT GCTATTTGTT ATCCGCCAAA TAGGCAATGA TGAGTTGCTA CTGTCACGGC TCGAGCGGCA ACTGAGCTAT TTACTCAAAG AGTTACTCAG CGGTTTTTAT GCGGCGACCC TTTTGCTGGC GGCTTTCCTT 
GTCAGCAGCG ATTATTATGT CGCCATCATC CCGCTCGTGA GTCTATTGCT CTTGTATTTA AGTGGTAAAG ACAAGTTGAT GGTCAGCGAG TTTGCCGCTT GGCTATTGCT GCTGCCACTC TTGTTTACTG TGGTTGAAGG CGTGACCCTT ACTGACAGCC TGAGCTTTAG TGCGCAACCA CTGATGGCCA AGCTGGCACG CATCGAATTG TTCGCCGCCC TGCTACTGGC CCACTATTGG TATCGTAAGC ATTATCCAAG TTCGCTGTTC GCCCAAGCCG CCAAGCAAAT ACAACTCGGC TGTTATCTGC TACTGCCGCT GATGTTATTG CCTAAGGTAA TGCGCAGTTA TTGGGAATTT ACCGCGGTTG CACTCTGGCT CAGCAGTTTG TTCAGCCTCG CCCTTGCCTA TGTTATCAAG CATAAAGCCC TTGAAATTGA AACCAAGATT TTGACTTGGC TGGCGATACT CACCACCGCC AGTTTGTGCC TGTTTGAAGT TTGGCAAGGG CTTATCGCAC TGCTAATTGG CGTCCTAGTC ATGGGCTTTG TACTGCTGCG TTATCGCAAG CTTGCCAAAC CTTGGGCATC ACTGCTACCC CTGCAATGGC AACTTAGCCC TTATTATTTC GCGCTAGTGC TGGCCGTATT AGTCTATGGA TTTAACCGCG TCGATGTTGT GGCATGGTCG ATGACGGCTT TAGCCCTGAG TGGCTACTTT GCCCTCTTGA TCCAAAGACG CACCCAACAT GGCTCAGTGC AAACGGGATT GATGGCGCAC ATAGCCATAG AAACCCAAGC CGCGGTGCGC TCAAGCTACA GCATGGCCTA CGGGCTGTGT CTGGGGCTCG CGCTATTGCC GATAATGCTG CACTTTGAGT TGTCGCTAAG ACTCGATCTG AATAACGTGT TATTAATCCT CTGTGAATTG ATGGCATTAG CCCTGCTCGC GCTACTTATC TTACGCCGCG GCGTCGCTAT TCGTTTGCAT CAGCGCTTCT TACCGCTGCA GTGGCTCAAA TGGGGCTGGC ACGGGTTGCT CGCACTGAGT TATCTCACCT GGAGTTATAC GTTCGATAAC ATTATTGCCG CACCACTCAG TGCCATTCTG CTCGTCGTGC ATGGCAGCGT GTTGATGTTT ATCAGCCTAA AACCCCAGCA TGCGGATATG ATCCGCCTCG CCGCCGGACT CTTTACCTTA GCCACACTCA AAGTGTTATT GCTGGATATG GCATCCTTCG AGCTAGTACA AAAGGTTATC GCCTTTATGG TGATTGGGGT GATCTTACTG ACCGTATCTT ACTTCTATCA AAAGGCGCGT AATCGCCTAC AAGAAGCAAA CCAATAA
|
Protein sequence | MPLKDDVDQL KAELAQLKLQ HLSQQSSLSR QLAEFSAKLD SISLQLEQQD NGPLTTPSTS VDDITAPFGT TIDSGDERVS QQSEELASHI QPQMPEVNSW QDDPWQRHAK APSKAMHSPE DSQQDDSQKD KAFSQQAGLE VGVQAASQLE TLLSQGVTAI MAPFAAISEQ VKGFYQHYQA KGLGPVFLMT VAGIITLTLG FGYLLQYSIN HWFSELGKAL LGFGCANAII AGGIFIRQKR AGMADFGSGI VGLGLILNYL CAYFIGPYFE IIPNSASFIL LLLITLAGYG LSMRLDAKVI AVIALVGGSM APMMLLSQSY APLLYIPYLL LIGAGALAQS RKLHWPLLLE ITAFLHIACI EAFSYFVDLP LTQWGGGAVL ALVSINALFY LYGLYGFMVN PSTTSLSHRA LVLPIALLAF VIFELTQFTP FAGEIFIANA LICAGLYLKL KSRLTPSQSS LLLVFGGSFA AFAALYLLSH DFLGLVLILE ALLLLWIGLK EKLISVRAEA YVLLLTGLML NLYAVLTGFS FLGLGEQTPM AALGFPLLAL ALSAAALLFV IRQIGNDELL LSRLERQLSY LLKELLSGFY AATLLLAAFL VSSDYYVAII PLVSLLLLYL SGKDKLMVSE FAAWLLLLPL LFTVVEGVTL TDSLSFSAQP LMAKLARIEL FAALLLAHYW YRKHYPSSLF AQAAKQIQLG CYLLLPLMLL PKVMRSYWEF TAVALWLSSL FSLALAYVIK HKALEIETKI LTWLAILTTA SLCLFEVWQG LIALLIGVLV MGFVLLRYRK LAKPWASLLP LQWQLSPYYF ALVLAVLVYG FNRVDVVAWS MTALALSGYF ALLIQRRTQH GSVQTGLMAH IAIETQAAVR SSYSMAYGLC LGLALLPIML HFELSLRLDL NNVLLILCEL MALALLALLI LRRGVAIRLH QRFLPLQWLK WGWHGLLALS YLTWSYTFDN IIAAPLSAIL LVVHGSVLMF ISLKPQHADM IRLAAGLFTL ATLKVLLLDM ASFELVQKVI AFMVIGVILL TVSYFYQKAR NRLQEANQ
|
| |
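The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is already shown in coding orientation (it begins with ATG although the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first and last few 10-base blocks of the record are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three and last three blocks of the gene sequence in this record.
head = "ATGCCGTTAA AGGATGACGT TGACCAACTC"
tail = "AATCGCCTAC AAGAAGCAAA CCAATAA"

if __name__ == "__main__":
    print(translate(head))  # compare with the start of the protein sequence
    print(translate(tail))  # compare with the end of the protein sequence
```

Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field. The tail fragment is in frame here because it is 27 bases long and the full gene length is a multiple of 3.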