Gene Information |
Locus tag | Shewmr4_0443 |
Symbol | |
ID | 4251567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 502722 |
End bp | 505943 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638117002 |
Product | hypothetical protein |
Protein accession | YP_732580 |
Protein GI | 113968787 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
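The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 502722
end_bp = 505943
gene_length_bp = 3222
protein_length_aa = 1073

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is a stop
# and contributes no residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 3222 0 1073
```

Under these assumptions the reported gene length (3222 bp) and protein length (1073 aa) agree with the coordinates.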
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATTA GACTCATTTC TGCCGCCATC GCCTTGATAC TCGCGGGTTG TGGCGGAGGC TCAGAGGACA CGGGTAGCAC AACACCCCCG CCGGAGCCGC CAGTCCAACC TCCTGTGGTG ACTCAATATA CAGCGAGTAC CAGTTCAGTC TTGGGTGGTA AGCTCATCCC TAGCAGCCAA AAGGTCGATG CGGGTAAAAG TGCCAGCTTT AGCGTTCAAG CCGACAACGG TTTCGTACTG GATAGCATTA GCGGCTGTGG CGGCGCGCTG ACCGCACTGA CCTACATCAC TGCGACGATA AGCGCAGATT GCACTATCAC CCCAAGTTTT ATCAGCAATG CTGAAAATGC CATTCGCCAT CAAGACCATA CCCTTGCCAG CGCTAACGAG CTGATTGATT TCAGCACCAC CGAACTGGCG AACATAGAGA CCGCCCGCAA GACACAAATC ACCGAACTCT ATCAAGGTGT GGGTAACAGC ATCAGTTGGC ATCCGACCCA CGATTCCATC ACCTTTTCGA GCTTTATGCC GGAAAACACT TTTACCTTAC TGCCATCCAA TGTTGATGGC AGCGGCGCCA GCGCCGTACG CGGTTTAGTG ATGGCGGGTG AACAACAAGG CCAGCGTTAT GCCGCCATGG GTGGCAACCT GTTTGCGGTA AATAATTCGG ATCAGAGTGA TAAGCTGCTG AAAAACCTCA TCGGATGGCT AACCAAAGGC GGCGATCAGC AGAATGGCCT GAGTATAGTC ACGGCACAGA TGCCAAGTCG CGCCGACAGT TGGTATTTTC CCCATAACGA AGGGATCCGC ACGTGGCTAA GCAAGTACTA CCCCGACCCC CACAGCATCA ACGACGCCAA TAGCTGTGAT TACACCGCCC TTGCCAGCTG TATCGATACC TTAAAACCCG ATTTAATCGT TATCAGTGAT ATTGACCGCG ATAACCTAGG TTACGCGGGC ATTCAAGCGG CCATCACTAA AGCCAAGGCG GCAGGGATCC CGCTGCTGCT CTCAAACTAT CGACGCGAAC AAAGCGCTAT GCTCTCACCA CTCTATCTTG AGATGGGGCT CTCTACTGCA GGAAACTATT GGTCGAAACT CAATGCCAAT AATCTGAGTG TGAGCACTAT TCTGGCGGAA GATAAGACGC TCGCGGATGT CGAAACCCTG CTGGCAAATT TGCGCGAACA ACGCTTCGAT ACGCAAGTGC TCAATGACTG CGGCGGCAAC TATTTAAGCT GTAACGGCAG CGCCTTTGTC GAGGCCTTCA AGGCGGGTGC AGATTGGTAT CGCAGCAATG CCGAAACCTT AGATAACAAT GATATCGATG CTTTTAACAG CCCTAATTTT AGCTTGATGA AAGCGGGATT ACTCCTCGCT GACAAGTATC GCAGTGAAAT CGATTATCCC ATCGCCTACA GCGAATCCGC CCAGTGGCAG CAAGCCCTGT TTGCCGATTG GACAGTCAGC TATGCCCGCG CCCACAACCT CGCCCAACCT GACTTAGGTG AGTATGTCAC CGACAAGACC AATCTAAGTA AGGGTAGCAA TGCCCATTAC GCTTATCCTG CCACAGTATC GGAGCGTAAG AGCATCACAG TACCTTATTC TGGCCAGTGG ACAACAACGG GCTGGTACGC CTTACCCGGG CAGACCATCA CACTCAATCG CTTAGATAAT ACAGAAGCTA ATGTCGAAAT TAAGCTGAAT TATCATAGAC GTAATACTAA CCGCGCCTAT GAGCAAAAGG TTTACCGCGC GCCACTGGAA CTTACCCAGC AACGCCTCAA GCTCGCTAAA 
GGGCAAAGTC TTGAATTCTC AACCCCCTAT GGCGGCCCTA TTTATCTCTA TATCAGTGGT GGCGAAGGTG CGCTCAGTGT CGATGTGCAA GCCAAGAATG TGGCTAAGCA CCCCAGCATT ATGGATTTCT CTAATCCCGC AGAAATCACG GCCTTTAACG ATAAGATCCA AAATACCGAG TTGCCCCATG TGGATCTTCG CACCGATGGC GCCGAGCAGC ATTTGCGCCG TGACCGTTTT ATGAATGCGA TTGGCGGTAA GATCCCGGAT GTCAATGCGT TGCTCAAGAG CATTGTCGAG GATCATATCA ACAGCGTGTA TACCCTCGCT GGGCTGAAAA TCCAAGGTAA GAGCTTAAGT GAATCTCTCC CCTCGGATGT GCTTGCAAGC TGTCAGGCAC TGTTTGGCGC AGACTGCACC GATGCTAGCC TGCATACCCG CAGCATCATC CAACACGCTA ACTATGACCA AAACGCCCAC TGCGGCTCGG GTTGTAGCGG TAATCCATGG GATGCGGCTT GGAATATTTC GCCAACGGGT TGGGGAGATA ACCACGAACT CGGCCACAAT CTGCAAACCA ACCGCCTCAA TGTGCAATAT GCCACTGCCG CCAATCGCGA CAATTGGGCT GGTTATGGCA GCCGCGCGGG GGAAAACTCC AACAATATTT TCCCCTATGT GGTGTTGTGG AAGACCCATT ATCTGCGCGA CGGCAACACG AGCAGCATCA CCGATGGTCA TATGAATCAC AAAGATCTCT TCTATGTGTT TATGTCCGAT GCGGCAGGCA CCACTGATAC CAGTGGCAAA CGCGTGGTCT TTGGCGCCAA CTGCAAAGTG CTCGATGTGG GCGAAGACAG ATACACCGCC CCTTGGGCAA GCAATGCCTA CGCGATACAC AACGGTTATC GCATGGCGTT TTACATCCAA ATGGCGCTAA AAGCCCATGG CATGACGCTT AACGACGGTA CCAGCCTAAG CAATGGTTTC AATATCTTTA CCCTGCTATA TCAGCACAGC CGCATCTTTG GTAAATATGC CAATAGTGCC AGCGACTGGG AGGCTAACCG CAGCAAACTG GGCTTTGAAC TCTTCCCATA CGAGGGCCAT AGCGTTTACG GCGGCAAAAC AGTGAGGGAT ATTCCGGGCA ATGACTTTAT GCTGGTGTCC CTGAGTAAGC TCACAGGCAA AGATTGGCGC AGCCATTTTG ATATGTTGGG CCTGCGTTAC TCCAGCCTCG CCGCCGCGCA GGTAGCGGCA AACGCCAGCC AAGGAGCGGT TCCTATGGGC ATGTATGAGC TTGAAACCGA TTTGCCACCA GCAAACATGA GCCAAGGCTT AAGCTTTATC CCTCTGTCAG TTAGCGATGG CAGCACATTG TGGAAAGGCG TTGGATCTCC CAGTCAATGT GCCAAACCTT AA
|
Protein sequence | MEIRLISAAI ALILAGCGGG SEDTGSTTPP PEPPVQPPVV TQYTASTSSV LGGKLIPSSQ KVDAGKSASF SVQADNGFVL DSISGCGGAL TALTYITATI SADCTITPSF ISNAENAIRH QDHTLASANE LIDFSTTELA NIETARKTQI TELYQGVGNS ISWHPTHDSI TFSSFMPENT FTLLPSNVDG SGASAVRGLV MAGEQQGQRY AAMGGNLFAV NNSDQSDKLL KNLIGWLTKG GDQQNGLSIV TAQMPSRADS WYFPHNEGIR TWLSKYYPDP HSINDANSCD YTALASCIDT LKPDLIVISD IDRDNLGYAG IQAAITKAKA AGIPLLLSNY RREQSAMLSP LYLEMGLSTA GNYWSKLNAN NLSVSTILAE DKTLADVETL LANLREQRFD TQVLNDCGGN YLSCNGSAFV EAFKAGADWY RSNAETLDNN DIDAFNSPNF SLMKAGLLLA DKYRSEIDYP IAYSESAQWQ QALFADWTVS YARAHNLAQP DLGEYVTDKT NLSKGSNAHY AYPATVSERK SITVPYSGQW TTTGWYALPG QTITLNRLDN TEANVEIKLN YHRRNTNRAY EQKVYRAPLE LTQQRLKLAK GQSLEFSTPY GGPIYLYISG GEGALSVDVQ AKNVAKHPSI MDFSNPAEIT AFNDKIQNTE LPHVDLRTDG AEQHLRRDRF MNAIGGKIPD VNALLKSIVE DHINSVYTLA GLKIQGKSLS ESLPSDVLAS CQALFGADCT DASLHTRSII QHANYDQNAH CGSGCSGNPW DAAWNISPTG WGDNHELGHN LQTNRLNVQY ATAANRDNWA GYGSRAGENS NNIFPYVVLW KTHYLRDGNT SSITDGHMNH KDLFYVFMSD AAGTTDTSGK RVVFGANCKV LDVGEDRYTA PWASNAYAIH NGYRMAFYIQ MALKAHGMTL NDGTSLSNGF NIFTLLYQHS RIFGKYANSA SDWEANRSKL GFELFPYEGH SVYGGKTVRD IPGNDFMLVS LSKLTGKDWR SHFDMLGLRY SSLAAAQVAA NASQGAVPMG MYELETDLPP ANMSQGLSFI PLSVSDGSTL WKGVGSPSQC AKP
|
| |
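The gene sequence is listed in coding orientation (it starts with ATG and ends with the stop codon TAA) even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The following is a minimal sketch of that translation, not the database's own method. It uses the codon assignments of translation table 11, which are the same as those of the standard code for internal codons; the alternative start codons of table 11 are not handled, which does not matter here because the gene begins with ATG. The names `BASES`, `AMINO_ACIDS`, and `translate` are hypothetical helpers.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGAGATTA GACTCATTTC TGCCGCCATC"))  # MEIRLISAAI
# Last 12 bases of the gene sequence, including the TAA stop codon.
print(translate("GCCAAACCTT AA"))  # AKP
```

Both fragments match the corresponding ends of the listed protein sequence (MEIRLISAAI at the N-terminus, AKP at the C-terminus).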