Gene Information |
Locus tag | Shewmr4_1791 |
Symbol | |
ID | 4252365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 2124035 |
End bp | 2127247 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638118402 |
Product | hypothetical protein |
Protein accession | YP_733922 |
Protein GI | 113970129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
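The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the usual convention for NCBI-style records, and consistent with the values listed here) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2124035
END_BP = 2127247
GENE_LENGTH_BP = 3213
PROTEIN_LENGTH_AA = 1070


def span_length(start, end):
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values reported in the record.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

With the values in this record, both comparisons hold: 2127247 - 2124035 + 1 = 3213 bp, and 3213 / 3 - 1 = 1070 aa.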
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.684722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAACA CGCCGACATC GCCAGATCAG GAGACTGCCG AGCACGCGCA TCAGGCGATG CCTCAAGAAG CGACTGCGGC CGCAGCGCCA GATTCGGCGG CGTACCGCCG CAGCTCTACA CTCAAAAAGC GCTTAAAACA AAGCCTGATT GCCACCACTG TGGCCGGCCT GTTAGCTGGC GCGGCGCTAA CCATTTGGAT AATAAACAGT CAAACCGAAG CCCTGATCCT TCGCGTCGCG AATTATGCCT TAAGCGGCAT GGACGGTGAG CTTAGCGATA TTCGTTTAGG CCCCATGGGC TTAGAACATT GGCACATTCG CTCTGCGAGC CTGCGTGTAC ATGACTCGCA TTTAGTGATC AATAATCTGG ATATTCAGCT TGAACTCAAC TGGCCCAAAA GCCTTGAAGA ACTTAAGCAA CTTGTCCAAG TTGAGAGCTT GACTCAAAAA ATCAAGCGCA TTAGCACGGG CGAGATAGAC GTTGAACTCG GCGCATCGCT GCTGGAGCGA AGCCCCACCA TCGCCGATGA ACAAACGCCA GCGTTGGCGT TAAATATCAA ATCCTTACCT TTAATTGATA TAGGTAAAAC CACGCTCAGA TTAGCGCCGC AGGCAGAGTT TCCTGCCTAT CAACTGGTGA TGGATAAACT CAGCCTGAAT CATCAAGCTG AATTAACCAC AGCCTTTAGC AGCCCTGAGG GTGAACCGTT AGCCCAGCTT GCCGCGACGC TTGGCAACGA ACAATGGCGC TTGAAGAGCG AACTGAATAT CGCGCCGCTA CTGGAAAACC TGCATCAAAT TGGCCTGCGC CAAACCCAAG GGAGCATCCT CAGCCAATTA ACCCTGTGGG ATCAACAGTG GCAACAACTT GGGATAGGAT TAAGTGGGCA ACTCAGTTCT GAGAGCACGA TGACACTCGC CAGTGGCGAA ATAACAAGCC ACCACCGCAT TCAGCAGCCC AGCATCAGCT TAAGCCATTT TGCCGACTTA ACGCTCGCGC CGCAGCCTGC TTTGGGGTTT GAGCTTAGTG GATCACTTGC TTCACTTAAT CTCACCCTCG AGCCATTTCG TCTTGCGCTC ACACCCAATG CCGCACAGCA CACGCAGCTA TTAGCGGTGC TTAATCAGTC TCTGCAATTG AGCGATGAAA ACTCTCAAGC GCTCCTTACC CTGCTGTCGG GGCTCAAAAG CACCGAGGCG CCCGTGGGCC TTGCCTTTTC GATGACAGCG CCACTGCACT ATGCACTGGC ATCTCAGGCG AACACCCATG AACCCATAGC GCTGCCCGCA TTCGAGTTAA CCACCCTAGG CAGTAAGCTT GAGACGCGTA TTAGCTTACA AGATATCCAG TTAACGCCCA CGCCAGATGC TTGGAAGGTC GCGAGCCGCT GGCAACTGGC GCTTACGCAA ACGACCCCGC TGACACTGCG GGAGCTTTGG CATGCAGCCC CGCAGGATCT CAGTTGGGGA GCGGGAATGC TGCAAACAGC AGGTCATATC AGTGTTGCTC AGTCAGCTCA AGGACTGAAT TGGCAAATCA GCACAGCGCC AGTGACGAGT GACTCGAATG TCTCAAGCGA TACCTTACAA TTTGCACTTG AAGATCTGCA GCTACAGCAA ACTGCGCAAG CCGCCGAGCA TCAAACTAAA CAAACGCAGT TAAGCCTTGG CAGTATTCAG CTCAACGCTA AGGCCCCCAT GGCCGCGAGT GCAACCCCGT TAGCGACTAA GGATACGACT GGCGCGCAAT CGACCGAGTT TGCCTTGAAT CTGCCGCCAT TATCACTCGC CCTGTCGCAC 
TTGCGCGTGA GCCAAGCGGT AGAGAATCTT ACTGGCAACA ACAACAGCAA CAGCGCCGCC GCAGTGCAGA GCAGTCGCAA CGATATTAGC CTCAAGGCGT TCTCCCTTGA GACATCAAAG GCGATGACCC TCGATTACTC CTCGTTGCAA TCCATTGAAA ATGCTATCCA ATCGAGTCAG TTGAGTAATC AAGTGAACTG GCAAGCGCAG CAACTCTTGA TTGAAAAGCA GCTCAGCGCC AAAGGCCGCA CACGTAAACA GACTGTGCTT AAACTGGATA ATTTGGCACT CGCGCAGACG TTAAACTGGC AAAACCAACG TCTTCACGGC CATGAACAGT GGCAAGTCGG CACGGTTGAG CTGCAAAGCG ACCATCAATT ACAGTTAGCC GCGCCTCATA AACCACTGCT ATTAACGGGC CAATGGGTTG TCGATACCAG CATGACAGAA GCGCTATCTT TGCTGAATCA AACCCAGCCC TTGCCCGCTG AGTTAAATGT GACAGGCCAT AACCAATTAC AGGCACAATT TAAGCTGACA CATGAGCCAG AACAGACTCA ATTTGCCATG CAAATTACCC AGTCGATGAC AGAGCTGGAA GGTTTTTATC AAGACACGAC CTTTGAAGGC GGGAAATTAC AGGCCCAATG CGAGTTCACT TGGGGGCAAT CCTATAAGGC GCCGCAAGCG CCAGGCTATT TCAGCAGTTT AAGCCGACTG AACTGCCCGC AGACCATGAT GACCTTTAAC TTGTTCGATC CCGGCTTTCC CTTGACCGAT ATCGAAGTGG AGGCCGATAT TGTCCTCGCC AAGGATGCCG AAAAGCTTCC CGACAATTGG CTGCAACAAC TCACGGGCTT GAGTGATACC GACGTGTCGA TGACCGCCAA GGGCAAAGTA TTGAGCGGCC AGTTTTTACT GCCTGATTTT AATTTAAAAT TGCAGGACAA ATCCCACGCC TATCTATTAT TGCAGGGCAT GAGCCTTGAG GAAGTACTGC GCATTCAGCC GCAAATTGGT ATCTATGCCG ATGGAATTTT CGATGGTGTA TTACCCGTGG ATTTGGTCGA TGGCAAAGTC TCGATCAGCG GTGGCCAGTT GGCGGCCCGT GCGCCCGGTG GCCTTATTGC GATTTCAGGC AATCCGGCGG TCGATCAAAT GCGTCAATCG CAGCCTTATC TCGACTTTGT ATTTTCGGCA CTAGAACATT TAGAATACAG CCAGTTATCC AGTAGTTTCG ATATGGATCA AACCGGCGAT GCCAACCTCT TAGTGGAAGT CAAAGGCCGC AGCCGAGGGA TTGAACGCCC CATCCACTTG AATTACTCTC ATGAAGAGAA CATGCTGCAA TTATTCAGGA GTCTTCAGAT TGGTAACGAT CTGCAGGACA GAATCGAAAA ATCCGTGAAG TAA
|
Protein sequence | MVNTPTSPDQ ETAEHAHQAM PQEATAAAAP DSAAYRRSST LKKRLKQSLI ATTVAGLLAG AALTIWIINS QTEALILRVA NYALSGMDGE LSDIRLGPMG LEHWHIRSAS LRVHDSHLVI NNLDIQLELN WPKSLEELKQ LVQVESLTQK IKRISTGEID VELGASLLER SPTIADEQTP ALALNIKSLP LIDIGKTTLR LAPQAEFPAY QLVMDKLSLN HQAELTTAFS SPEGEPLAQL AATLGNEQWR LKSELNIAPL LENLHQIGLR QTQGSILSQL TLWDQQWQQL GIGLSGQLSS ESTMTLASGE ITSHHRIQQP SISLSHFADL TLAPQPALGF ELSGSLASLN LTLEPFRLAL TPNAAQHTQL LAVLNQSLQL SDENSQALLT LLSGLKSTEA PVGLAFSMTA PLHYALASQA NTHEPIALPA FELTTLGSKL ETRISLQDIQ LTPTPDAWKV ASRWQLALTQ TTPLTLRELW HAAPQDLSWG AGMLQTAGHI SVAQSAQGLN WQISTAPVTS DSNVSSDTLQ FALEDLQLQQ TAQAAEHQTK QTQLSLGSIQ LNAKAPMAAS ATPLATKDTT GAQSTEFALN LPPLSLALSH LRVSQAVENL TGNNNSNSAA AVQSSRNDIS LKAFSLETSK AMTLDYSSLQ SIENAIQSSQ LSNQVNWQAQ QLLIEKQLSA KGRTRKQTVL KLDNLALAQT LNWQNQRLHG HEQWQVGTVE LQSDHQLQLA APHKPLLLTG QWVVDTSMTE ALSLLNQTQP LPAELNVTGH NQLQAQFKLT HEPEQTQFAM QITQSMTELE GFYQDTTFEG GKLQAQCEFT WGQSYKAPQA PGYFSSLSRL NCPQTMMTFN LFDPGFPLTD IEVEADIVLA KDAEKLPDNW LQQLTGLSDT DVSMTAKGKV LSGQFLLPDF NLKLQDKSHA YLLLQGMSLE EVLRIQPQIG IYADGIFDGV LPVDLVDGKV SISGGQLAAR APGGLIAISG NPAVDQMRQS QPYLDFVFSA LEHLEYSQLS SSFDMDQTGD ANLLVEVKGR SRGIERPIHL NYSHEENMLQ LFRSLQIGND LQDRIEKSVK
|
| |
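The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline used to produce the record: it assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons permitted by table 11 are not handled, since this gene starts with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G base order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in the record.
print(translate("ATGGTTAACA CGCCGACATC GCCAGATCAG"))
print(translate("ATCGAAAAA TCCGTGAAG TAA"))
```

The two fragments translate to the first ten residues (MVNTPTSPDQ) and the last six residues (IEKSVK) of the protein sequence listed in the record.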