Gene Information |
Locus tag | Sbal223_3842 |
Symbol | |
ID | 7088877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 4552785 |
End bp | 4556018 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643462721 |
Product | hypothetical protein |
Protein accession | YP_002359742 |
Protein GI | 217974991 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00660382 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAGGATTA GACTGCTGTG CGCCTCCATC GCCATACTGC TTAGCGGATG CGGTGGCGGC TCAGAAGATG CTGGCACGAC CACACCACCG ACTCAGCCGC CAGTCCAACC TCCCGCGCCG GTGCAATACA TTGCTAGTAG TGTGTCAGCA AATGGCGGAG CCTTAGATCC AAGCAGTCAA AAAGTGGAGT CGGGTAAAAG CGCTATTTTT AGCATTACTG CCGATATTGG CTTTGCACTC GAAGGCATTA CTGGCTGTGG TGGCACGCTC AATGCGCTAA CTTACACCAC TGCGTCCATG ACGGCGGATT GCACTGTCAC GCCGACATTT ATTGCTAACG CCGAAAATGC CATTAAGCAT CAAGACCACA GGCTTGCCAG TGCAAGTGAG TTAATTGATT TCAGCATCGC CAAACTCGCT AGTATCGACA TCAAACGTAA AGCCCAGATA AATCAGCTAT ATCAAGGTGT CGGTAGCAGC ATCTCTTGGC ACCCGACTCA CGACTCAATT ACCTTTTCGA GCTTTATGCC AGAGAACACT TTTACTGTTT TGCCATCGAA TGTCGATGGC AGTGGCGCAA GCGCGGTTCG CGGGTTAGTG ATGGCAGGCG AGCAACAAGG GCAAAGATAT GCGGCCATGG GAGGAAATCT CTTTTCCGTC AACACTTCAG CTCAAACCGA TACTTTGCTG AAAAATCTTT TAGGCTGGCT GACCAAAGGC GCCGATAAAA CCGACGGCCT CAGCATAGTT ACCACCCAAA TGCCGAGCAA AGCCGACAGT TGGTACTTTC CCCATAACGA AGATATTCGC ACTTGGCTTG CAAACAACTA CCCCGCAGCC CATAGCATTA ATGCGGCAAA TACCTGTGAC TACAGTGAGT TAAGCAACTG TATCGACACA CTCAAGCCCG ATGTGATAGT GATTAGTGAC ATAGACCGTC AGTCCCTCGG TTTTGCAGGC ATTAAAGCTG CTATCGCTAA GGCCAAAGCG GCGGACATTC CCTTGCTGCT CTCAAACTAC TGGCGCAATC AAAGCCCAAT GCTATCGCCG CTTTACCTTG AAATGGGGCT TTCAACCTCA GGTAATTATT GGTCAAAGCT CAATGCCAGC GACCTCAGTG TGACTGATGT ATTGGCTGAA GATACTGCCC TGACCAAGGT AAATAATTTG CTCGGCAAGC TCAATGAAGC AAGCTTCGAT ACCTCAGTGC TTAACGACTG CACAGGCAAT TATCTCAACT GCAATGCTGC CGTATTTGTT GACTCGTTCA AAGCCGGCGC CGATTGGTAT CGAGCCAATG CCGAAACCTT AGATAACAAT GCAATTGATG TGTTTACCAA GGCAGACTTC CCTTTGATGA AGGCCGGACT GTTACTCGCC GATAAGTATC GCAGCGAGAT TGATTACCCC ATAGCCTACA GCGAGCACGC ACAATGGCAG CAAGCCCTGT TTGCCGATTG GACCGTCAGC TATGCCCGTA CCCATAACTT AGCCCAGCCA GATTTGGGTG AATATGTCAC CGACATAGCC AACCTCAGCA AGGACAGCAA TGCGCACTAC GCTTACCCTG CGACAGTCTC AGAGCGTAAA ACCATTAGTG TGCCCTATGC AGGCCAATGG ACGACTACAG GCTGGTATGC ATTGCCGGGG CAAACCATCA AACTTACTCG CCTCGATAGC AACGCTGCGA ATGTGGAGAT CAAACTCAAC TACCACAGAC GTAACACTAA CCGCGCCTAT GAGCAGAAAA TCTACCGCGG TCCCTTGGAG CTGGCGCAGC AACGCCTGCG CTTAGCACAA 
TGGAAAAGTA TTGAGTTTTC CTCCCCCTAC GGCGGGCCGA TTTATCTATA CATCAATGGT GATGCTTCGA GTGTTGATGG CGCGTTAAGT GTTGATGTCA ACGCTGAGCA TGTCGCTAAA CATCCGACCA TTATGGATTT CTCCAATCCA GCTGAAATCG CGGCCTTTAA CGAACGCATT CAAAATACCG AATTACCCCA TGTGGACTTG CGCACCGATG GTGCAGAGCA GCATCTACGT CGCGACCGCT TCTTAAACGC CATAGGTACC GATGTGCCTG ATGTCAACGC GCTGCTCAAC AGCATTGTCG ATGACCATAT CAACAGCGTT TACACCTTAG CTGGCCTTAA AATTCAAGGT AAGAGTTTAA CTGAGTCCTT ACCCGCCGAT GTGCTGGCGG CCTGCAAAGG ACTGTTTGGC GACGACTGTA TCGACAATAG TCTGCATATC CGCACTATTA TTCAACACGC CAATTATGAC CAAAATGCCC AGTGCGGCGC CGGTTGTAGT GGTAATCCTT GGGATGCGGC ATGGAATATC GACCCAACGG GCTGGGGTGA TAACCATGAG TTAGGCCATA ATCTGCAAAC GAACCGCCTC AATGTGCAAT ATGCAGCGGC AAACAATAGT GACAACTGGG CAGGTTATAG CAGTCGCGCG GGCGAAAACT CCAACAATAT CTTCCCTTAT GTAGTGAAGT GGAAAACCCA TTATCTGCGC GATGGCAATA CAGGCACAGT CACCGATGGC CACATGAACC ACAAGGATCT CTTCTATGTG TTTATGTCCG ATGCCGCAGG TACAACAGAC ACTAGTGGTA AACGCGTAGT GTTTGGCGCC AATTGTAAGG TGCTCGATGC TGGTGAAGAC AGATACACTG CGCCTTGGGC CAGCAATGCT TATGCTGTCC ATAACGGTTA TCGCATGGCG TTTTACATCC AAATGGCGCT TAAGGCTCAC GGCATGACGC TCAGTGACGG CACCACTCTG AACAATGGCT TTAATCTCTT TACCTTGCTG TATCAACACA GTCGTATTTT CGACAAATAT GCCAATAACG CCAGTGATTG GGAAGCGAAC CGCAGCAAAC TGGGCTTTAG CCAATTTCCG TTTGACGGCA ATAGTGTCTA CGGCGGAAAA ACCGTGAAAG ATATCCCCGG CAACGACTTT ATGCTGGTGT CTTTGAGTCA GCTCACAGGC AAAGATTGGC GTAGCCACTT TGATATGTTG GGTCTGCGCT ACTCAAGCCT TGCAGCCGTG CAAACGGTCG CTAATGCGAC CTCGGGTACT ATGCCCATGG GCATGTATGA GCTAGAAACC GACTTGCCGC CAGCGAACAT GAGCCAAGGT TTAACCTTTG TGCCACTGTC ACTCACTGAT GGCACAACAC AGTGGAAAGG CGTGGGTTCG CCAACCCAGT GCACTAAACC ATAA
Protein sequence | MRIRLLCASI AILLSGCGGG SEDAGTTTPP TQPPVQPPAP VQYIASSVSA NGGALDPSSQ KVESGKSAIF SITADIGFAL EGITGCGGTL NALTYTTASM TADCTVTPTF IANAENAIKH QDHRLASASE LIDFSIAKLA SIDIKRKAQI NQLYQGVGSS ISWHPTHDSI TFSSFMPENT FTVLPSNVDG SGASAVRGLV MAGEQQGQRY AAMGGNLFSV NTSAQTDTLL KNLLGWLTKG ADKTDGLSIV TTQMPSKADS WYFPHNEDIR TWLANNYPAA HSINAANTCD YSELSNCIDT LKPDVIVISD IDRQSLGFAG IKAAIAKAKA ADIPLLLSNY WRNQSPMLSP LYLEMGLSTS GNYWSKLNAS DLSVTDVLAE DTALTKVNNL LGKLNEASFD TSVLNDCTGN YLNCNAAVFV DSFKAGADWY RANAETLDNN AIDVFTKADF PLMKAGLLLA DKYRSEIDYP IAYSEHAQWQ QALFADWTVS YARTHNLAQP DLGEYVTDIA NLSKDSNAHY AYPATVSERK TISVPYAGQW TTTGWYALPG QTIKLTRLDS NAANVEIKLN YHRRNTNRAY EQKIYRGPLE LAQQRLRLAQ WKSIEFSSPY GGPIYLYING DASSVDGALS VDVNAEHVAK HPTIMDFSNP AEIAAFNERI QNTELPHVDL RTDGAEQHLR RDRFLNAIGT DVPDVNALLN SIVDDHINSV YTLAGLKIQG KSLTESLPAD VLAACKGLFG DDCIDNSLHI RTIIQHANYD QNAQCGAGCS GNPWDAAWNI DPTGWGDNHE LGHNLQTNRL NVQYAAANNS DNWAGYSSRA GENSNNIFPY VVKWKTHYLR DGNTGTVTDG HMNHKDLFYV FMSDAAGTTD TSGKRVVFGA NCKVLDAGED RYTAPWASNA YAVHNGYRMA FYIQMALKAH GMTLSDGTTL NNGFNLFTLL YQHSRIFDKY ANNASDWEAN RSKLGFSQFP FDGNSVYGGK TVKDIPGNDF MLVSLSQLTG KDWRSHFDML GLRYSSLAAV QTVANATSGT MPMGMYELET DLPPANMSQG LTFVPLSLTD GTTQWKGVGS PTQCTKP
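The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database entry: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene sequence includes a terminal stop codon that is not counted in the protein length (the listed sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the untranslated stop codon


def strip_blocks(seq: str) -> str:
    """Remove the 10-character block spacing used in the sequence fields."""
    return "".join(seq.split())


# Values copied from the record above.
print(gene_length(4552785, 4556018))  # 3234, matches "Gene Length"
print(protein_length(3234))           # 1077, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields, which suggests the coordinates and sequence fields are internally consistent.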