Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_1907 |
Symbol | |
ID | 7090074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2251295 |
End bp | 2253337 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643460811 |
Product | protein of unknown function DUF1302 |
Protein accession | YP_002357835 |
Protein GI | 217973084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00456554 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.102822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG TTAAAAATAG TTTTAACAAA TCGGCGCTTG CTTTGGGGGT AGCGTCGGCA TTGAGTTTGT TAATGATCCC AAGCGTTAAT GCAATATCTT TTGATTGGGG AGAGGTTGAG GGTACATTTG ACTCAACTTG GACTGCTGGT GCGAGCTGGC GTGTTGGCGC GCGCGATTGG GAAGGTCAAA TTGGTAAGGT GAATCAACCC CAATTCGATT GGTCTAATTA CAGCGCATTT GGCAATACAA AATATACTTC AGCTGAGATT TGGGCCCAGC CCGGGTCGTA TTCCAGCAAC AATGACTTAA GTAACTTACT TTATTCTCAA GGCGATACTA CGTCTGAGAT CGTCAAAGGT TTGCATGAAT TATCGTTGAA ATACAAAAAC TACGGTTTGT TCGCCCGCGG TATGTATTTT TATGATCGTA AACTGAACGA TGGCAATTAT GACTACAACG ATCCGATCAC GGGTAAAGAA TTTGACCCTT GCGAAGATAG CCGCGCTTCA GAAGTACAAT GTAAAGATAT CCGTTTATTA GATGCGTTTG TCTATGCCAA CTTCGATTTG AATGAGGGTG CGAATCCATT ATCTATTCGT GTTGGTAATC AAGTGGTGTC ATGGGGTGAA AGTACACTGA TTGCCCATGG GATCAGTGAA ATTAACGCCG TGGATTTGAA CATTCTTAAC GCACCTGGTG CTGAGTTAAA AGAAGCGTTT AGACCGCAAG GTATGGTGTG GGCATCCCTT GGCCTGACAG ATTCACTAAC GGTTGAAGCA TTCTATCAAT ATGATTGGCA ACCTATTTGG GTCCCAACTC CGGGCTCAAT TTTTGCGACT AACGATTTTG CCGGTTTTGG GGGTTATGCT CAAAACGCTC AGCTTGGTTT TAACGCTAAC CCTGACATCA ACTTAGATTT TGTGATGCAA GAATATGAGC GTTTAGCAGG CATGATCGCC AGCGGCCAAT CTATTCCGAC ACAGCAATTG GTTTCTATGG CTTTAGCCTA CCCAACGAAA GTGACGCTAG TGCAGGATGA AGAAGAACCA AGCAATGATG GCCAATACGG TATCAAGCTG GGTTATTACG CGCCTGAGCT AGGTGAAACG GAATTTGGTT TCTATTTCAT GAATTACCAT AGTCGTCGTC CACTGATCAG TGGTACTGCT GCCGACTTTA GCACGGGTGC ATTGTTATCT GACTTGGCGA CGGTAGGCCA AAACTCGGGC AACATTAACC GTGAACTGCT GCTCAGCCTG AAGAGCTTCT CTAAGGCGCA AATCGTCTAT CCAGAGGATA TTAAACTCTA CGGTTTCAGC TTCAACACCT TGATTGGTGA TACGTCTGTC GCGGGTGAGA TTGCCCATCG TCAAGATGAG CCACTGCAGA TTGATGATGT TGAGTTGCTG TTCGCCGGTA TGCCACAACA GTTAGCCAAT GCGGGGCTTC GTCCCGATCT CGATGGTATT TCACAGATTA AAGACGTTCA ACCGGGTGAA ACCGTCGATG GCTTTATCCG CCTTGATACC ACACAGGCGC AGGTGACTTT CACCCATCTA TTTGGTCCAA CTTTAGGTTT AGATAACTTA ACTATGTTGG CCGAAGTGGG TGGTGTGTGG ATCCACGATA TGCCAGGTTT CGATGAGCTA CGTCTCAATG GCCCAGGTAC GTCACGTTCT GGCGGCAATC CCGATATGCC AGGCATCATT CAAGCCTTGC ACAATGGTCC TGAAACTAAC CCGTTCCCAA CTGACTTTGC TTGGGGCTAC CGTTTAGTGG CAAAAGCGGA TTTCAATAAC GTCTTTGCTG GGGTCAATAT GTCGCCACGG GTCATTTTCT CCCATGACGT TGACGGTATT ACTCCGGATC CTATGTTCCT GTTCACGGAA GGACGCAAGT CTGTCGCGCT AGGCGTTAAC TTCGACTATC AAAACCGTTG GGGTGCAGAT ATCTCCTATA ACAATTTCTT TGGCGGTGTC GGTACGACGA ATGCGATGGC AGATCGTGAT TATGTTTCAT TCAATATCAA GTATTCGATC TAA
|
Protein sequence | MKIVKNSFNK SALALGVASA LSLLMIPSVN AISFDWGEVE GTFDSTWTAG ASWRVGARDW EGQIGKVNQP QFDWSNYSAF GNTKYTSAEI WAQPGSYSSN NDLSNLLYSQ GDTTSEIVKG LHELSLKYKN YGLFARGMYF YDRKLNDGNY DYNDPITGKE FDPCEDSRAS EVQCKDIRLL DAFVYANFDL NEGANPLSIR VGNQVVSWGE STLIAHGISE INAVDLNILN APGAELKEAF RPQGMVWASL GLTDSLTVEA FYQYDWQPIW VPTPGSIFAT NDFAGFGGYA QNAQLGFNAN PDINLDFVMQ EYERLAGMIA SGQSIPTQQL VSMALAYPTK VTLVQDEEEP SNDGQYGIKL GYYAPELGET EFGFYFMNYH SRRPLISGTA ADFSTGALLS DLATVGQNSG NINRELLLSL KSFSKAQIVY PEDIKLYGFS FNTLIGDTSV AGEIAHRQDE PLQIDDVELL FAGMPQQLAN AGLRPDLDGI SQIKDVQPGE TVDGFIRLDT TQAQVTFTHL FGPTLGLDNL TMLAEVGGVW IHDMPGFDEL RLNGPGTSRS GGNPDMPGII QALHNGPETN PFPTDFAWGY RLVAKADFNN VFAGVNMSPR VIFSHDVDGI TPDPMFLFTE GRKSVALGVN FDYQNRWGAD ISYNNFFGGV GTTNAMADRD YVSFNIKYSI
|
| |