Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_3662 |
Symbol | |
ID | 7089596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4341620 |
End bp | 4343920 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643462542 |
Product | peptidase S9B dipeptidylpeptidase IV domain protein |
Protein accession | YP_002359563 |
Protein GI | 217974812 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0549477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000000187493 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAAAA ATGGGTTAGC TTGGTTCCTT ACGCGGGCGC TTAAATTGAG CGCTGTTCCC CTACTGATAA CGAGTCAGGT AGTTATGATT TCAGCAACGA CCCCAGCTTT GGCAATGGAA GGTGGCAAAA AGCCACTGAC GATTGAGCGA ATGAATGCCT CGCCCGCCTT AGCAGGCACC AGTCCCCGTG GTTTAAAATT GTCACCCGAT GGGCTGCGCG TCACCTATCT TGCTGGCCGT AAAGACAATC AAAGTTTTTA TGATCTCTGG CAGATGGATG TCAAAAGCGG TGAGTCGAGT TTGCTATTAA ACGCTGATAA ACTCGCGACC AATGAATTAT CCGATGAAGA AAAAGCCCGC CGCGAGCGCC AACGTATTTA TGGTGAAGGC ATTATGGAGT ACTTCTGGGC CGATGATAGC CAAGCGCTAT TAATTCCGGC CTCTGGCAAT CTGTATTACT TTTCCCTCGT CGACAATAGC GTGACTCAAT TGCCGATTGG CGAAGGGTTT GCTACAGATG CACGCTTATC GCCCAAGGGA CATTTTGTGT CTTTCGTGCG GGATCAAAAT TTGTATGTGT TAGATCTGGC GACGAAAAAA CTCCAAGTTA TGACGACCGA TGGCGGCGGG GTGATTAAAA ATGCCATGGC CGAGTTTGTC GCTCAAGAAG AAATGGATCG CATGACTGGC TATTGGTGGG CGCCCGATGA GTCGGCTATC GCCTTCACTC GTATCGACGA GTCCGCGGTC GAGCAAGTGA CGCGCAATGA GATCTATGCC GATGGCATTA AACTCACCGA GCAGCGTTAC CCAGCAGCAG GTAAAAACAA CGTCGACATC GCGTTAGGTG TGGTCACGCT AAAAGATAAG GCCATCAATT GGGTGAGCCT GCGCGAAGAG AATAGCAAAG AAAAGAGCAA AGACATTTAC CTGCCGCGTG TCGATTGGTT GCCCGATAGC AAACACTTGT CGTTCCAGTG GCAGAGCCGC GATCAACATC AGCTCGATTT GCAGTTAGTG GCGTTAGATG CACTGACTAA GCCAAAAACC TTAGTGAAAG AACGCAGTGA TGCTTGGGTA AACCTCAATA ACGATCTGCA TTTCTTAAAA CAGCAGTCTG CCTTTATTTG GGCGTCTGAG CGTGACGGCT TTAATCATCT GTATCTTTTT GACTTAAAAG GCAAACTCAA AACGCAATTG ACTAAGGGCG AGTGGGCTGT CGATGAGTTG GAATACATAG ATGAAACCGC AGGCTGGGTG TATTTCACCG GCAGCAAAGA CACGCCAATC GAGAAACAGC TTTATCGCGT ACCGTTAGCG GGTGGCAAGG TTGAGCGCGT GAGCAAGCAA GCGGGGATGC ACAATCCTGT TTTCGCCGAT AATCAGAGTG TATATCTGGA TTATTTCAAT AGCTTATCTC AGCCACCGCA AATCAGTTTA CACGGTGACA AGGGCCAGCA ACTGGCTTGG GTCGAGCAAA ATGCGGTTAA GCAAGGTCAT CCTTTATATG ATTATGCAGG GCTGTGGCAA ATCCCTGAAT TTGGTGAACT GAAAGCCGAA GATGGCCAAG TGCTACAAAC TCGTTTATTC AAACCCGTTC CCTTCGATGC GAGTAAGAAA TACCCTGTCG TAGTGCGGGT TTATGGTGGG CCGCACGCCC AGTTAGTGAC TAATAGTTGG AGCGAGCAGG ACTACTTTAC CCAGTATCTT GTGCAACAAG GGTATGTGGT ATTCCAATTA GATAACCGCG GCAGTGCCCA CAGAGGCACT CGGTTTGAGC AGGTGATTTA CCGTCACTTG GGCGAAGCTG AAGTGAATGA TCAAAAAGTG GGGGTGGAGT ATTTACGGAG TCTGCCCTTT GTCGATGCCG ATAATGTGGC GATTTATGGC CACAGCTACG GTGGTTACAT GGCTTTGATG AGTTTATTTA AGGCGCCGGA TTACTTTAAA GCCGCGATTT CGGGCGCACC TGTGACCGAC TGGCGCTTGT ATGACACCCA TTATACTGAG CGTTATTTAG GTCATCCCGA AGGTAATGAA AAGGGTTATG AAGCCAGTAG CGTGTTCCCT TACGTGAAAA ACTATCAAGC GGGTCTATTG ATGTATCACG GCATGGCTGA CGATAACGTC TTGTTTGAAA ACAGCACTCG AGTTTATAAA GCGCTGCAGG ATGAAGGCAA ATTATTCCAG ATGATCGATT ATCCGGGATC TAAACATTCG ATGCGTGGCG AGAAAGTGCG TAATCACTTA TACCGCTCAT TAGCGGATTT CCTCGATAGA CAGCTGAAAA GCGCTAAGTA G
|
Protein sequence | MIKNGLAWFL TRALKLSAVP LLITSQVVMI SATTPALAME GGKKPLTIER MNASPALAGT SPRGLKLSPD GLRVTYLAGR KDNQSFYDLW QMDVKSGESS LLLNADKLAT NELSDEEKAR RERQRIYGEG IMEYFWADDS QALLIPASGN LYYFSLVDNS VTQLPIGEGF ATDARLSPKG HFVSFVRDQN LYVLDLATKK LQVMTTDGGG VIKNAMAEFV AQEEMDRMTG YWWAPDESAI AFTRIDESAV EQVTRNEIYA DGIKLTEQRY PAAGKNNVDI ALGVVTLKDK AINWVSLREE NSKEKSKDIY LPRVDWLPDS KHLSFQWQSR DQHQLDLQLV ALDALTKPKT LVKERSDAWV NLNNDLHFLK QQSAFIWASE RDGFNHLYLF DLKGKLKTQL TKGEWAVDEL EYIDETAGWV YFTGSKDTPI EKQLYRVPLA GGKVERVSKQ AGMHNPVFAD NQSVYLDYFN SLSQPPQISL HGDKGQQLAW VEQNAVKQGH PLYDYAGLWQ IPEFGELKAE DGQVLQTRLF KPVPFDASKK YPVVVRVYGG PHAQLVTNSW SEQDYFTQYL VQQGYVVFQL DNRGSAHRGT RFEQVIYRHL GEAEVNDQKV GVEYLRSLPF VDADNVAIYG HSYGGYMALM SLFKAPDYFK AAISGAPVTD WRLYDTHYTE RYLGHPEGNE KGYEASSVFP YVKNYQAGLL MYHGMADDNV LFENSTRVYK ALQDEGKLFQ MIDYPGSKHS MRGEKVRNHL YRSLADFLDR QLKSAK
|
| |