Gene Information |
Locus tag | Sama_3014 |
Symbol | |
ID | 4605261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 3587752 |
End bp | 3590226 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639782425 |
Product | dipeptidyl peptidase IV, putative |
Protein accession | YP_928886 |
Protein GI | 119776146 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
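
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3587752, 3590226)
    print(length, "bp")                  # values taken from the record above
    print(protein_length(length), "aa")
```

The strand field does not affect these lengths; it only determines that the coding sequence is read from the reverse complement of the replicon.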
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.003026 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCGG TTTACCGTCA CTTTGCTGTC AGTGCGTTGG CGCTGGCTAT CCTGTCCGGC TGCGCTGCCA CCGAACACGC CGCAACTGCT CCATCAGCTG CCAATCAGAC ACCTGTGTCA CAAGTGGAGC TGGCAACACC GCCTGTTGCC TCCGAACACC TGACCATGAA GCAAATTATG GCCAATCCAG ACTGGATGGG CATATTCGCC CGCAATCCAT ACTGGAGCGA TGACGCCGGC AGCGTACTGT TTGAGCGTCA GCCAAGTGCA TCGGCGCTTA AGTCCTACTA TGTTCAGGCG CTGGATGGTG AAGCCCGTGA GTTGTCTTTG GGCGAGTTAC ATCAGGTCTC CCAGCGCAGT GGTGTGCTGG ATAAGGCGAA AAAGCGCAAG GCCTACATCT ATCAGGGCAA CCTCTTCGTT AAAGAGCTGT CCACCGGTGA AGTCGCACAG CTGACCCGTA CCAATACCCC AGTGGATGGT GTGCGTTTCT TGGATAATGG TGACCTGGCC TTCTGGCAGG GGAACGCCGT ATTCCGCCTG CATGCCAACG GCATGGTAGA GCAGCTTGCC GATATCCGAA TGAGCAATGC GCCCAAGGGC GTTAAGGCCC CAGAGGCCTA TTTGGCTAAA GAGCAGGCAA AGCTCATCCA GTATGTTGCT GCCCAGCATG ACAAAGCCAA AGCCCGCGAG GACTATAAAA AGGCGCTGGC TGAAGCCGAT ACCACCATGA CGGCACAGCC CTGGTATTTG GGGGAGAGTG AGGTTGTCGT CGAACTGTCT CTGTCAGCCG ATGGTCGTTA TGTGCTCGTC AGCCTTCAGG ATAAGAGTTA TCAGGGCCGC AGTGAGCACG ACATCATGCC CAACTATCTG GGTAAGGATG GCTATGTGGA TGCGGTGCCT GCCCGCGCGC GGGTAGCGGA AGATAATCCA CCGGGTCAGC GTCTGGTATT GATTGATCTT AAAGACAAAC AAAAGCATGA CATCAGTATC GAAGGTCTGC AGGGCTTTGA TGAGGATGTG CTCGCCGCGG TGAAGGCCGA CAACGCCAAA GCCCGGGGCG AAAGCTATCA AAGCGAGAAA GCGCCCCGCA AAGTGCAGCT GATGCAGGAT TGGGGCTGGA GCCAGAGTGC CATCCAGTGG AGTGATGATG GCAAGCTGCT GGTTATGCTC GAAGCGGTGG ATAATAAAGA CCGCTGGATT GCATCTGTGA ATCTGGACTC CGGCAAGCTA ACCACAGAGC ATCGTCTCCA TGACGACGCC TGGGTGAACT ACGACTACAA CCAGTTTGGT TGGCTCGAAG ATGGGCGTTT CTTTTTCCTG TCAGAAGAGA CTGGCTATTC CCAGCTGTAC ATCAAGGCGC CGGGCAAGGC TGCAACGGCA CTGACATCGG GTCAATTCGT GGTCAGTGAT GTCACCTTAA GCCCCGATGG CAAGCATCTG TATTACAAGG CCAATAAAGA TCATCCCGGC GTTTACAATG TGTACCGCGT CGCTCTGGCC GATGGCAAAG ACGAGCAATT GACCCATTGG GAAGGCACGC TGGATTACAG TTTGAGCCCG GATGGTAAAC GCTTGCTGCT GACTGCCTCC AGCCGTATTC AGCCTGAAGA GCTTTATGTG CAGGAGATTG GTGGCGAACT CAAGCGCCTG ACCCACTACA CCTCAGAGGT CTTTGCCAAG TATACCTGGC AAGCGCCCAA CGTGGTGGCC GTGCCGTCCA ATCATGGTGC CGGTGTTGTT TACGCCAGAG TGTATCTGCC CGAGGGGTAC GATGCTGCCA AGGCAGACAA GTATCCAGCG 
GTGATTTTTA ACCATGGTGC CGGGTATCTG CAAAACGCGC ACTTTGGTTT CTCCGGCTAT TTCCGCGAGT TTATGTTCCA CAACCTGCTG GCCCAGCAAG GTTATGTGGT GATGGACATG GACTATCGTG GCTCCAAGGG GTATGGCCGT GACTGGCGCA CCGCCATCTA TCGTCAGATG GGTCACCCTG AGGTGGAGGA TCTCAAAGAC GGCGTGGCCT GGATGGTGAA AAATGCCAAT GTGGACAGCA AACGTGTGGG CACTTACGGC GGATCCTATG GTGGCTTCCT CACCTTTATG GCGTTGTTCC GCGAGCCTGA GCTCTTCCAG GCTGGTGCAG CGCTGCGTCC GGTAACCGAC TGGGCACATT ATAATGCGCC TTATACCTCC AACATCCTCA ATACACCGGA TGTGGACCCC ATTGCCTTTG AACGCAGCTC ACCCATCTAT CATGCCGAGG GGCTGAACAA ACCGCTGCTT ATCATGAGCG GTGTGCTGGA TGATAACGTG TTCTTCCAGG ACAGTGTGCG CCTGGTACAG CGTTTGATTG AGCTTGAAAA GCCCATGTTC GAAACCGCTA TCTACCCGGT GGAGCCCCAT GGTTTCCGAG AGCCGTCCAG TTGGCTCGAT GAATACCGCC GCATTTATAA GCTGTTTGAA GCCGAGCTCA AGTAA
|
Protein sequence | MNPVYRHFAV SALALAILSG CAATEHAATA PSAANQTPVS QVELATPPVA SEHLTMKQIM ANPDWMGIFA RNPYWSDDAG SVLFERQPSA SALKSYYVQA LDGEARELSL GELHQVSQRS GVLDKAKKRK AYIYQGNLFV KELSTGEVAQ LTRTNTPVDG VRFLDNGDLA FWQGNAVFRL HANGMVEQLA DIRMSNAPKG VKAPEAYLAK EQAKLIQYVA AQHDKAKARE DYKKALAEAD TTMTAQPWYL GESEVVVELS LSADGRYVLV SLQDKSYQGR SEHDIMPNYL GKDGYVDAVP ARARVAEDNP PGQRLVLIDL KDKQKHDISI EGLQGFDEDV LAAVKADNAK ARGESYQSEK APRKVQLMQD WGWSQSAIQW SDDGKLLVML EAVDNKDRWI ASVNLDSGKL TTEHRLHDDA WVNYDYNQFG WLEDGRFFFL SEETGYSQLY IKAPGKAATA LTSGQFVVSD VTLSPDGKHL YYKANKDHPG VYNVYRVALA DGKDEQLTHW EGTLDYSLSP DGKRLLLTAS SRIQPEELYV QEIGGELKRL THYTSEVFAK YTWQAPNVVA VPSNHGAGVV YARVYLPEGY DAAKADKYPA VIFNHGAGYL QNAHFGFSGY FREFMFHNLL AQQGYVVMDM DYRGSKGYGR DWRTAIYRQM GHPEVEDLKD GVAWMVKNAN VDSKRVGTYG GSYGGFLTFM ALFREPELFQ AGAALRPVTD WAHYNAPYTS NILNTPDVDP IAFERSSPIY HAEGLNKPLL IMSGVLDDNV FFQDSVRLVQ RLIELEKPMF ETAIYPVEPH GFREPSSWLD EYRRIYKLFE AELK
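
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a hedged sketch rather than the pipeline that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons; table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base T, C, A, G; last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGAACCCGG TTTACCGTCA CTTTGCTGTC"))
```

Passing the full gene sequence to `translate` and `gc_percent` should allow the listed protein sequence and GC content to be checked against the record.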