Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sama_2025 |
Symbol | |
ID | 4604275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 2460445 |
End bp | 2462355 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639781402 |
Product | prolyl oligopeptidase family protein |
Protein accession | YP_927900 |
Protein GI | 119775160 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00251963 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGT TTCTGTTAAG CCTTACCGCA GTTACCTGCA TGGCATCACA GCCACTCTAT GCGGAGCCAG TGACTGCGCA GTCACAAGCC GTGAGCACTA TTGAAGCCTT TTCTACTGCG CCTCTTATTC GCAGTGTTGC TGTTTCAAGC GACGGAACCA AAGTAGCCAT CATCCGGGCC ACGTCGAAGG AAGGTGACTA TGTGATTGAA ATCCGCGACA CGAGCAAACT CGATGCCCAA CCAATCGTTT TGGGAGCAAA TCGTATGATG GTTTCGGGTG TCGTGTGGCT TAACAATGAC AAGATTGGCG TAATGTTCCG TCAGATCTTG AAAGATGGTG CCAGAAGGTA TTGGGTAAAC CGATTTGCAA TTACCGATGC AGATGGGAAA GGAGATTGGT TAATTCCGTT TCAAAACAAT CCCCGCGCAG GTTTTTCTAT CCTGAATACC TTGCCTGATA ACAAGGATGA AATCCTGGTT GAAACCGACC TGAACAACAA CTACAAGCCT GATGTTGTCC GTTTTAACGT AAATAATGGT CGCACAGTCT CAGTTCTTCG AGGCAGTGAC AAAATTTCTG GTGGTTATAT CTCAGATGCA GATGGTGATG TACGCGCTGC GACAGGCTGG AATTTGTCAG ACAATGCCAT CGACCTTTAT GCCAGAGCGA AGGGCGATAG TGACTGGAAG CTCGTGAAAC AGATTTCACC CAAATCCAGA GAAAGTTACA GCTTTATCGG CTTTTCAAAG GAAAATCCCG ATGAAATTTA TGTCAACGCC AACCAAGGGC AGGACAAAAC CGGAATCTAC CTTTACAACA TTAAAACGGG CCAGTATTCC GAACGTCTAT TCGGTTTGGA AAACGTCGAT GCGGATGGCG TGATCCAGAA AAAAGACGGT AGCTTGGTTG GATTTGGTTA TTCAGCAAAA TGGCCTCAGC GCTATCTTAC CGATGCCAAT GAACAAGCAA TTTATGATGG ATTGGATAAA CTGTTTGAAG GCAAGTTTGT TTCCATTATT AGCCGTTCTA ATGATGACTC AGTCATGGTG GTCAGTGCGT CATCCGACAA GGATCCTGGG ACTTATTACC TTGTTTTAAA CAAAAGTAAA ATTGAAACCA TCGGCAGTAA GTTGCCCTTT GTCGACCAAG AAAAACTTGG TAGCGTCAAA TACATCTCTT ACAAAGCCCG TGATGGCCGT AAAGTCTACG CATATGTCAC TCTTCCAAAA GGAAAAGGAC CGTTTCCTGC GGTAGTATTG CCACATGGTG GCCCTTGGGT TCGAGACACA ATCGTATTTG ACGATTGGGC GCAATTACTG GCTTCCAATG GTTATGTTGT TATCCAGCCC AACTACCGAG GTTCAACCGG TTATGGTATC GAGCACTGGA CTGCGGGTGA CAACAATTGG GGCCTGAAAA TGCAGGACGA CCTTGATGAT GCTGCCATGT ATCTGGTTGA AAAAGGACTG GCCACCAAGG ACAAGTTGGC CATGTTTGGT TGGAGTTATG GTGGTTACGC CGCTTTCGCC GCGTCAATGC GCGATAATAA CATTTACCAG TGTACTGTTG CCGGAGCGGG TGTCAGCGAT TTAAGTAAAA TCAATGCAAC CTTGAACGAA AATCGGTTCT TAAGCCGTCT GCAACGTCCA ACAATCACGG GTGTTTCACC TCTTTCACAG GTTGAGAAAG TCAATGTGCC AATACTCGTG GTACACGGTG ATATCGATGG ACGTGTGCCA GTTGCCCACA GCCGTGAGTT TGTAGAAAAA CTCAAGGATT TGAAAAAGGA TCATAAATAC GTTGAGCTGG TGGACGCTGA TCACTTTTCA GACACTTTGT TTTATGAACA TAAAATGGCG TTTTACTCAG AACTTATTGA TTGGTTAGAC AATAAGTGCG GTCTGAAGTA A
|
Protein sequence | MKKFLLSLTA VTCMASQPLY AEPVTAQSQA VSTIEAFSTA PLIRSVAVSS DGTKVAIIRA TSKEGDYVIE IRDTSKLDAQ PIVLGANRMM VSGVVWLNND KIGVMFRQIL KDGARRYWVN RFAITDADGK GDWLIPFQNN PRAGFSILNT LPDNKDEILV ETDLNNNYKP DVVRFNVNNG RTVSVLRGSD KISGGYISDA DGDVRAATGW NLSDNAIDLY ARAKGDSDWK LVKQISPKSR ESYSFIGFSK ENPDEIYVNA NQGQDKTGIY LYNIKTGQYS ERLFGLENVD ADGVIQKKDG SLVGFGYSAK WPQRYLTDAN EQAIYDGLDK LFEGKFVSII SRSNDDSVMV VSASSDKDPG TYYLVLNKSK IETIGSKLPF VDQEKLGSVK YISYKARDGR KVYAYVTLPK GKGPFPAVVL PHGGPWVRDT IVFDDWAQLL ASNGYVVIQP NYRGSTGYGI EHWTAGDNNW GLKMQDDLDD AAMYLVEKGL ATKDKLAMFG WSYGGYAAFA ASMRDNNIYQ CTVAGAGVSD LSKINATLNE NRFLSRLQRP TITGVSPLSQ VEKVNVPILV VHGDIDGRVP VAHSREFVEK LKDLKKDHKY VELVDADHFS DTLFYEHKMA FYSELIDWLD NKCGLK
|
| |