Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sama_2779 |
Symbol | |
ID | 4605026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | + |
Start bp | 3321415 |
End bp | 3322554 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639782190 |
Product | renal dipeptidase family protein |
Protein accession | YP_928651 |
Protein GI | 119775911 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATC TTCACCGGCG CAGCCTGATA AAAGCCCTCG GCGCCAGCGC CCTCTTATCT GCCCTGCCCA CCGGCGTGCT TGCCGCCAAG TCACTGCGAC CACTCTACAT AGATGGCCTG TCGTTTTTAC CGGACTCTCT GGACGATCTC GCCGCCTCCG GTCTTTCGGC CTACCTGTGC GATATCTCCG CCATCGAAGA AGTAAAACAG GAAGATGGCA CCCTCAACTA CAAGCGCACC TACAACGCCT GCATCAAGTC GATTGCCGAC GCCGGAAAAC GCGTCAGCGA TAACCCGGGG CAGCTGCTGC AGGGGCTTTC AGCCAAAGAC ATCAAAAACG CCCGTGAGTC GGGCCGCACC GCGGTCTTTT TTCAAATTCA GGGGGCAGAC TGCGTAGAAG AACGCCTGTC ACAGGTGGAT GAGTTCTACC AAAAGGGCCT CAGGGTAATG CAGCTCACCC ATCACTATGG CAACAGCTTT GCCGGTGGCG CACTGGACAG CGATGAGCAC GGCGGCCTCA ATCTCCCCCT GAGCCCCAAG GGCTATGCCC TGGTGGATAA GCTCAACGAC AGCGGCATTC TCATCGACCT GAGTCACTCC AGCCCTCAAA CCGCGCTGGA CACCATAGCT GCCTCCCGCA TGCCGGTGGT GCAAAGCCAC GGTGCGGCCC GTGCCATCGT CAACCATGCC CGCTGTTCAC CGGATCAGGT GATCCGCGCC ATCGCAGACA GTGGCGGTGT ATTCGGGACC TTTATGATGA GCTTTTGGCT GACCACCAGC AGCACTCCCA CGGTTGAGCA CTATCTGGCA CAGCTGAAGC ACGTGGCCAG GGTGGGAGGT ATCGACGCGG TCGCCATTGC CAACGACTAT CCCCTGCGCG GCCAGGAAAA CCTGCTCAAA CTCAACAATG ACAACGCCGA AGGGGTGAAG GAGTATCTGG ACTGGTGGCA CAGCCTGCGG GCCAAAAAGG TACTCGGCTT CGACCATGAG CCGGTGCACG TGGTTATCCC CGAACTCAAT CACATTGAGC GCATGAGCCG CATCCACGAT GCCCTCAAGG ATGCAGGCTT CAGCGCCGCT GACGCCGATA AAATCATGGG CGGCAACTGG CAGCGGGTAT TGCAGCAGGT ACTGGTGTAA
|
Protein sequence | MTNLHRRSLI KALGASALLS ALPTGVLAAK SLRPLYIDGL SFLPDSLDDL AASGLSAYLC DISAIEEVKQ EDGTLNYKRT YNACIKSIAD AGKRVSDNPG QLLQGLSAKD IKNARESGRT AVFFQIQGAD CVEERLSQVD EFYQKGLRVM QLTHHYGNSF AGGALDSDEH GGLNLPLSPK GYALVDKLND SGILIDLSHS SPQTALDTIA ASRMPVVQSH GAARAIVNHA RCSPDQVIRA IADSGGVFGT FMMSFWLTTS STPTVEHYLA QLKHVARVGG IDAVAIANDY PLRGQENLLK LNNDNAEGVK EYLDWWHSLR AKKVLGFDHE PVHVVIPELN HIERMSRIHD ALKDAGFSAA DADKIMGGNW QRVLQQVLV
|
| |