Gene Information |
Locus tag | Csal_1047 |
Symbol | |
ID | 4027835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1180318 |
End bp | 1181430 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637966224 |
Product | aminopeptidase DmpA |
Protein accession | YP_573103 |
Protein GI | 92113175 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
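The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1180318
END_BP = 1181430


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1113 370
```

Both results match the Gene Length (1113 bp) and Protein Length (370 aa) rows of the record.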
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0633499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGAA TGAGAGCACG CGAGATGGGG CTGCCGCTGC CCGGCGAACC GGGCTTGAAC AACGCGATTA CGGACGTGCC GGGCGTGTTG GTCGGCTATG AAACGATTGA TGGAACGGCC GAAAACGGAC GGCCCATCAA GACCGGCGTG ACCGCCATTC TGCCGCGCAC CCGAAGCAAC ACGCCTGCTC CTGTCTGGGC CGGCTTTCAT GCGCTCAACG GCAACGGGGA AATGACAGGG ACGCATTGGA TCGAGCAAGG CGGCTATTTC GTCGGCCCGA TATGCCTGAC CAATTCGCAT AGCGTGGGCA TCGTTCATCA CGCCGCGACA CGCTGGATGC TCGATACCTA CGCGCAGACT TTCGACGCAC ACCACCTGTG GGCAATGCCC GTGGTCGCGG AGACCTATGA CGGCGTACTC AATGACATCA ACGGTCAGCA CGTGGAAGCC GCCCATGTCC ACGCCGCCCT CGCCAGCGCA AGTGGCGGCG CCATAGAAGA GGGAAACGTC GGCGGCGGCA ACGGCATGAT CTGTTACGGC TTCAAGGGCG GTACGGGTAC CGCTTCGCGC CGTGTCGGCA TCGATGGACA GGACTACACG TTGGGCGTGC TTGTCCAGGC CAACCACGGC AAGCGGGACT GGTTGAACGT ACTAGGCGTG CCTGTTGGAG AGGCACTGCA TGATGCCGAC TTGCCCGAAG AGCTCAATCG CGAACGCGGC TCCATCATCG CCGTCATCGC GACAGACGCT CCCATGCTGC CCCATCAGCT CAAGCGCCTC GCCCAACGCG CCGGACTGGG TATCGCACGC TCCGGCAGCC CCGGCGGCAA CGATTCAGGC GATATGTTTC TGGCCTTCAG CACGGCGAAC GAGGGTCCCT TGCCCCAGCT CGGGCCGGCC CGGCAACAGA TGCACCACAT GAACGACGAG TATTTCGATG ACTTCTACAT GGCGGTCGTG CAAGCGACGG ACGAAGCCGT CCTCAATGCC ATGTGCATGG CCAGGGGAGC GCCCATGGCA AAGCCGGAGG GCTGGTGCCC AGCTCTCGAT CCGGAACGGC TCGAGCCGTT ACTACGCCGG GCCGGTATCA GCATAGGAGA ACGTAACGAT TGA
|
Protein sequence | MQRMRAREMG LPLPGEPGLN NAITDVPGVL VGYETIDGTA ENGRPIKTGV TAILPRTRSN TPAPVWAGFH ALNGNGEMTG THWIEQGGYF VGPICLTNSH SVGIVHHAAT RWMLDTYAQT FDAHHLWAMP VVAETYDGVL NDINGQHVEA AHVHAALASA SGGAIEEGNV GGGNGMICYG FKGGTGTASR RVGIDGQDYT LGVLVQANHG KRDWLNVLGV PVGEALHDAD LPEELNRERG SIIAVIATDA PMLPHQLKRL AQRAGLGIAR SGSPGGNDSG DMFLAFSTAN EGPLPQLGPA RQQMHHMNDE YFDDFYMAVV QATDEAVLNA MCMARGAPMA KPEGWCPALD PERLEPLLRR AGISIGERND
|
| |
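The relationship between the Gene sequence and Protein sequence rows can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own pipeline: the gene sequence is taken to be the coding strand written 5' to 3', and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the standard genetic code layout (bases in TCAG order).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
print(translate("ATGCAGCGAA TGAGAGCACG CGAGATGGGG"))  # MQRMRAREMG
```

The printed residues match the first block of the Protein sequence row. Passing the full Gene sequence string to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.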