Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2732 |
Symbol | |
ID | 4028784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3063661 |
End bp | 3064920 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637967940 |
Product | peptidase M24 |
Protein accession | YP_574778 |
Protein GI | 92114850 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | [TIGR02993] ectoine utilization protein EutD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.17461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCTGC CTCCCGTCCG GGAGAACAGG GATGAGCAAT TCTGTCGGGC GAGTGGCTTC ATGTCTCAGG TTTCCCTGCC TTTCACTCGT GAGGAATACG CCAGCCGGCT GCACAAGGTG CGTTCGGCCA TGGCGCATCG AGGGATCGAC GTGCTGATCG TCAGCGATCC GTCCAACATG GCGTGGTTGA CGGGCTACGA TGGCTGGTCC TTTTATGTCC ATCAGTGCGT GCTGCTCGGC CTCGAAGGCG AGCCGGTCTG GTACGGGCGT CGCATGGATG CCAACGGTGC GCTGCGCACC TGCTGGATCC ATCCGGACAA CATCACCTAT TACCCGGATT ATTACGTCCA GAATCCCGAT ATGCATCCCA TGGAATACCT GGCGCAGTCG ATCCTGCCGG ATCGAGGGTG GCACAACGGT ATCATCGGCA TGGAGATGGA TAACTACTAT TTCTCGGCGA AGGCGTACCT GAGCCTCGTG CGTGAGCTGC CGCATGCCCG TTTCGAGGAT GCCAATTCGC TGGTCAACTG GTGCCGGGCG ATCAAGTCGC CGCAGGAAAT CGAGTACATG CGTATCGCGG CGAGGATCGT CGAGGGCATG CATTCGCGCA TCCTCGAGGT GATTCAGCCG GGCCTGCCCA AGAGCAAGCT GGTCTCCGAG ATCTACCGGG TGGGCATCGA GGGCTGGACA GATCCCGATG GTCGCGTCTT CGGTGGCGAT TACCCGGCCA TCGTGCCCAT GCTGCCCACC GGCAAGGATG CTGCCGCGCC GCACCTGACC TGGGACGATA CGCCCTTCCG CCAGGGCGAG GGCACCTTCT TCGAGATCGC CGGGGTCTAC AAGCGCTACC ATGCGCCGAT GTCGCGCACG GTGTTCCTGG GCACGCCGCC GAGTGCGTTC ATCCGTGCCG AGTCGGCCCT GCTCGAGGGG ATCGAGAACG GCCTGGCGGT GGCCAAGCCG GGCAATCGCA CCGCCGACAT CGCCATGGCG CTGGGCGCGG CGATGGACAA GTATGGCTTC GACCGGGGCG GGGCGCGTTG CGGCTACTCG ATCGGCATCA GCTATCCCCC GGACTGGGGC GAGCGCAACA TGAGCCTGCG CCCATCCGAC GACACCATCC TCGAGCCGGG GATGACGTTC CACTTCATGC CGGGGCTCTG GGAGGACGAC TGGGGCCTGG AAATCACCGA AAGCATTCTG ATTACCGAGA CTGGCTGCGA GACGCTGGCC AACTTCCCCC GCCAGCTCTT CGTGTGCTGA
|
Protein sequence | MVLPPVRENR DEQFCRASGF MSQVSLPFTR EEYASRLHKV RSAMAHRGID VLIVSDPSNM AWLTGYDGWS FYVHQCVLLG LEGEPVWYGR RMDANGALRT CWIHPDNITY YPDYYVQNPD MHPMEYLAQS ILPDRGWHNG IIGMEMDNYY FSAKAYLSLV RELPHARFED ANSLVNWCRA IKSPQEIEYM RIAARIVEGM HSRILEVIQP GLPKSKLVSE IYRVGIEGWT DPDGRVFGGD YPAIVPMLPT GKDAAAPHLT WDDTPFRQGE GTFFEIAGVY KRYHAPMSRT VFLGTPPSAF IRAESALLEG IENGLAVAKP GNRTADIAMA LGAAMDKYGF DRGGARCGYS IGISYPPDWG ERNMSLRPSD DTILEPGMTF HFMPGLWEDD WGLEITESIL ITETGCETLA NFPRQLFVC
|
| |