Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1115 |
Symbol | |
ID | 4029053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1268446 |
End bp | 1269627 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637966292 |
Product | isoaspartyl dipeptidase |
Protein accession | YP_573170 |
Protein GI | 92113242 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR01975] isoaspartyl dipeptidase IadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.201525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTCCG GCAAGGCGCG TCAGGCCGCG CCCATGACCT TGTTGCGCGT GGCTGAGGTG TTCGCCCCCG AACGGCTGGC GGCCACCGAC ATTCTCATGG CGGGCGGACG GATCGTCGCC CTGGGACAGG GGCTCGCCGT TCCCCAGGGC TGGCCGGTCC AGGTGGTGGA CGCGCGCCAC CTGATCGCCG TGCCCGCCTT CATCGATCAG CACGTGCATG TCACCGGGGG CGGTGGCGAA GGCGGTTGCG GCACCCGCTG CCCGCCGATC ACCACCCGCG ACATCGTGGC GATGGGCATC GGCACGGTGG TGGGCGTGCT GGGGACCGAC AGCATCAGCC GCTCGCCGGC CGACCTGCTG GCCGCGGTGC GAGGACTGGC GGCCGACGGG CTGGCCGCCT ACATGTACAC GGGCGCGTAT CGAGTGCCGG CGCCGACCCT CACCGGCGAT ATCCAGCGCG ACCTGGCCTG GATTCCCGAG GTCATCGGGG TGGGCGAAAT CGCCATTTCG GATCACCGCT CTAGCCAGCC GCGCCAGGAC GAGCTCGAAC GGCTGGTGAG CGACGCGCGG GTCGGCGCCA TGCTGGCCGG CAAGCGGGGC ATCTGTCATT TCCATCTCGG CGACGGCAAG CGGGGCCTGG AGCCTCTGCG GCGCTTGCTG ACGGAGACCG AGATTCCGGC GGATCAAGTG ATCCCCACCC ACGTCAATCG TCGTGCCGAG CTGCTCGAGG AGGCGGCGGA GTATGCGCTG GCATTCGATG CCAGCGTGGA CGTCACCGCG TTCGAGGATG CCGGCGACGG GCTTTCCGCG TTCGATGCGG TGTCGCGCCT GCTGGCGCGT GGCGTGTCGC CGGCACGCAT CACCCTGAGT TCGGACTGCA ATGGCAGCCT GCCGGAATTC GATGCCGATG GCGCCTACGT GGGCATGCAG GTGGCGCGCA ATACCACGTT GATCGCGGAT TGGCGCCGAT TGGTCCATGC CCGGGTGCTG CCGCTCGAAT CGGCGCTGGG GCTGCTCGCC GGCAATGTCG CGCGCGTGCT GGGGCTGGCC GACAAGGGCC GCCTGGCGGT GGGCAGCGAT GCCGACATCA CGCTGCTCGA CAAGGCCCTG CAGCCCCAGC GCACGTTCGT CGCGGGGCGC TGTCTGTACG GCGCCGTCGA CCATCACGAG ACCGCGCGGT GA
|
Protein sequence | MMSGKARQAA PMTLLRVAEV FAPERLAATD ILMAGGRIVA LGQGLAVPQG WPVQVVDARH LIAVPAFIDQ HVHVTGGGGE GGCGTRCPPI TTRDIVAMGI GTVVGVLGTD SISRSPADLL AAVRGLAADG LAAYMYTGAY RVPAPTLTGD IQRDLAWIPE VIGVGEIAIS DHRSSQPRQD ELERLVSDAR VGAMLAGKRG ICHFHLGDGK RGLEPLRRLL TETEIPADQV IPTHVNRRAE LLEEAAEYAL AFDASVDVTA FEDAGDGLSA FDAVSRLLAR GVSPARITLS SDCNGSLPEF DADGAYVGMQ VARNTTLIAD WRRLVHARVL PLESALGLLA GNVARVLGLA DKGRLAVGSD ADITLLDKAL QPQRTFVAGR CLYGAVDHHE TAR
|
| |