Gene Csal_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1115 
Symbol 
ID4029053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1268446 
End bp1269627 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content70% 
IMG OID637966292 
Productisoaspartyl dipeptidase 
Protein accessionYP_573170 
Protein GI92113242 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR01975] isoaspartyl dipeptidase IadA 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.201525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCCG GCAAGGCGCG TCAGGCCGCG CCCATGACCT TGTTGCGCGT GGCTGAGGTG 
TTCGCCCCCG AACGGCTGGC GGCCACCGAC ATTCTCATGG CGGGCGGACG GATCGTCGCC
CTGGGACAGG GGCTCGCCGT TCCCCAGGGC TGGCCGGTCC AGGTGGTGGA CGCGCGCCAC
CTGATCGCCG TGCCCGCCTT CATCGATCAG CACGTGCATG TCACCGGGGG CGGTGGCGAA
GGCGGTTGCG GCACCCGCTG CCCGCCGATC ACCACCCGCG ACATCGTGGC GATGGGCATC
GGCACGGTGG TGGGCGTGCT GGGGACCGAC AGCATCAGCC GCTCGCCGGC CGACCTGCTG
GCCGCGGTGC GAGGACTGGC GGCCGACGGG CTGGCCGCCT ACATGTACAC GGGCGCGTAT
CGAGTGCCGG CGCCGACCCT CACCGGCGAT ATCCAGCGCG ACCTGGCCTG GATTCCCGAG
GTCATCGGGG TGGGCGAAAT CGCCATTTCG GATCACCGCT CTAGCCAGCC GCGCCAGGAC
GAGCTCGAAC GGCTGGTGAG CGACGCGCGG GTCGGCGCCA TGCTGGCCGG CAAGCGGGGC
ATCTGTCATT TCCATCTCGG CGACGGCAAG CGGGGCCTGG AGCCTCTGCG GCGCTTGCTG
ACGGAGACCG AGATTCCGGC GGATCAAGTG ATCCCCACCC ACGTCAATCG TCGTGCCGAG
CTGCTCGAGG AGGCGGCGGA GTATGCGCTG GCATTCGATG CCAGCGTGGA CGTCACCGCG
TTCGAGGATG CCGGCGACGG GCTTTCCGCG TTCGATGCGG TGTCGCGCCT GCTGGCGCGT
GGCGTGTCGC CGGCACGCAT CACCCTGAGT TCGGACTGCA ATGGCAGCCT GCCGGAATTC
GATGCCGATG GCGCCTACGT GGGCATGCAG GTGGCGCGCA ATACCACGTT GATCGCGGAT
TGGCGCCGAT TGGTCCATGC CCGGGTGCTG CCGCTCGAAT CGGCGCTGGG GCTGCTCGCC
GGCAATGTCG CGCGCGTGCT GGGGCTGGCC GACAAGGGCC GCCTGGCGGT GGGCAGCGAT
GCCGACATCA CGCTGCTCGA CAAGGCCCTG CAGCCCCAGC GCACGTTCGT CGCGGGGCGC
TGTCTGTACG GCGCCGTCGA CCATCACGAG ACCGCGCGGT GA
 
Protein sequence
MMSGKARQAA PMTLLRVAEV FAPERLAATD ILMAGGRIVA LGQGLAVPQG WPVQVVDARH 
LIAVPAFIDQ HVHVTGGGGE GGCGTRCPPI TTRDIVAMGI GTVVGVLGTD SISRSPADLL
AAVRGLAADG LAAYMYTGAY RVPAPTLTGD IQRDLAWIPE VIGVGEIAIS DHRSSQPRQD
ELERLVSDAR VGAMLAGKRG ICHFHLGDGK RGLEPLRRLL TETEIPADQV IPTHVNRRAE
LLEEAAEYAL AFDASVDVTA FEDAGDGLSA FDAVSRLLAR GVSPARITLS SDCNGSLPEF
DADGAYVGMQ VARNTTLIAD WRRLVHARVL PLESALGLLA GNVARVLGLA DKGRLAVGSD
ADITLLDKAL QPQRTFVAGR CLYGAVDHHE TAR