Gene Csal_0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0399 
Symbol 
ID4025993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp444149 
End bp446233 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content65% 
IMG OID637965548 
Productoligopeptidase A 
Protein accessionYP_572460 
Protein GI92112532 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.905041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCATT CACACGTTCA TATCGAGGTG CCCATGTCAC GCAACCCCTT GCTCGAATCG 
CATGTCCTGC CGCCGTTCGA TGACATCCAG CCCGAGCATG TGGTGCCGGC CATCGAGCAG
CTCCTGGCGG AGAATCGCCG TGACATCGAG GCCCTCGCCC AGCAGTCGCA GATCAGTTGG
GAAAGCCTCG CCGCACCGTT GGAGGCCCTC AACGACCGGC TCTCCCAAGC CTGGTCGCCC
GTGTCGCACC TCAATTCCAC CATGAACAAC GAGGCGCTGC GCGAGGCATA TCAGGCCTGT
CTCGCCATGC TGTCCGACTA CAGCACCTGG CTCGGCCAGC ACCAGGGACT GTTCGAGGCC
TTCACGCGCC TCAAGGAAAG CGACGAATAC GCACGTCTCG AGGAGGGCCA GCAGCGCTCC
ATCGACAATA CCCTGCGCGA TTTCCGTCTC GCCGGTGTCG ATCTTCCCGA GGACCGGAAA
CGCCGCTATG GCGAGATTCA AGCGCGCCTG TCGGAACTGG CCAACACGTT CTCCAATCAT
GTGCTCGACG CCACCCAGGC ATGGCACCTC GACCTGACCG ACGACACGCG CCTTGGGGGC
CTGCCCGACA GCGCCCTGGC CACGCTCAAG GCCAATGCCG AGGCCAAGGG CGTCGACGGT
TACCGCATCA CGCTCGACTT CCCCAGCTTC TACCCGGTGC TCTCCTTCGC CGACGACCGC
GCGCTGCGTG AAGAAGTCTA TACCGCCTTC GTGACCCGTG CCTCGGACAA GGGCCCGCAC
GCCGGGCGTT TCGACAACGC CCCGATCATC GAGGAAACGC TGCGTCTGCG TCGCGAACTG
GCCGAGCTGC TCGGCTTCGA CACCTATGCC GACTATTCGC TGGCCACCAA GATGGCCGAC
TCGCCCCAGC AGGTACTGGG CTTCCTCGGT GACCTGGCCG ATCGTGCGCA CCCCCAGGCC
CAGCGCGAAT TCAACGAGCT GGAGGCCTTC GCCCGCGAAT CGCTGGGACT CGAGACGCTG
AAGCCGTGGG ATATCGGCTA TGTCAGCGAG AAACTGCGCG AGGCACGCTA TGCCATCTCC
CAGGAACAGC TGCGCCCCTA CTTCCCGGCC CCCAGGGTGA TCGAGGGACT GTTCCAGGTT
ACCGGCACCC TGTACGGCAT CGATTTCGCC GAACGTGACG ACGTGCCGCG CTATCACCCG
GACGTGCGCT ACTTCGAGAT CCTCGACGGT GATACGCCCA TCGCCGGGTT CTACCTCGAC
CTGTATGCCC GCGAAGGCAA GCGTGGCGGT GCCTGGATGG ACGAATGCCG TGTGCGGCGC
ACCCGCGAGG ACGGCAGCCT GCAGCTGCCC GTCGCCTACC TGACCTGCAA CTTCACGCGC
CCCGTGGGCG GCAAGCCCGC CCTGCTCACG CACGACGAGG TGCTGACGCT CTTCCACGAG
TTCGGGCATG GCCTGCACCA CATGCTGACG CGGCAGACCG TCGCCGATGT CTCCGGCATC
AATGGCGTCG CCTGGGATGC CGTCGAGCTG CCCAGCCAGT TCATGGAAAA CTTCTGCTGG
GAGCGCGAGG GGCTGGACAT GATCGCCGCT CATGTGGATA CCGGCGAAAA ACTGCCTGAC
GCCCTGCTCG ACAAGCTGCA GGCCGCACGC AACTTCCAGT CGGCCATGGG CATGATGCGC
CAGCTCGAAC TGTCCCTGTT CGACTTCCGC CTGCATCATG AAAGCCAGGC GCCCAGTGCC
GACGAGGTCC AGGCCCTGCT CGACGACGTG CGCGACAAGA CATCCGTCAC GCCGCGCGTC
GACTTCAACC GTTTCCAGAA CGGCTTCGGC CATATCTTCG CCGGCGGTTA TGCCGCAGGC
TATTACAGCT ACAAATGGGC CGAAGTCCTC TCGGCGGATG CCTACAGCGC CTTCGAGGAA
GCCGGCATCT TCGACACGGC GACGGGCCAG CGCTTCCGTC AGGAAATTCT CGAACGGGGC
GGTTCGCGCG ACGCCGCCGC CTTGTTCGAA GCCTTTCGGG GACGTGCACC GAGCATCGAA
CCGCTACTGC GCCATTCCGG CATCGAGAGC GCCGAGGCGG CCTGA
 
Protein sequence
MLHSHVHIEV PMSRNPLLES HVLPPFDDIQ PEHVVPAIEQ LLAENRRDIE ALAQQSQISW 
ESLAAPLEAL NDRLSQAWSP VSHLNSTMNN EALREAYQAC LAMLSDYSTW LGQHQGLFEA
FTRLKESDEY ARLEEGQQRS IDNTLRDFRL AGVDLPEDRK RRYGEIQARL SELANTFSNH
VLDATQAWHL DLTDDTRLGG LPDSALATLK ANAEAKGVDG YRITLDFPSF YPVLSFADDR
ALREEVYTAF VTRASDKGPH AGRFDNAPII EETLRLRREL AELLGFDTYA DYSLATKMAD
SPQQVLGFLG DLADRAHPQA QREFNELEAF ARESLGLETL KPWDIGYVSE KLREARYAIS
QEQLRPYFPA PRVIEGLFQV TGTLYGIDFA ERDDVPRYHP DVRYFEILDG DTPIAGFYLD
LYAREGKRGG AWMDECRVRR TREDGSLQLP VAYLTCNFTR PVGGKPALLT HDEVLTLFHE
FGHGLHHMLT RQTVADVSGI NGVAWDAVEL PSQFMENFCW EREGLDMIAA HVDTGEKLPD
ALLDKLQAAR NFQSAMGMMR QLELSLFDFR LHHESQAPSA DEVQALLDDV RDKTSVTPRV
DFNRFQNGFG HIFAGGYAAG YYSYKWAEVL SADAYSAFEE AGIFDTATGQ RFRQEILERG
GSRDAAALFE AFRGRAPSIE PLLRHSGIES AEAA